Preface


In the past few decades, the field of quantum condensed matter physics has seen rapid 
and, at times, almost revolutionary development. Undoubtedly, the success of the field 
owes much to ground-breaking advances in experiment: already the controlled fabrication 
of phase coherent electron devices on the nanoscale is commonplace (if not yet routine), 
while the realization of ultra-cold atomic gases presents a new arena in which to explore 
strong interaction and condensation phenomena in Fermi and Bose systems. These, along 
with many other examples, have opened entirely new perspectives on the quantum physics 
of many-particle systems. Yet, important as it is, experimental progress alone does not, 
perhaps, fully explain the appeal of modern condensed matter physics. Indeed, in concert 
with these experimental developments, there has been a “quiet revolution” in condensed 
matter theory, which has seen phenomena in seemingly quite different systems united by 
common physical mechanisms. This relentless “unification” of condensed matter theory, 
which has drawn increasingly on the language of low-energy quantum field theory, betrays 
the astonishing degree of universality, not fully appreciated in the early literature.

On a truly microscopic level, all forms of quantum matter can be formulated as a many- 
body Hamiltonian encoding the fundamental interactions of the constituent particles. How- 
ever, in contrast with many other areas of physics, in practically all cases of interest in 
condensed matter the structure of this operator conveys as much information about the 
properties of the system as, say, the knowledge of the basic chemical constituents tells us 
about the behavior of a living organism! Rather, in the condensed matter environment, 
it has been a long-standing tenet that the degrees of freedom relevant to the low-energy 
properties of a system are very often not the microscopic ones. Although, in earlier times, the
passage between the microscopic degrees of freedom and the relevant low-energy degrees of
freedom remained more or less transparent, in recent years this situation has changed
profoundly. It is a hallmark of many “deep” problems of modern condensed matter physics 
that the connection between the two levels involves complex and, at times, even controversial 
mappings. To understand why, it is helpful to place these ideas on a firmer footing. 

Historically, the development of modern condensed matter physics has, to a large extent, 
hinged on the “unreasonable” success and “notorious” failures of non-interacting theories.
The apparent impotency of interactions observed in a wide range of physical systems
can be attributed to a deep and far-reaching principle of adiabatic continuity: the
quantum numbers that characterize a many-body system are determined by fundamental
symmetries (translation, rotation, particle exchange, etc.). Providing that the integrity
of the symmetries is maintained, the elementary “quasi-particle” excitations of an interacting
system can usually be traced back “adiabatically” to those of the bare particle
excitations present in the non-interacting system. Formally, one can say that the radius 
of convergence of perturbation theory extends beyond the region in which the pertur- 
bation is small. For example, this quasi-particle correspondence, embodied in Landau’s 
Fermi-liquid theory, has provided a reliable platform for the investigation of the wide 
range of Fermi systems from conventional metals to helium-3 fluids and cold atomic Fermi
gases. 

However, being contingent on symmetry, the principle of adiabatic continuity and, with 
it, the quasi-particle correspondence, must be abandoned at a phase transition. Here, inter- 
actions typically effect a substantial rearrangement of the many-body ground state. In the 
symmetry-broken phase, a system may - and frequently does - exhibit elementary exci- 
tations very different from those of the parent non-interacting phase. These elementary 
excitations may be classified as new species of quasi-particle with their own characteristic 
quantum numbers, or they may represent a new kind of excitation - a collective mode - 
engaging the cooperative motion of many bare particles. Many familiar examples fall into 
this category: when ions or electrons condense from a liquid into a solid phase, translational 
symmetry is broken and the elementary excitations - phonons - involve the motion of many 
individual bare particles. Less mundane, at certain field strengths, the effective low-energy 
degrees of freedom of a two-dimensional electron gas subject to a magnetic field (the quan- 
tum Hall system) appear as quasi-particles carrying a rational fraction (!) of the elementary 
electron charge - an effect manifestly non-perturbative in character. 

This reorganization lends itself to a hierarchical perspective of condensed matter already 
familiar in the realm of particle physics. Each phase of matter is associated with a unique 
“non-interacting” reference state with its own characteristic quasi-particle excitations - a 
product only of the fundamental symmetries that classify the phase. While one stays within 
a given phase, one may draw on the principle of continuity to infer the influence of inter- 
actions. Yet this hierarchical picture delivers two profound implications. Firstly, within the 
quasi-particle framework, the underlying “bare” or elementary particles remain invisible 
(witness the fractionally charged quasi-particle excitations of the fractional quantum Hall 
fluid!). (To quote from P. W. Anderson’s now famous article “More is different” (Science
177 (1972), 393-6), “the ability to reduce everything to simple fundamental laws does not 
imply the ability to start from those laws and reconstruct the universe.”) Secondly, while 
the capacity to conceive of new types of interaction is almost unbounded (arguably the 
most attractive feature of the condensed matter environment!), the freedom to identify 
non-interacting or free theories is strongly limited, constrained by the space of fundamen- 
tal symmetries. When this is combined with the principle of continuity, the origin of the 
observed “universality” in condensed matter is revealed. Although the principles of adiabatic
continuity, universality, and the importance of symmetries were anticipated and
emphasized long ago by visionary theorists, it is perhaps not until relatively recently that
their mainstream consequences have become visible. 



How can these concepts be embedded into a theoretical framework? At first sight, the 
many-body problem seems overwhelmingly daunting. In a typical system, there exist some 
$10^{23}$ particles interacting strongly with their neighbors. Monitoring the collective dynamics,
even in a classical system, is evidently a hopeless enterprise. Yet, from our discussion above, 
it is clear that, by focussing on the coordinates of the collective degrees of freedom, one 
may develop a manageable theory involving only a restricted set of excitations. The success 
of quantum field theory in describing low-energy theories of particle physics as a successive 
hierarchy of broken symmetries makes its application in the present context quite natural. 
As well as presenting a convenient and efficient microscopic formulation of the many-body 
problem, the quantum field theory description provides a vehicle to systematically identify, 
isolate, and develop a low-energy theory of the collective field. Moreover, when cast as a field 
integral, the quantum field theory affords a classification of interacting systems into a small 
number of universality classes defined by their fundamental symmetries (a phenomenon not 
confined by the boundaries of condensed matter - many concepts originally developed in 
medium- or high-energy physics afford a seamless application in condensed matter). This 
phenomenon has triggered a massive trend of unification in modern theoretical physics. 
Indeed, by now, several sub-fields of theoretical physics have emerged (such as conformal 
field theory, random matrix theory, etc.) that define themselves not so much through any 
specific application as by a certain conceptual or methodological framework. 

In deference to the importance attached to the subject, in recent years a number of 
texts have been written on the subject of quantum field theory within condensed matter. 
It is, therefore, pertinent for a reader to question the motivation for the present text. 
Firstly, the principal role of this text is as a primer aimed at elevating graduate students 
to a level where they can engage in independent research. Secondly, while the discussion 
of conceptual aspects takes priority over the exposure to the gamut of condensed matter 
applications, we have endeavored to keep the text firmly rooted in practical experimental 
application. Thirdly, as well as routine exercises, the present text includes extended problems 
which are designed to provide a bridge from formal manipulations to research-oriented 
thinking. Indeed, in this context, readers may note that some of the “answered” problems 
are deliberately designed to challenge: it is, after all, important to develop a certain degree 
of intuitive understanding of formal structures and, sadly, this can be acquired only by 
persistent and, at times, even frustrating training! 

With this background, let us now discuss in more detail the organization of the text. 
To prepare for the discussion of field theory and functional integral techniques we begin in 
Chapter 1 by introducing the notion of a classical and a quantum field. Here we focus on 
the problem of lattice vibrations in the discrete harmonic chain, and its “ancestor” in the 
problem of classical and quantum electrodynamics. The development of field integral meth- 
ods for the many-body system relies on the formulation of quantum mechanical theories in 
the framework of the second quantization. In Chapter 2 we present a formal and detailed 
introduction to the general methodology. To assimilate this technique, and motivate some 
of the examples discussed later in the text, a number of separate and substantial appli- 
cations are explored in this chapter. In the first of these, we present (in second-quantized 
form) a somewhat cursory survey of the classification of metals and insulators, identifying a 
canonical set of model Hamiltonians, some of which form source material for later chapters.
In the case of the one-dimensional system, we will show how the spectrum of elementary 
collective excitations can be inferred using purely operator methods within the framework 
of the bosonization scheme. Finally, to close the chapter, we will discuss the application of 
the second quantization to the low-energy dynamics of quantum mechanical spin systems. 
As a final basic ingredient in the development of the quantum field theory, in Chapter 3 we 
introduce the Feynman path integral for the single-particle system. As well as represent- 
ing a prototype for higher-dimensional field theories, the path integral method provides a 
valuable and recurring computational tool. This being so, we have included in this chapter 
a pedagogical discussion of a number of rich and instructive applications which range from 
the canonical example of a particle confined to a single or double quantum well, to the 
tunneling of extended objects (quantum fields), quantum dissipation, and the path integral 
formulation of spin. 

Having accumulated all of the necessary background, in Chapter 4 we turn to the formula- 
tion and development of the field integral of the quantum many-particle system. Beginning 
with a discussion of coherent states for Fermi and Bose systems, we develop the many- 
body path integral from first principles. Although the emphasis in the present text is on 
the field integral formulation, the majority of early and seminal works in the many-body 
literature were developed in the framework of diagrammatic perturbation theory. To make 
contact with this important class of approximation schemes, in Chapter 5 we explore the 
way diagrammatic perturbation series expansions can be developed systematically from the 
field integral. Employing the $\phi^4$-theory as a canonical example, we describe how to explore
the properties of a system in a high order of perturbation theory around a known refer- 
ence state. To cement these ideas, we apply these techniques to the problem of the weakly 
interacting electron gas. 

Although the field integral formulation provides a convenient means to organize pertur- 
bative approximation schemes as a diagrammatic series expansion, its real power lies in 
its ability to identify non-trivial reference ground states, or “mean-fields,” and to provide 
a framework in which low-energy theories of collective excitations can be developed. In 
Chapter 6, a fusion of perturbative and mean-field methods is used to develop analyti- 
cal machinery powerful enough to address a spectrum of rich applications ranging from 
metallic magnetism and superconductivity to superfluidity. To bridge the gap between the 
(often abstract) formalism of the field integral, and the arena of practical application, it is 
necessary to infer the behavior of correlation functions. Beginning with a brief survey of con- 
cepts and techniques of experimental condensed matter physics, in Chapter 7 we highlight 
the importance of correlation functions and explore their connection with the theoretical 
formalism developed in previous chapters. In particular, we discuss how the response of 
many-body systems to various types of electromagnetic perturbation can be described in 
terms of correlation functions and how these functions can be computed by field theoretical 
means. 

Although the field integral is usually simple to formulate, its properties are not always 
easy to uncover. Amongst the armory of tools available to the theorist, perhaps the 
most adaptable and versatile is the method of the renormalization group. Motivating 
our discussion with two introductory examples drawn from a classical and a quantum
theory, in Chapter 8 we become acquainted with the renormalization group method as a 
concept whereby nonlinear theories can be analyzed beyond the level of plain perturbation 
theory. With this background, we then proceed to discuss renormalization methods in more 
rigorous and general terms, introducing the notion of scaling, dimensional analysis, and the 
connection to the general theory of phase transitions and critical phenomena. To conclude 
this chapter, we visit a number of concrete implementations of the renormalization group
scheme, introduced and exemplified through a number of canonical applications.

In Chapter 9, we turn our attention to low-energy theories with non-trivial forms of 
long-range order. Specifically, we will learn how to detect and classify topologically non-trivial
structures, and to understand their physical consequences. In particular, we explore the
impact of topological terms (i.e. $\theta$-terms, Wess-Zumino terms, and Chern-Simons terms)
on the behavior of low-energy field theories solely through the topology of the underlying 
field configurations. Applications discussed in this chapter include persistent currents, ’t 
Hooft’s $\theta$-vacua, quantum spin chains, and the quantum Hall effects.

So far, our development of field theoretic methodologies has been tailored to the consid- 
eration of single-particle quantum systems, or many-body systems in thermal equilibrium. 
However, studies of classical nonequilibrium systems have a long and illustrious history, 
dating back to the earliest studies of thermodynamics, and these days include a range of 
applications from soft matter physics to population dynamics and ecology. At the same time, 
the control afforded by modern mesoscopic semiconducting and metallic devices, quantum 
optics, as well as ultracold atom physics now allows controlled access to quantum systems
driven far from equilibrium. For such systems, traditional quantum field theoretical method- 
ologies are inappropriate. 

Starting with the foundations of non-equilibrium statistical mechanics, from simple one- 
step processes, to reaction-diffusion type systems, in Chapter 10 we begin by developing 
Langevin and Fokker-Planck theory, from which we establish classical Boltzmann trans- 
port equations. We then show how these techniques can be formulated in the language of 
the functional integral, developing the Doi-Peliti and Martin-Siggia-Rose techniques. We
conclude our discussion with applications to nonequilibrium phase transitions and driven 
lattice gases. These studies of the classical nonequilibrium system provide a platform to 
explore the quantum system. In Chapter 11, we develop the Keldysh approach to quantum 
non-equilibrium systems based, again, on the functional integral technique. In particular, 
we emphasize and exploit the close connections to classical nonequilibrium field theory, and 
present applications to problems from the arena of quantum transport. 

To focus and limit our discussion, we have endeavored to distill material considered 
“essential” from the “merely interesting” or “background.” To formally acknowledge and 
identify this classification, we have frequently included reference to material which we believe 
may be of interest to the reader in placing the discussion in context, but which can be 
skipped without losing the essential thread of the text. These intermissions are signaled in 
the text as “Info” blocks. 

At the end of each chapter, we have collected a number of pedagogical and instructive 
problems. In some cases, the problems expand on some aspect of the main text requiring only 
an extension, or straightforward generalization, of a concept raised in the chapter. In other
cases, the problems rather complement the main text, visiting fresh applications of the same 
qualitative material. Such problems take the form of case studies in which both the theory 
and the setting chart new territory. The latter provide a vehicle to introduce some core areas 
of physics not encountered in the main text, and allow the reader to assess the degree to 
which the ideas in the chapter have been assimilated. For both types of question, to make
the problems more inclusive and useful as a reference, we have included (sometimes abridged,
and sometimes lengthy) answers. In this context, Section 6.5 assumes a somewhat special 
role: the problem of phase coherent electron transport in weakly disordered media provides 
a number of profoundly important problems of great theoretical and practical significance. 
In preparing this section, it became apparent that the quantum disorder problem presents 
an ideal environment in which many of the theoretical concepts introduced in the previous 
chapters can be practiced and applied - to wit diagrammatic perturbation theory and series 
expansions, mean-field theory and collective mode expansions, correlation functions and 
linear response, and topology. We have therefore organized this material in the form of an 
extended problem set in Chapter 6. 

This concludes our introduction to the text. Throughout, we have tried to limit the range 
of physical applications to examples which are rooted in experimental fact. We have resisted 
the temptation to venture into more speculative areas of theoretical condensed matter at 
the expense of excluding many modern and more-circumspect ideas which pervade the con- 
densed matter literature. Moreover, since the applications are intended to help motivate and 
support the field theoretical techniques, their discussion is, at times, necessarily superficial. 
(For example, the hundreds of pages of text in this volume could have been invested in their
entirety in the subject of superconductivity!) Therefore, where appropriate, we have tried 
to direct interested readers to the more specialist literature. 

In closing, we would like to express our gratitude to Jakob Müller-Hill, Tobias Micklitz,
Jan Muller, Natalja Strelkova, Franjo-Frankopan Velic, Andrea Wolff, and Markus Zowislok 
for their invaluable assistance in the proofreading of the text. Moreover, we would also like 
to thank Julia Meyer for her help in drafting problems. Finally we would like to acknowledge 
Sasha Abanov for his advice and guidance in the drafting of the chapter on Topology. 

As well as including additional material on the formulation of functional field integral 
methods to classical and quantum nonequilibrium physics in Chapters 10 and 11, in prepar- 
ing the second edition of the text, we have endeavored to remove some of the typographical 
errors that crept into the first edition. Although it seems inevitable that some errors will 
still have escaped identification, it is clear that many many more would have been missed 
were it not for the vigilance of many friends and colleagues. In this context, we would partic- 
ularly like to acknowledge the input of Piet Brouwer, Christoph Bruder, Chung-Pin Chou, 
Jan von Delft, Karin Everschor, Andrej Fischer, Alex Gezerlis, Sven Gnutzmann, Colin 
Kiegel, Tobias Luck, Patrick Neven, Achim Rosch, Max Schafer, Matthias Sitte, Nobuhiko 
Taniguchi, and Matthias Vojta. 
From particles to fields


To introduce some fundamental concepts of field theory, we begin by considering two simple model 
systems - a one-dimensional “caricature” of a solid, and a freely propagating electromagnetic wave.
As well as exemplifying the transition from discrete to continuous degrees of freedom, these examples 
introduce the basic formalism of classical and quantum field theory, the notion of elementary excitations, 
collective modes, symmetries, and universality - concepts which will pervade the rest of the text. 


One of the more remarkable facts about condensed matter physics is that phenomenology of 
fantastic complexity is born out of a Hamiltonian of comparative simplicity. Indeed, it is not 
difficult to construct microscopic “condensed matter Hamiltonians” of reasonable generality. 
For example, a prototypical metal or insulator might be described by the many-particle 
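Hamiltonian, $H = H_e + H_i + H_{ei}$, where

$$
\begin{aligned}
H_e &= \sum_i \frac{p_i^2}{2m} + \sum_{i<j} V_{ee}(r_i - r_j), \\
H_i &= \sum_I \frac{P_I^2}{2M} + \sum_{I<J} V_{ii}(R_I - R_J), \\
H_{ei} &= \sum_{iI} V_{ei}(R_I - r_i).
\end{aligned}
\tag{1.1}
$$

Here, $r_i$ ($R_I$) denote the coordinates of the valence electrons (ion cores) and $H_e$, $H_i$, and $H_{ei}$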
describe the dynamics of electrons, ions and the interaction of electrons and ions, respec- 
tively (see Fig. 1.1). Of course, the Hamiltonian Eq. (1.1) can be made “more realistic,” for 
example by remembering that electrons and ions carry spin, adding disorder, or introduc- 
ing host lattices with multi-atomic unit-cells. However, for developing our present line of 
thought the prototype H will suffice. 

The fact that a seemingly innocuous Hamiltonian like Eq. (1.1) is capable of generating the 
vast panopticon of metallic phenomenology can be read in reverse order: one will normally 
not be able to make theoretical progress by approaching the problem in an “ab initio”
manner, i.e. by an approach that treats all microscopic constituents as equally relevant 
degrees of freedom. 

How then can successful analytical approaches be developed? The answer to this question 
lies in a number of basic principles inherent in generic condensed matter systems. 



Figure 1.1 A one-dimensional cartoon of a (metallic) solid. Positively charged ions located at
positions $R_I$ are surrounded by a conduction electron cloud (electron coordinates denoted by $r_i$).
While the motion of the ions is massively constrained by the lattice potential $V_{ii}$ (indicated by the
solid line and its harmonic approximation shown dashed), the dynamics of the electrons is affected
by their mutual interaction ($V_{ee}$) and their interaction with the core ions ($V_{ei}$).

1. Structural reducibility: Not all components of the Hamiltonian (1.1) need to be treated 
simultaneously. For example, when the interest is foremost in the vibrational motion of 
the ion lattice, the dynamics of the electron system can often be neglected or, at least, 
be treated in a simplistic manner. Similarly, much of the character of the dynamics of 
the electrons is independent of the ion lattice, etc. 

2. In the majority of condensed matter applications, one is interested not so much in the 
full profile of a given system, but rather in its energetically low-lying dynamics. This is 
motivated partly by practical aspects (in daily life, iron is normally encountered at room 
temperature and not at its melting point), and partly by the tendency of large systems 
to behave in a “universal” manner at low temperatures. Here universality implies that 
systems differing in microscopic detail (e.g. different types of interaction potentials, ion 
species, etc.) exhibit common collective behavior. As a physicist, one will normally seek
unifying principles in collective phenomena rather than describe the peculiarities of
individual species. However, universality is equally important in the practice of condensed 
matter theory. It implies, for example, that, at low temperatures, details of the functional 
form of microscopic interaction potentials are of secondary importance, i.e. that one may 
employ simple model Hamiltonians. 

3. For most systems of interest, the number of degrees of freedom is formidably large with 
$N = \mathcal{O}(10^{23})$. However, contrary to first impressions, the magnitude of this figure is
rather an advantage. The reason is that in addressing condensed matter problems we
may make use of the concepts of statistics and that (precisely due to the largeness of
$N$) statistical errors tend to be negligibly small.¹

4. Finally, condensed matter systems typically possess a number of intrinsic symmetries.
For example, our prototype Hamiltonian above is invariant under simultaneous translation
and rotation of all coordinates, which expresses the global Galilean invariance of
the system (a continuous set of symmetries). Spin rotation invariance (continuous) and
time-reversal invariance (discrete) are other examples of frequently encountered symmetries.
The general importance of symmetries cannot be overemphasized: symmetries
entail the conservation laws that simplify any problem. Yet in condensed matter physics,
symmetries are “even more” important. A conserved observable is generally tied to an
energetically low-lying excitation. In the universal low-temperature regimes in which we
will typically be interested, it is precisely the dynamics of these low-level excitations that
governs the gross behavior of the system. In subsequent sections, the sequence “symmetry
$\mapsto$ conservation law $\mapsto$ low-lying excitations” will be encountered time and again. At
any rate, identification of the fundamental symmetries will typically be the first step in
the analysis of a solid state system.

¹ The importance of this point is illustrated by the empirical observation that the most challenging systems in
physical sciences are of medium (and not large) scale, e.g. metallic clusters, medium-sized nuclei, or large atoms
consisting of $\mathcal{O}(10^1\text{--}10^2)$ fundamental constituents. Such problems are well beyond the reach of few-body quantum
mechanics while not yet accessible to reliable statistical modeling. Often the only viable path to approaching
systems of this type is massive use of phenomenology.
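
The claim in principle 3, that statistical errors become negligible for large N, can be made concrete with a small numerical experiment. The sketch below is an illustration only and is not taken from the text: it assumes a toy "extensive observable" given by the sum of N independent random numbers drawn uniformly from [0, 1), and the function names `relative_fluctuation` and `predicted` are hypothetical. For this toy model the relative fluctuation (standard deviation divided by mean) is exactly $1/\sqrt{3N}$, so it shrinks as $N^{-1/2}$.

```python
import math
import random
import statistics


def relative_fluctuation(n, trials=200, seed=0):
    """Estimate std/mean of a sum of n independent uniform(0, 1) numbers.

    Toy stand-in (an assumption of this sketch) for an extensive observable
    built from n microscopic degrees of freedom.
    """
    rng = random.Random(seed)
    sums = [sum(rng.random() for _ in range(n)) for _ in range(trials)]
    return statistics.pstdev(sums) / statistics.mean(sums)


def predicted(n):
    """Exact result for this toy model: std = sqrt(n/12), mean = n/2."""
    return 1.0 / math.sqrt(3.0 * n)


if __name__ == "__main__":
    for n in (10, 100, 1000):
        print(n, relative_fluctuation(n), predicted(n))
```

Extrapolating the same scaling to $N = \mathcal{O}(10^{23})$ gives relative fluctuations of order $10^{-12}$, which is the sense in which the largeness of N is an advantage. Real systems have correlated degrees of freedom, so this independent-variable model should be read as a caricature.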

To understand how these basic principles can be used to formulate and explore “effective 
low-energy” field theories of solid state systems we will begin our discussion by focussing on 
the harmonic chain, a collection of atoms bound by a harmonic potential. In doing so, we
will observe that the universal characteristics encapsulated by the low-energy dynamics² of
large systems relate naturally to concepts of field theory. 


1.1 Classical harmonic chain: phonons 

Returning to the prototype Hamiltonian (1.1) discussed earlier, let us focus on the dynamical 
properties of the positively charged core ions that constitute the host lattice of a crystal. 
For the moment, let us neglect the fact that atoms are quantum objects and treat the ions 
as classical entities. To further simplify the problem, let us consider an atomic chain rather 
than a generic d-dimensional solid. In this case, the positions of the ions can be specified 
by a sequence of coordinates with an average lattice spacing a. Relying on the reduction 
principle (1) we will first argue that, to understand the behavior of the ions, the dynamics 
of the conduction electrons are of secondary importance, i.e. we will set $H_e = H_{ei} = 0$.

At strictly zero temperature, the system of ions will be frozen out, i.e. the one-dimensional 
ion coordinates $R_I \equiv \bar{R}_I = Ia$ settle into a regularly spaced array. Any deviation from
a perfectly regular configuration has to be paid for by a price in potential energy. For 
low enough temperatures (principle 2), this energy will be approximately quadratic in the 
small deviation from the equilibrium position. The reduced low-energy Hamiltonian of 
our system then reads 

$$
H = \sum_{I=1}^{N} \left[ \frac{P_I^2}{2M} + \frac{k_s}{2} (R_{I+1} - R_I - a)^2 \right], \tag{1.2}
$$

where the coefficient $k_s$ determines the steepness of the lattice potential. Notice that H can
be interpreted as the Hamiltonian of N point-like particles of mass M elastically connected
by springs with spring constant $k_s$ (see Fig. 1.2).
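
To make the structure of the Hamiltonian (1.2) tangible, the following sketch evaluates the energy of a given configuration of momenta and positions. It is a minimal illustration under stated assumptions: the function name `chain_energy` is hypothetical, the chain is taken to be open (so only the N - 1 bonds between neighbouring atoms contribute), and the default units M = k_s = a = 1 are arbitrary.

```python
def chain_energy(P, R, M=1.0, k_s=1.0, a=1.0):
    """Energy of an open harmonic chain (assumption: free ends, N - 1 springs).

    P, R: sequences of momenta P_I and positions R_I of equal length N.
    Returns sum_I P_I^2 / (2M) + (k_s / 2) * sum_I (R_{I+1} - R_I - a)^2.
    """
    kinetic = sum(p * p for p in P) / (2.0 * M)
    potential = 0.5 * k_s * sum(
        (R[i + 1] - R[i] - a) ** 2 for i in range(len(R) - 1)
    )
    return kinetic + potential


if __name__ == "__main__":
    N = 8
    at_rest = [0.0] * N
    regular = [float(i) for i in range(N)]       # R_I = I a with a = 1
    shifted = [r + 0.3 for r in regular]         # rigid translation
    stretched = regular[:-1] + [regular[-1] + 0.1]  # last bond longer by 0.1

    print(chain_energy(at_rest, regular))    # regular array: no energy cost
    print(chain_energy(at_rest, shifted))    # translation: no energy cost
    print(chain_energy(at_rest, stretched))  # one bond stretched: k_s * 0.1**2 / 2
```

The regularly spaced array and any rigid translation of it cost no energy, while stretching a single bond by a small amount costs an energy quadratic in that amount.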


² In this text, we will focus on the dynamical behavior of large systems, as opposed to their static structural
properties. In particular, we will not address questions related to the formation of definite crystallographic 
structures in solid state systems. 





Figure 1.2 Toy model of a one-dimensional solid: a chain of elastically bound massive point particles
(figure labels: lattice positions $(I-1)a$, $Ia$, $(I+1)a$).


Lagrangian formulation and equations of motion 

What are the elementary low- 
energy excitations of the system? To 
answer this question we might, in 
principle, attempt to solve Hamil- 
ton’s equations of motion. Indeed, 
since H is quadratic in all coordi- 
nates, such a program is, in this 
case, feasible. However, we must bear 
in mind that few of the problems 
encountered in general solid state 
physics enjoy this property. Further, 
it seems unlikely that the low-energy dynamics of a macroscopically large chain - which 
we know from our experience will be governed by large-scale wave-like excitations - is 
adequately described in terms of an “atomistic” language; the relevant degrees of freedom 
will be of a different type. We should, rather, draw on the basic principles 1-4 set out 
above. Notably, we have so far paid attention neither to the intrinsic symmetry of the 
problem nor to the fact that N is large. 

Crucially, to reduce a microscopic model to an effective low-energy theory, the Hamil- 
tonian is often not a very convenient starting point. Usually, it is more efficient to start 
out from an action. In the present case, the Lagrangian action corresponding to a time 
interval [0,to] is defined as S = f*° dt L(R, R), where (R, R) = {Ri, Ri} symbolically rep- 
resents the set of all coordinates and their time derivatives. The Lagrangian L related to 
the Hamiltonian (1.2) is given by 


Joseph-Louis Lagrange 1736- 
1813 

A mathematician who excelled 
in all fields of analysis, number 
theory, and celestial mechanics. 

In 1788 he published Mecanique 
Analytique, which summarised all the work done in 
the field of mechanics since the time of Newton, 
and is notable for its use of the theory of differen- 
tial equations. In it he transformed mechanics into 
a branch of mathematical analysis. 



$$L = T - U = \sum_{I=1}^{N} \left[ \frac{M}{2} \dot R_I^2 - \frac{k_s}{2} (R_{I+1} - R_I - a)^2 \right], \qquad (1.3)$$


where T and U denote respectively the kinetic and potential energy. 

Since we are interested in the properties of the large-$N$ system, we can expect boundary
effects to be negligible. This being so, we are at liberty to impose on our atomic chain the
topology of a circle, i.e. we adopt periodic boundary conditions identifying $R_{N+1} = R_1$.
Further, anticipating that the effect of lattice vibrations on the solid is weak (i.e. long-range
atomic order is maintained) we may assume that the deviation of the ions from their
equilibrium position is small ($|R_I(t) - \bar R_I| \ll a$), and the integrity of the solid is maintained.
With $R_I(t) = \bar R_I + \phi_I(t)$ ($\phi_{N+1} = \phi_1$) the Lagrangian (1.3) assumes the simplified form

$$L = \sum_{I=1}^{N} \left[ \frac{M}{2} \dot\phi_I^2 - \frac{k_s}{2} (\phi_{I+1} - \phi_I)^2 \right].$$

To make further progress, we will now make use of the fact that we are not concerned with 
the behavior of our system on “atomic” scales. (In any case, for such purposes a modeling 
like the one above would be much too primitive!) Rather, we are interested in experimentally 
observable behavior that manifests itself on macroscopic length scales (principle 2). For 
example, one might wish to study the specific heat of the solid in the limit of infinitely 
many atoms (or at least a macroscopically large number, $\mathcal{O}(10^{23})$). Under these conditions,
microscopic models can usually be substantially simplified (principle 3). In particular, it is 
often permissible to subject a discrete lattice model to a so-called continuum limit, i.e. 
to neglect the discreteness of the microscopic entities and to describe the system in terms 
of effective continuum degrees of freedom. 

In the present case, taking a continuum limit amounts to describing the lattice fluctuations $\phi_I$
in terms of smooth functions of a continuous variable $x$ (see the figure where the [horizontal]
displacement of the point particles has been plotted along the vertical). Clearly such a
description makes sense only if relative fluctuations on atomic scales are weak (for otherwise the
smoothness condition would be violated). However, if this condition is met - as it will be for
sufficiently large values of the stiffness constant $k_s$ - the continuum description is much more
powerful than the discrete encoding in terms of the “vector” $\{\phi_I\}$. All steps that we will need
to take to go from the Lagrangian to concrete physical predictions will be much easier to formulate.

Introducing continuum degrees of freedom $\phi(x)$, and applying a first-order Taylor expansion, 3
let us define

$$\phi_I \to a^{1/2} \phi(x)\big|_{x=Ia}, \qquad \phi_{I+1} - \phi_I \to a^{3/2} \partial_x\phi(x)\big|_{x=Ia}, \qquad \sum_{I=1}^{N} \to \frac{1}{a} \int_0^L dx,$$

where $L = Na$. Note that, as defined, the functions $\phi(x,t)$ have dimensionality [length]$^{1/2}$.
Expressed in terms of the new degrees of freedom, the continuum limit of the Lagrangian
then reads

$$L[\phi] = \int_0^L dx\, \mathcal{L}(\phi, \partial_x\phi, \dot\phi), \qquad \mathcal{L}(\phi, \partial_x\phi, \dot\phi) = \frac{m}{2} \dot\phi^2 - \frac{k_s a^2}{2} (\partial_x\phi)^2, \qquad (1.4)$$

3 Indeed, for reasons that will become clear, higher-order contributions to the Taylor expansion are immaterial in 
the long-range continuum limit. 
Figure 1.3 Schematic visualization of a field: a mapping $\phi$ from a base manifold $M$ into a target
space $T$ (in this case, $T$ are the real numbers; but, in general, $T$ can be more complicated). A
functional assigns to each $\phi$ a real number $S[\phi]$. The grid embedded into $M$ indicates that fields in
condensed matter physics arise as continuum limits of discrete mappings.


where the Lagrangian density $\mathcal{L}$ has dimensionality [energy]/[length] and we have designated
the particle mass by the more common symbol $m \equiv M$. Similarly, the classical action
assumes the continuum form

$$S[\phi] = \int dt\, L[\phi] = \int dt \int_0^L dx\, \mathcal{L}(\phi, \partial_x\phi, \dot\phi). \qquad (1.5)$$


We have thus succeeded in abandoning the $N$-point particle description in favor of one
involving continuous degrees of freedom, a (classical) field. The dynamics of the latter 
are specified by the functionals L and S, which represent the continuum generalizations 
of the discrete classical Lagrangian and action, respectively. 

INFO The continuum variable $\phi$ is our first encounter with a field. Before proceeding with
our example, let us pause to make some preliminary remarks on the general definition of these
objects. This will help to place the subsequent discussion of the atomic chain into a broader
context. Formally, a field is a smooth mapping

$$\phi : M \to T,$$

from a certain manifold $M$, 4 often called the “base manifold,” into a “target” or “field manifold”
$T$ (see Fig. 1.3). 5 In our present example, $M = [0, L] \times [0, t] \subset \mathbb{R}^2$ is the product of intervals in
space and time, and $T = \mathbb{R}$. In fact, the factorization $M \subset \mathcal{R} \times \mathcal{T}$ into a space-like manifold
$\mathcal{R}$ multiplied by a one-dimensional time-like manifold $\mathcal{T}$ is inherent in most applications of
condensed matter physics. 6


4 If you are unfamiliar with the notion of manifolds (for a crash course, see page 537), think of M and T as subsets 
of some vector space. For the moment, this limitation won’t do any harm. 

5 In some (rare) cases it becomes necessary to define fields in a more general sense (e.g. as sections of mathematical 
objects known as fiber bundles). However, in practically all condensed matter applications the more restrictive 
definition above will suffice. 

6 By contrast, the condition of Lorentz invariance implies the absence of such factorizations in relativistic field 
theory. In classical statistical field theories, i.e. theories probing the thermodynamic behavior of large systems, 
M is just space-like. 
However, the individual factors $\mathcal{R}$ and $\mathcal{T}$ may, of course, be more complex than in our
prototypical problem above. As to the target manifold, not much can be said in general; depending
on the application, the realizations of T range from real or complex numbers over vector spaces 
and groups to the “fanciest objects” of mathematical physics. 

In applied field theory, fields appear not as final objects but rather as input to functionals (see
Fig. 1.3). Mathematically, a functional $S : \phi \mapsto S[\phi] \in \mathbb{R}$ is a mapping that takes a field as its
argument and maps it into the real numbers. The functional profile $S[\phi]$ essentially determines
the character of a field theory. Notice that the argument of a functional is commonly indicated
in square brackets [ ].

While these formulations may appear unnecessarily abstract, remembering the mathematical 
backbone of the theory often helps to avoid confusion. At any rate, it takes some time and practice 
to get used to the concept of fields and functionals. Conceptual difficulties in handling these 
objects can be overcome by remembering that any field in condensed matter physics arises as 
the limit of a discrete mapping. In the present example, the field $\phi(x)$ is obtained as a continuum
approximation of the discrete vector $\{\phi_I\} \in \mathbb{R}^N$; the functional $L[\phi]$ is the continuum limit of
the function $L : \mathbb{R}^N \to \mathbb{R}$, etc. While in practical calculations fields are usually easier to handle
than their discrete analogs, it is sometimes helpful to think about problems of field theory in 
a discrete language. Within the discrete picture, the mathematical apparatus of field theory 
reduces to finite-dimensional calculus. 


Although Eq. (1.4) contains the full information about the model, we have not yet learned 
much about its actual behavior. To extract concrete physical information from Eq. (1.4) we 
need to derive equations of motion. At first sight, it may not be entirely clear what is 
meant by the term “equations of motion” in the context of an infinite-dimensional model: 
the equations of motion relevant for the present problem are obtained as the generalization
of the conventional Lagrange equations of $N$-particle classical mechanics to a model
with infinitely many degrees of freedom. To derive these equations we need to generalize 
Hamilton’s extremal principle (i.e. the route from an action to the associated equations of 
motion) to infinite dimensions. As a warm-up, let us briefly recapitulate how the extremal 
principle works for a system with one degree of freedom. 

Suppose the dynamics of a classical point particle with coordinate $x(t)$ is described by the
classical Lagrangian $L(x, \dot x)$, and action $S[x] = \int dt\, L(x, \dot x)$. Hamilton’s extremal principle
states that the configurations $x(t)$ that are actually realized are those that extremize
the action, $\delta S[x] = 0$. This means (for a substantiated discussion, see Section 1.2 below)
that, for any smooth curve $t \mapsto y(t)$,

$$\lim_{\epsilon \to 0} \frac{1}{\epsilon} \left( S[x + \epsilon y] - S[x] \right) = 0. \qquad (1.6)$$

To first order in $\epsilon$, the action has to remain invariant. Applying this condition, one finds
that it is fulfilled if and only if $x$ satisfies Lagrange’s equation of motion

$$\frac{d}{dt} \left( \partial_{\dot x} L \right) - \partial_x L = 0. \qquad (1.7)$$


EXERCISE Recapitulate the derivation of (1.7) from the classical action. 
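As a numerical illustration of Eq. (1.6), the following sketch discretizes the action of a single harmonic oscillator and compares the first variation on a solution of the equation of motion with that on a path that is not a solution. Everything specific here is an assumption made for illustration: the Lagrangian $L = m\dot x^2/2 - kx^2/2$ with $m = k = 1$, the time interval $[0, 1]$, the trial variation $y(t) = \sin(\pi t)$ (chosen to vanish at both endpoints), the midpoint discretization, and the use of a symmetric difference quotient in place of the one-sided limit.

```python
import math


def action(path, dt, m=1.0, k=1.0):
    """Midpoint-rule discretization of S[x] = integral dt (m/2 xdot^2 - k/2 x^2)."""
    total = 0.0
    for x0, x1 in zip(path[:-1], path[1:]):
        velocity = (x1 - x0) / dt
        midpoint = 0.5 * (x0 + x1)
        total += dt * (0.5 * m * velocity ** 2 - 0.5 * k * midpoint ** 2)
    return total


def first_variation(x_of_t, t0=1.0, steps=2000, eps=1e-3):
    """Symmetric difference quotient (S[x + eps*y] - S[x - eps*y]) / (2*eps)."""
    dt = t0 / steps
    times = [i * dt for i in range(steps + 1)]
    # Trial variation y(t); it vanishes at t = 0 and t = t0.
    y = [math.sin(math.pi * t / t0) for t in times]
    x = [x_of_t(t) for t in times]
    plus = [xi + eps * yi for xi, yi in zip(x, y)]
    minus = [xi - eps * yi for xi, yi in zip(x, y)]
    return (action(plus, dt) - action(minus, dt)) / (2.0 * eps)


# x(t) = sin(t) solves m*xddot = -k*x for m = k = 1.
solution = first_variation(math.sin)
# A straight line with the same endpoints does not solve the equation of motion.
straight_line = first_variation(lambda t: t * math.sin(1.0))

print("first variation on the solution:     ", solution)
print("first variation on the straight line:", straight_line)
```

On the solution the quotient is zero up to discretization error, whereas on the straight line it stays finite, which is the content of the extremal principle for this one trial variation.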
Figure 1.4 Schematic showing the variation of the field associated with the action functional.
Notice that the variation $\epsilon\eta$ is supposed to vanish on the boundaries of the base $M = [0, L] \times [0, t]$.


In Eq. (1.5) we are dealing with a system of infinitely many degrees of freedom, $\phi(x, t)$. Yet
Hamilton’s principle is general and we may see what happens if Eq. (1.5) is subjected to an
extremal principle analogous to Eq. (1.6). To do so, we substitute $\phi(x, t) \to \phi(x, t) + \epsilon\eta(x, t)$
into Eq. (1.5) and require vanishing of the first-order contribution to an expansion in $\epsilon$
(see Fig. 1.4). When applied to the specific Lagrangian (1.4), substituting the “varied” field
leads to

$$S[\phi + \epsilon\eta] = S[\phi] + \epsilon \int dt \int_0^L dx \left( m \dot\phi \dot\eta - k_s a^2\, \partial_x\phi\, \partial_x\eta \right) + \mathcal{O}(\epsilon^2).$$

Integrating by parts and requiring that the contribution linear in $\epsilon$ vanishes, one obtains

$$\lim_{\epsilon \to 0} \frac{1}{\epsilon} \left( S[\phi + \epsilon\eta] - S[\phi] \right) = -\int dt \int_0^L dx \left( m \ddot\phi - k_s a^2\, \partial_x^2\phi \right) \eta = 0.$$

(Notice that the boundary terms vanish identically.) Now, since $\eta$ was defined to be an
arbitrary smooth function, the integral above can vanish only if the factor in parentheses is
globally vanishing. Thus the equation of motion takes the form of a wave equation

$$\left( m \partial_t^2 - k_s a^2 \partial_x^2 \right) \phi = 0. \qquad (1.8)$$


The solutions of Eq. (1.8) have the general form $\phi_+(x - vt) + \phi_-(x + vt)$ where
$v = a\sqrt{k_s/m}$, and $\phi_\pm$ are arbitrary smooth functions of the argument. From this we can
deduce that the basic low-energy elementary excitations of our
model are lattice vibrations propagating as sound waves to the left or right at a con- 
stant velocity v (see the figure). 7 The trivial behavior of our model is of course a direct 
consequence of its simplistic definition - no dissipation, dispersion, or other non-trivial 
ingredients. Adding these refinements leads to the general classical theory of lattice vibra- 
tions (such as that described in the text by Ashcroft and Mermin 8 ). Finally, notice that 



7 Strictly speaking, the modeling of our system enforces a periodicity constraint $\phi_\pm(x + L) = \phi_\pm(x)$. However, in
the limit of a large system, this aspect becomes inessential. 

8 N. W. Ashcroft and N. D. Mermin, Solid State Physics (Holt-Saunders International, 1983). 
the elementary excitations of the chain have little in common with its “microscopic”
constituents (the atomic oscillators). Rather they are collective excitations, i.e. elementary
excitations comprising a macroscopically large number of microscopic degrees of freedom.
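The sound velocity $v = a\sqrt{k_s/m}$ can be cross-checked against the discrete chain itself. The sketch below rests on a standard result that is not derived in the text above: inserting a plane wave $\phi_I = \cos(kIa - \omega t)$ into the discrete equation of motion $M\ddot\phi_I = k_s(\phi_{I+1} + \phi_{I-1} - 2\phi_I)$ gives $\omega^2 = (2k_s/M)(1 - \cos ka)$. The function names and the parameter values ($N = 1000$, $a = M = k_s = 1$) are illustrative assumptions.

```python
import math


def mode_frequency(n, N, M=1.0, k_s=1.0):
    """Frequency of the n-th normal mode of the periodic discrete chain.

    Uses omega^2 = (2*k_s/M) * (1 - cos(k*a)) with k*a = 2*pi*n/N, the wave
    numbers compatible with periodic boundary conditions.
    """
    ka = 2.0 * math.pi * n / N
    return math.sqrt(2.0 * k_s / M * (1.0 - math.cos(ka)))


def continuum_frequency(n, N, a=1.0, M=1.0, k_s=1.0):
    """omega = v*k from the wave equation, with v = a*sqrt(k_s/M)."""
    v = a * math.sqrt(k_s / M)
    k = 2.0 * math.pi * n / (N * a)
    return v * k


N = 1000
for n in (1, 10, 100, 500):
    ratio = mode_frequency(n, N) / continuum_frequency(n, N)
    print(f"mode {n:4d}: discrete / continuum frequency = {ratio:.6f}")
```

For wavelengths much longer than the lattice spacing the ratio approaches one, so these modes propagate with the velocity $v$ of the continuum theory; the ratio drops noticeably only for wavelengths of a few lattice spacings, a regime the continuum limit does not aim to describe.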

INFO The “relevant” excitations of a condensed matter system can, but need not, be 
of collective type. For example, the interacting electron gas (a system to be discussed in detail 
below) supports microscopic excitations - charged quasi-particles standing in 1:1 correspondence 
with the electrons of the original microscopic system - while the collective excitations are plasmon 
modes of large wavelength and engaging many electrons. Typically, the nature of the fundamental 
excitations cannot be straightforwardly inferred from the microscopic definition of a model. 
Indeed, the mere identification of the relevant excitations often represents the most important 
step in the solution of a condensed matter problem. 


Hamiltonian formulation 

An important characteristic of any 
excitation is its energy. How much 
energy is stored in the sound waves 
of the harmonic chain? To address 
this question, we need to switch back 
to a Hamiltonian formulation. Once 
again, this is achieved by general- 
izing standard manipulations from 
point mechanics to the continuum. 

Remembering that, for a Lagrangian
of a point particle, $p = \partial_{\dot x} L$ is the momentum conjugate to the coordinate $x$, let us consider
the Lagrangian density and define 9


Sir William Rowan Hamilton 
1805-65 

A mathematician credited with 
the discovery of quaternions, 
the first non-commutative alge- 
bra to be studied. He also 
invented important new methods 
in mechanics. (Image from W. R. Hamilton, Collected
Papers, vol. II, Cambridge University Press,
1940.)



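$$\pi(x) \equiv \frac{\partial \mathcal{L}(\phi, \partial_x\phi, \dot\phi)}{\partial \dot\phi(x)} \qquad (1.9)$$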


as the canonical momentum associated with $\phi$ (at the point $x$). In common with $\phi$,
the momentum $\pi$ is a continuum degree of freedom. At each space point it may take an
independent value. Notice that $\pi(x)$ is nothing but the continuum generalization of the
lattice momentum $P_I$ of Eq. (1.2). (Applied to $P_I$, a continuum approximation like $\phi_I \to \phi(x)$
would produce $\pi(x)$.) The Hamiltonian density is then defined as usual through the
Legendre transformation,

$$\mathcal{H}(\phi, \partial_x\phi, \pi) = \left. \left( \pi \dot\phi - \mathcal{L}(\phi, \partial_x\phi, \dot\phi) \right) \right|_{\dot\phi = \dot\phi(\phi, \pi)}, \qquad (1.10)$$


9 In field theory literature it is popular to denote the momentum by a Greek letter.
from where the full Hamiltonian is obtained as $H = \int_0^L dx\, \mathcal{H}$.

EXERCISE Verify that the transition $L \to H$ is a straightforward continuum generalization of
the Legendre transformation of the $N$-particle Lagrangian $L(\{\phi_I\}, \{\dot\phi_I\})$.


Having introduced a Hamiltonian, we are in a position to determine the energy of the sound
waves. Application of Eq. (1.9) and (1.10) to the Lagrangian of the atomic chain yields
$\pi(x, t) = m\dot\phi(x, t)$ and

$$H[\pi, \phi] = \int dx \left( \frac{\pi^2}{2m} + \frac{k_s a^2}{2} (\partial_x\phi)^2 \right). \qquad (1.11)$$

Considering, say, a right-moving sound-wave excitation, $\phi(x, t) = \phi_+(x - vt)$, we find that
$\pi(x, t) = -mv\, \partial_x\phi_+(x - vt)$ and
$H[\pi, \phi] = k_s a^2 \int dx\, [\partial_x\phi_+(x - vt)]^2 = k_s a^2 \int dx\, [\partial_x\phi_+(x)]^2$,
i.e. a positive definite, time-independent expression, as one would expect.

Before proceeding, let us note an interesting feature of the energy functional: in the limit
of an infinitely shallow excitation, $\partial_x\phi_+ \to 0$, the energy vanishes. This sets the stage
for the last of the principles (4) hitherto unconsidered, symmetry. The Hamiltonian of
an atomic chain is invariant under simultaneous translation of all atom coordinates by a
fixed increment: $\phi_I \to \phi_I + \delta$, where $\delta$ is constant. This expresses the fact that a global
translation of the solid as a whole does not affect the internal energy. Now, the ground state
of any specific realization of the solid will be defined through a static array of atoms, each
located at a fixed coordinate $R_I = Ia \Rightarrow \phi_I = 0$. We say that the translational symmetry is
“spontaneously broken,” i.e. the solid has to decide where exactly it wants to rest. However,
spontaneous breakdown of a symmetry does not imply that the symmetry disappeared. On
the contrary, infinite-wavelength deviations from the pre-assigned ground state come close
to global translations of (macroscopically large portions of) the solid and, therefore, cost
a vanishingly small amount of energy. This is the reason for the vanishing of the sound
wave energy in the limit $\partial_x\phi \to 0$. It is also our first encounter with the aforementioned
phenomenon that symmetries lead to the formation of soft, i.e. low-energy, excitations. A
much more systematic exposition of these connections will be given in Chapter 6.

To conclude our discussion of the 
classical harmonic chain, let us con- 
sider the specific heat, a quantity 
directly accessible in experiment. A 
rough estimate of this quantity can 
be readily obtained from the micro- 
scopic harmonic Hamiltonian (1.2). 
According to the principles of statis- 
tical mechanics, the thermodynamic energy density is given by 


Ludwig Boltzmann 1844-1906 

A physicist whose greatest achievement was in the 
development of statistical mechanics, which explains 
and predicts how the properties of atoms (such as 
mass, charge, and structure) determine the visible 
properties of matter (such as viscosity, thermal con- 
ductivity, and diffusion). 


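$$u = \frac{1}{L} \frac{\int d\Gamma\, e^{-\beta H} H}{\int d\Gamma\, e^{-\beta H}} = -\frac{1}{L} \partial_\beta \ln \int d\Gamma\, e^{-\beta H},$$

where $\beta = 1/k_B T$, $Z \equiv \int d\Gamma\, e^{-\beta H}$ is the Boltzmann partition function and the phase
space volume element $d\Gamma = \prod_{I=1}^{N} dR_I\, dP_I$. We will set $k_B = 1$ throughout. The specific heat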
is then obtained as $c = \partial_T u$. To determine the temperature dependence of this quantity,
we make use of the fact that, upon rescaling of integration variables, $R_I \to \beta^{-1/2} X_I$,
$P_I \to \beta^{-1/2} Y_I$, the exponent $\beta H(R, P) \to H(X, Y)$ becomes independent of temperature
(a property that relies on the quadratic dependence of $H$ on both $R$ and $P$). The integration
measure transforms as $d\Gamma \to \beta^{-N} \prod_{I=1}^{N} dX_I\, dY_I \equiv \beta^{-N} d\Gamma'$. Expressed in terms of the
rescaled variables, one obtains the energy density

$$u = -\frac{1}{L} \partial_\beta \ln \left( \beta^{-N} K \right) = \rho T,$$

where $\rho = N/L$ is the density of atoms, and we have made use of the fact that the constant
$K \equiv \int d\Gamma'\, e^{-H(X, Y)}$ is independent of temperature. We thus find a temperature independent
specific heat $c = \rho$. Notice that $c$ is fully universal, i.e. independent of the material
constants $M$ and $k_s$ determining $H$. (In fact, we could have anticipated this result from the
equipartition theorem of classical mechanics, i.e. the law that in a system with $N$ degrees
of freedom, the energy scales as $U = NT$.)
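The scaling argument can be checked numerically for a single oscillator. The sketch below is an illustration under stated assumptions rather than part of the original analysis: it takes one pair $(R, P)$ with $H = P^2/2M + k_s R^2/2$, evaluates the partition function by a plain Riemann sum on a fixed grid, and obtains the energy from a central difference of $\ln Z$ with respect to $\beta$ (with $k_B = 1$). The grid width, step sizes and the two parameter sets are arbitrary choices.

```python
import math


def gaussian_integral(weight, half_width=40.0, points=8001):
    """Riemann sum for the integral of exp(-weight * q**2) over [-half_width, half_width]."""
    step = 2.0 * half_width / (points - 1)
    return step * sum(
        math.exp(-weight * (-half_width + i * step) ** 2) for i in range(points)
    )


def log_partition(beta, M, k_s):
    """ln Z for one oscillator; Z factorizes into a momentum and a position integral."""
    z_momentum = gaussian_integral(beta / (2.0 * M))
    z_position = gaussian_integral(beta * k_s / 2.0)
    return math.log(z_momentum * z_position)


def energy(T, M, k_s, h=1e-4):
    """Thermal energy -d(ln Z)/d(beta), estimated by a central difference."""
    beta = 1.0 / T
    return -(log_partition(beta + h, M, k_s) - log_partition(beta - h, M, k_s)) / (2.0 * h)


for T in (0.5, 1.0, 2.0):
    print(T, energy(T, M=2.0, k_s=3.0), energy(T, M=0.3, k_s=10.0))
```

Both parameter sets give an energy equal to $T$ within numerical accuracy, independent of $M$ and $k_s$, in line with $Z \propto \beta^{-1}$ per oscillator and hence with the temperature-independent specific heat found above.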

How do these findings compare with experiment? Figure 1.5 shows the specific heat of
the insulating compound EuCoO$_3$. 10 For large temperatures, the specific heat approaches
a constant value, in accord with our analysis. However, for lower temperatures, substantial
deviations from $c = \mathrm{const.}$ appear. Yet, this strong temperature dependence does not reflect
a failure of the simplistic microscopic modeling. Rather, the deviation is indicative of a 
quantum phenomenon. Indeed, we have so far totally neglected the quantum nature of 
the atomic oscillators. In the next chapter we will rectify this deficiency and discuss how 
the effective low-energy theory of the harmonic chain can be promoted to a quantum field 
theory. However, before proceeding with the development of the theory let us pause to 
introduce a number of mathematical concepts that surfaced above, in a way that survives 
generalization to richer problems. 


1.2 Functional analysis and variational principles 

Let us revisit the derivation of the equations of motion associated with the harmonic chain, 
Eq. (1.8). Although straightforward, neither was the calculation efficient, nor did it reveal 
general structures. In fact, what we did - expanding explicitly to first order in the
variational parameter $\epsilon$ - had the same status as evaluating derivatives by explicitly taking
limits: $f'(x) = \lim_{\epsilon \to 0} (f(x + \epsilon) - f(x))/\epsilon$. Moreover, the derivation made explicit use of
the particular form of the Lagrangian, thereby being of limited use with regard to a gen- 
eral understanding of the construction scheme. Given the importance attached to extremal 
principles in all of field theory, it is worthwhile investing some effort in constructing a more 
efficient scheme for the general variational analysis of continuum theories. In order to carry 
out this program we first need to introduce a mathematical tool of functional analysis, 
namely the concept of functional differentiation. 


10 In metals, the specific heat due to lattice vibrations exceeds the specific heat of the free conduction electrons
for temperatures larger than a few kelvin.
Figure 1.5 Specific heat $c_p$ of the insulator EuCoO$_3$. At large temperatures, the specific heat starts
to approach a constant value, as predicted by the analysis of the classical harmonic chain. However,
for small temperatures, deviations from $c_v = \mathrm{const.}$ are substantial. Such deviations can be ascribed
to quantum mechanical effects. (Courtesy of M. Kriener, A. Reichl, T. Lorenz, and A. Freimuth.)


In working with functionals, one is often concerned with how a given functional behaves
under (small) variations of its argument function. In particular, given a certain function $f$,
suspected to make a functional $F[f]$ stationary, one would like to find out whether, indeed,
the functional remains invariant under variations $f \to f + h$, where $h$ is an “infinitely small”
increment function. In ordinary analysis, questions of this type are commonly addressed by
exploring derivatives, i.e. we need to generalize the concept of a derivative to functionals.
This is achieved by the following definition: a functional $F$ is called differentiable if

$$F[f + \epsilon g] - F[f] = \epsilon \cdot DF_f[g] + \mathcal{O}(\epsilon^2),$$

where the differential $DF_f$ is a linear functional (i.e. one with $DF_f[g_1 + g_2] = DF_f[g_1] + DF_f[g_2]$),
$\epsilon$ is a small parameter, and $g$ is an arbitrary function. The subscript indicates
that the differential generally depends on the “base argument” $f$. A functional $F$ is said to
be stationary on $f$, if and only if $DF_f = 0$.

In principle, the definition above answers our question concerning a stationarity condition. 
However, to make use of the definition, we still need to know how to compute the differential 
DF and how to relate the differentiability criterion to the concepts of ordinary calculus. In 
order to understand how answers to these questions can be systematically found, it is helpful 
to return temporarily to a discrete way of thinking, i.e. to interpret the argument $f$ of a
functional $F[f]$ as the limit $N \to \infty$ of a discrete vector $\mathbf{f} = \{f_n \equiv f(x_n),\ n = 1, \dots, N\}$,
where $\{x_n\}$ denotes a discretization of the support of $f$ (cf. the harmonic chain, $\phi \leftrightarrow f$).
Prior to taking the continuum limit, $N \to \infty$, $\mathbf{f}$ has the status of an $N$-dimensional vector
and $F(\mathbf{f})$ is a function defined over $N$-dimensional space. After the continuum limit, $\mathbf{f} \to f$
becomes a function itself and $F(\mathbf{f}) \to F[f]$ becomes a functional.

Now, within the discrete picture, it is clear how the variational behavior of functions is to
be analyzed. For example, the condition that, for all $\epsilon$ and all vectors $\mathbf{g}$, the linear expansion
of $F(\mathbf{f} + \epsilon\mathbf{g})$ ought to vanish is simply to say that the ordinary differential, $dF_{\mathbf{f}}$, defined






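through

$$F(\mathbf{f} + \epsilon\mathbf{g}) - F(\mathbf{f}) = \epsilon \cdot dF_{\mathbf{f}}(\mathbf{g}) + \mathcal{O}(\epsilon^2),$$

must be zero. In practice, one often expresses conditions of this type in terms of a certain
basis. In a Cartesian basis of $N$ unit vectors, $\mathbf{e}_n$, $n = 1, \dots, N$, $dF_{\mathbf{f}}(\mathbf{g}) \equiv \langle \nabla F_{\mathbf{f}}, \mathbf{g} \rangle$, where
$\langle \mathbf{f}, \mathbf{g} \rangle \equiv \sum_{n=1}^{N} f_n g_n$ denotes the standard scalar product, $\nabla F_{\mathbf{f}} = \{\partial_{f_n} F\}$ represents the
gradient with the partial derivative defined as

$$\partial_{f_n} F(\mathbf{f}) \equiv \lim_{\epsilon \to 0} \frac{1}{\epsilon} \left[ F(\mathbf{f} + \epsilon\mathbf{e}_n) - F(\mathbf{f}) \right]. \qquad (1.12)$$

From these identities, the differential is identified as

$$dF_{\mathbf{f}}(\mathbf{g}) = \sum_n \partial_{f_n} F(\mathbf{f})\, g_n. \qquad (1.13)$$

The vanishing of the differential amounts to vanishing of all partial derivatives $\partial_{f_n} F = 0$.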

Equations (1.12) and (1.13) can now be straightforwardly generalized to the continuum
limit whereupon the summation defining the finite-dimensional scalar product translates to
an integral,

$$\langle \mathbf{f}, \mathbf{g} \rangle = \sum_{n=1}^{N} f_n g_n \;\to\; \langle f, g \rangle = \int dx\, f(x) g(x).$$

The analog of the $n$th unit vector is a $\delta$-distribution, $\mathbf{e}_n \to \delta_x$, where $\delta_x(x') \equiv \delta(x - x')$, as
can be seen from the following correspondence:

$$f_n = \langle \mathbf{f}, \mathbf{e}_n \rangle = \sum_m f_m (\mathbf{e}_n)_m \;\to\; f(x) = \langle f, \delta_x \rangle = \int dx'\, f(x') \delta_x(x').$$

Here $(\mathbf{e}_n)_m = \delta_{nm}$ denotes the $m$th component of the $n$th unit vector. The correspondence
(unit vector $\leftrightarrow$ $\delta$-distribution) is easy to memorize: while the components of $\mathbf{e}_n$ vanish, save
for the $n$th component, that equals unity, $\delta_x$ is a function that vanishes everywhere, save for
$x$ where it is infinite. That a unit component is replaced by “infinity” reflects the fact that
the support of the $\delta$-distribution is infinitely narrow; to obtain a unit-normalized integral
$\int \delta_x$, the function must be singular.

As a consequence of these identities, Eq. (1.13) translates to the continuum differential,

$$DF_f[g] = \int dx\, \frac{\delta F[f]}{\delta f(x)}\, g(x), \qquad (1.14)$$

where the generalization of the partial derivative,

$$\frac{\delta F[f]}{\delta f(x)} \equiv \lim_{\epsilon \to 0} \frac{1}{\epsilon} \left( F[f + \epsilon\delta_x] - F[f] \right), \qquad (1.15)$$

is commonly denoted by “$\delta$” instead of “$\partial$.” Equations (1.14) and (1.15) establish the
conceptual connection between ordinary and functional differentiation. Notice that we have
not yet learned how to calculate the differential practically, i.e. to evaluate expressions
like Eq. (1.15) for concrete functionals. Nevertheless, the identities above are very useful,
enabling us to generalize more-complex derivative operations of ordinary calculus by
Table 1.1 Summary of basic definitions of discrete and continuum calculus.

Entity                  Discrete                                      Continuum
Argument                vector $\mathbf{f}$                           function $f$
Function(al)            multidimensional function $F(\mathbf{f})$     functional $F[f]$
Differential            $dF_{\mathbf{f}}(\mathbf{g})$                 $DF_f[g]$
Cartesian basis         $\mathbf{e}_n$                                $\delta_x$
Scalar product ⟨ , ⟩    $\sum_n f_n g_n$                              $\int dx\, f(x) g(x)$
“Partial derivative”    $\partial_{f_n} F(\mathbf{f})$                $\delta F[f] / \delta f(x)$


straightforward transcription. For example, the generalization of the standard chain rule,
$\partial_{f_n} F(\mathbf{g}(\mathbf{f})) = \sum_m \partial_{g_m} F(\mathbf{g})|_{\mathbf{g} = \mathbf{g}(\mathbf{f})}\, \partial_{f_n} g_m(\mathbf{f})$ reads

$$\frac{\delta F[g[f]]}{\delta f(x)} = \int dy\, \frac{\delta F[g]}{\delta g(y)}\bigg|_{g = g[f]} \frac{\delta g(y)[f]}{\delta f(x)}.$$

Here $g[f]$ is the continuum generalization of an $\mathbb{R}^m$-valued function, $\mathbf{g} : \mathbb{R}^n \to \mathbb{R}^m$, a
function whose components $g(y)[f]$ are functionals by themselves. Furthermore, given some
functional $F[f]$, we can construct its Taylor expansion as

$$F[f] = F[0] + \int dx_1\, \frac{\delta F[f]}{\delta f(x_1)}\bigg|_{f=0} f(x_1) + \frac{1}{2} \int dx_1 \int dx_2\, \frac{\delta^2 F[f]}{\delta f(x_2)\, \delta f(x_1)}\bigg|_{f=0} f(x_1) f(x_2) + \cdots,$$

where (exercise)

$$\frac{\delta^2 F[f]}{\delta f(x_2)\, \delta f(x_1)} = \lim_{\epsilon_{1,2} \to 0} \frac{1}{\epsilon_1 \epsilon_2} \left( F[f + \epsilon_1\delta_{x_1} + \epsilon_2\delta_{x_2}] - F[f + \epsilon_1\delta_{x_1}] - F[f + \epsilon_2\delta_{x_2}] + F[f] \right)$$

generalizes a two-fold partial derivative. The validity of these identities can be made plausible
by applying the transcription, Table 1.1, to the corresponding relations of standard
calculus. To actually verify the formulae, one has to take the continuum limit of each step
taken in the discrete variant of the corresponding proofs. At any rate, experience shows that
it takes some time to get used to the concept of functional differentiation. However, after
some practice it will become clear that this operation is not only extremely useful but also
as easy to handle as conventional partial differentiation.
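The discrete picture behind Eq. (1.15) can be tried out directly on a grid. The following sketch is purely illustrative: the functional $F[f] = \int_0^1 dx\, f(x)^2$, the test function $f(x) = \sin(2\pi x)$, the grid of 100 points and the value of $\epsilon$ are all assumptions chosen for the example. On the grid, the $\delta$-distribution $\delta_x$ is represented by the unit vector $\mathbf{e}_n$ divided by the grid spacing, so that its discretized integral equals one, and the difference quotient is compared with the analytic functional derivative $2f(x)$.

```python
import math


def F(f, dx):
    """Discretized functional F[f] = integral dx f(x)^2 on a uniform grid."""
    return dx * sum(value * value for value in f)


def functional_derivative(f, n, dx, eps=1e-7):
    """Difference quotient (F[f + eps*delta_x] - F[f]) / eps at grid point n.

    The delta-distribution is represented by e_n / dx, so that
    dx * sum(delta) = 1 on the grid.
    """
    shifted = list(f)
    shifted[n] += eps / dx
    return (F(shifted, dx) - F(f, dx)) / eps


N = 100
dx = 1.0 / N
grid = [(i + 0.5) * dx for i in range(N)]
f = [math.sin(2.0 * math.pi * x) for x in grid]

for n in (10, 25, 60):
    numeric = functional_derivative(f, n, dx)
    analytic = 2.0 * f[n]
    print(f"x = {grid[n]:.3f}: numeric {numeric:.5f}, analytic 2 f(x) = {analytic:.5f}")
```

The two columns agree up to a small remainder of order $\epsilon/\Delta x$, which illustrates that the functional derivative is the ordinary partial derivative $\partial_{f_n} F$ divided by the grid spacing.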

We finally address the question how to compute functional derivatives in practice. In
doing so, we will make use of the fact that, in all but a few cases, the functionals encountered
in field theory are of the structure

$$S[\phi] = \int_M d^m x\, \mathcal{L}(\phi^i, \partial_\mu\phi^i). \qquad (1.16)$$


Here, we assume the base manifold $M$ to be parameterized by an $m$-dimensional coordinate
vector $x = \{x_\mu\}$. (In most practical applications, $m = d + 1$, and $x = (x_0, x_1, \dots, x_d)$
contains one time-like component $x_0 = t$ and $d$ space-like components $x_k$, $k = 1, \dots, d$. 11 )
We further assume that the field manifold has dimensionality $n$ and that $\phi^i$, $i = 1, \dots, n$,
are the coordinates of the field.

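What makes the functional $S[\phi]$ easy to handle is that all of its information is stored
in the function $\mathcal{L}$. Owing to this simplification, the functional derivative can be related to
an ordinary derivative of $\mathcal{L}$. To see this, all that we have to do is to evaluate the general
definition Eq. (1.14) on the functional $S$:

$$\begin{aligned}
S[\phi + \epsilon\theta] - S[\phi] &= \int_M d^m x \left[ \mathcal{L}(\phi + \epsilon\theta, \partial_\mu\phi + \epsilon\partial_\mu\theta) - \mathcal{L}(\phi, \partial_\mu\phi) \right] \\
&= \int_M d^m x \left[ \frac{\partial\mathcal{L}}{\partial\phi^i} \theta^i + \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i)} \partial_\mu\theta^i \right] \epsilon + \mathcal{O}(\epsilon^2) \\
&= \int_M d^m x \left[ \frac{\partial\mathcal{L}}{\partial\phi^i} - \partial_\mu \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i)} \right] \theta^i \epsilon + \mathcal{O}(\epsilon^2),
\end{aligned}$$

where in the last line we have assumed that the field variation vanishes on the boundary
of the base manifold, $\theta|_{\partial M} = 0$. Comparison with Eq. (1.14) identifies the functional
derivative as

$$\frac{\delta S[\phi]}{\delta\phi^i(x)} = \frac{\partial\mathcal{L}}{\partial\phi^i(x)} - \partial_\mu \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i(x))}.$$

We conclude that stationarity of the functional (1.16) is equivalent to the condition

$$\forall x, i : \quad \frac{\partial\mathcal{L}}{\partial\phi^i(x)} - \partial_\mu \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i(x))} = 0. \qquad (1.17)$$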


Equation (1.17) is known as the Euler-Lagrange equation of field theory. In fact, for $d = 0$
and $x_0 = t$, Eq. (1.17) reduces to the familiar Euler-Lagrange equation of a point particle
in $n$-dimensional space. For $d = 1$ and $(x_0, x_1) = (t, x)$ we get back to the stationarity
equations discussed in the previous section. In the next section we will apply the formalism 
to a higher-dimensional problem. 
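As a brief worked check of the case $d = 1$, the Euler-Lagrange equation can be applied to the Lagrangian density of the harmonic chain, Eq. (1.4), with a single field component $\phi$ and $(x_0, x_1) = (t, x)$. The steps below are an illustrative sketch written out for convenience; they use only the Lagrangian density and the stationarity condition quoted above.

```latex
\begin{align*}
\mathcal{L} &= \frac{m}{2}(\partial_t\phi)^2 - \frac{k_s a^2}{2}(\partial_x\phi)^2, \\
\frac{\partial\mathcal{L}}{\partial\phi} &= 0, \qquad
\frac{\partial\mathcal{L}}{\partial(\partial_t\phi)} = m\,\partial_t\phi, \qquad
\frac{\partial\mathcal{L}}{\partial(\partial_x\phi)} = -k_s a^2\,\partial_x\phi, \\
0 &= \frac{\partial\mathcal{L}}{\partial\phi}
   - \partial_t \frac{\partial\mathcal{L}}{\partial(\partial_t\phi)}
   - \partial_x \frac{\partial\mathcal{L}}{\partial(\partial_x\phi)}
   = -\left( m\,\partial_t^2 - k_s a^2\,\partial_x^2 \right)\phi .
\end{align*}
```

Up to an overall sign this is the wave equation obtained earlier by explicit expansion in $\epsilon$, now reached without any manipulation of the action integral itself.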


1.3 Maxwell’s equations as a variational principle 

As a second example, let us consider the archetype of classical field theory, classical electro- 
dynamics. Indeed, as well as exemplifying the application of continuum variational principles 
on a problem with which we are all acquainted, this example illustrates the unifying poten- 
tial of the approach: that problems as different as the low-lying vibrational modes of a 
crystalline solid, and electrodynamics can be described by almost identical language indi- 
cates that we are dealing with a useful formalism. Specifically, our aim will be to explore 
how the equations of motion of electrodynamics, the inhomogeneous Maxwell equations, 

$$\nabla \cdot \mathbf{E} = \rho, \qquad \nabla \times \mathbf{B} - \partial_t \mathbf{E} = \mathbf{j}, \qquad (1.18)$$


11 Following standard conventions we denote space-like components by small Latin indices $k = 1, \dots, d$. In
contrast, space-time indices are denoted by Greek indices $\mu = 0, \dots, d$.
can be obtained from variational principles. (For simplicity, we restrict ourselves to a vacuum
theory, i.e. $\mathbf{E} = \mathbf{D}$ and $\mathbf{B} = \mathbf{H}$. Further, we have set the velocity of light to unity, $c = 1$.
Within the framework of the variational principle, the homogeneous equations,

$$\nabla \times \mathbf{E} + \partial_t \mathbf{B} = 0, \qquad \nabla \cdot \mathbf{B} = 0, \qquad (1.19)$$

are regarded as ab initio constraints imposed on the “degrees of freedom” $\mathbf{E}$ and $\mathbf{B}$.)

To formulate Maxwell’s theory as 
a variational principle we require (1) 
a field formulated in a set of suitable 
“generalized coordinates,” and (2) its 
action. As to coordinates, the natural 
choice will be the coefficients of the 
electromagnetic (EM) 4-potential, 

$A_\mu = (\phi, -\mathbf{A})$, where $\phi$ is the scalar and $\mathbf{A}$ is the vector potential. The
potential $A$ is unconstrained and uniquely determines the fields $\mathbf{E}$ and
$\mathbf{B}$ through the standard equations $\mathbf{E} = -\nabla\phi - \partial_t\mathbf{A}$ and $\mathbf{B} = \nabla \times \mathbf{A}$. (In fact, the set of
coordinates $A_\mu$ is “overly free” in the sense that gauge transformations $A_\mu \to A_\mu + \partial_\mu\Gamma$,
where $\Gamma$ is an arbitrary function, leave the physical fields invariant. Later we will comment
explicitly on this point.) The connection between $A$ and the physical fields can be expressed
in a more symmetric way by introducing the EM field tensor,

$$F = \{F_{\mu\nu}\} = \begin{pmatrix} 0 & E_1 & E_2 & E_3 \\ -E_1 & 0 & -B_3 & B_2 \\ -E_2 & B_3 & 0 & -B_1 \\ -E_3 & -B_2 & B_1 & 0 \end{pmatrix}. \qquad (1.20)$$

The relation between fields and potential now reads $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$, where $x_\mu = (t, -\mathbf{x})$
and $\partial_\mu = (\partial_t, \nabla)$.


James Clerk Maxwell 1831-79 

Scottish theoretical physicist and 
mathematician. Amongst many 
other achievements, he is credited 
with the formulation of the theory 
of electromagnetism, synthesizing 
all previous unrelated experiments 
and equations of electricity, mag- 
netism and optics into a consistent theory. (He is also 
known for creating the first true color photograph in 
1861.) 



EXERCISE Confirm that this connection follows from the definition of the vector potential. To
verify that the constraint (1.19) is automatically included in the definition (1.20), compute the
construct $\partial_\lambda F_{\mu\nu} + \partial_\mu F_{\nu\lambda} + \partial_\nu F_{\lambda\mu}$, where $(\lambda\mu\nu)$ represent arbitrary but
different indices. This produces four different terms, identified as the left-hand side of
Eq. (1.19). Evaluation of the same construct on $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$ produces zero
by the symmetry of the right-hand side.
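
As a sketch of how this exercise works out, two representative index choices can be written out explicitly. The block assumes the conventions $\partial_\mu = (\partial_t, \nabla)$ and the tensor entries $F_{01} = E_1$, $F_{02} = E_2$, $F_{12} = -B_3$, $F_{13} = B_2$, $F_{23} = -B_1$ (antisymmetric otherwise); the two remaining index choices, (0,1,3) and (0,2,3), give the other components of the curl equation in the same way.

```latex
\begin{align*}
% (\lambda\mu\nu) = (1,2,3): purely spatial indices
\partial_1 F_{23} + \partial_2 F_{31} + \partial_3 F_{12}
  &= -\partial_1 B_1 - \partial_2 B_2 - \partial_3 B_3
   = -\nabla\cdot\mathbf{B}, \\
% (\lambda\mu\nu) = (0,1,2): one temporal index
\partial_0 F_{12} + \partial_1 F_{20} + \partial_2 F_{01}
  &= -\partial_t B_3 - \partial_1 E_2 + \partial_2 E_1
   = -\left(\nabla\times\mathbf{E} + \partial_t\mathbf{B}\right)_3, \\
% On F = dA the six second derivatives cancel in pairs
\partial_\lambda(\partial_\mu A_\nu - \partial_\nu A_\mu)
  + \partial_\mu(\partial_\nu A_\lambda - \partial_\lambda A_\nu)
  + \partial_\nu(\partial_\lambda A_\mu - \partial_\mu A_\lambda) &= 0 .
\end{align*}
```

The last line uses nothing but the commutativity of partial derivatives, which is why the homogeneous equations hold identically once the fields are expressed through the potential.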
As to the structure of the action
S[A] we can proceed in different
ways. One option would be to regard 
Maxwell’s equations as fundamen- 
tal, i.e. to construct an action that 
produces these equations upon vari- 
ation (by analogy with the situation 
in classical mechanics where the 
action functional was designed so as 
to reproduce Newton’s equations). 

However, we can also be a little bit 
more ambitious and ask whether the structure of the action can be motivated indepen- 
dently of Maxwell’s equations. In fact, there is just one principle in electrodynamics as 
“fundamental” as Maxwell’s equations: symmetry. A theory of electromagnetism must be 
Lorentz invariant, i.e. invariant under relativistic coordinate transformations. 


Hendrik Antoon Lorentz 1853- 
1928 

1902 Nobel Laureate in Physics 
(with Pieter Zeeman) in recog- 
nition of the extraordinary ser- 
vice they rendered by their 
researches into the influence of 
magnetism upon radiation phe- 
nomena. Lorentz derived the transformation equa- 
tions subsequently used by Albert Einstein to 
describe space and time. 



INFO Let us briefly recapitulate the notion of Lorentz invariance. Suppose we are given a
4-vector $X_\mu$. A linear coordinate transformation $X_\mu \to X'_\mu \equiv T_{\mu\nu} X_\nu$ is a Lorentz
transformation if it leaves the 4-metric

    $g = \{g^{\mu\nu}\} = \begin{pmatrix} 1 & & & \\ & -1 & & \\ & & -1 & \\ & & & -1 \end{pmatrix}$    (1.21)

invariant: $T^T g T = g$. To concisely formulate the invariance properties of relativistic theories, it
is common to introduce the notion of raised and lowered indices. Defining $X^\mu \equiv g^{\mu\nu} X_\nu$, Lorentz
invariance is expressed as $X^\mu X_\mu = X'^\mu X'_\mu$.
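
The defining condition can be checked numerically for a concrete transformation. The following minimal sketch, which is an illustration and not part of the original text, builds a boost along the x-axis (the velocity 0.6 and the sample 4-vector are arbitrary choices) and verifies both $T^T g T = g$ and the invariance of $X^\mu X_\mu$.

```python
import math

# Metric g = diag(1, -1, -1, -1) of Eq. (1.21).
G = [[1.0, 0.0, 0.0, 0.0],
     [0.0, -1.0, 0.0, 0.0],
     [0.0, 0.0, -1.0, 0.0],
     [0.0, 0.0, 0.0, -1.0]]


def matmul(a, b):
    """Product of two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def transpose(a):
    return [list(row) for row in zip(*a)]


def boost_x(v):
    """Boost along the x-axis with velocity v, in units where c = 1."""
    gamma = 1.0 / math.sqrt(1.0 - v * v)
    return [[gamma, -gamma * v, 0.0, 0.0],
            [-gamma * v, gamma, 0.0, 0.0],
            [0.0, 0.0, 1.0, 0.0],
            [0.0, 0.0, 0.0, 1.0]]


def metric_defect(t):
    """Largest entry of |T^T g T - g|; zero for a Lorentz transformation."""
    tgt = matmul(matmul(transpose(t), G), t)
    return max(abs(tgt[i][j] - G[i][j]) for i in range(4) for j in range(4))


def invariant(x):
    """The scalar X^mu X_mu, computed as sum over mu, nu of X_mu g^{mu nu} X_nu."""
    return sum(x[m] * G[m][n] * x[n] for m in range(4) for n in range(4))


T = boost_x(0.6)                 # arbitrary sample velocity
X = [2.0, 1.0, -0.5, 3.0]        # arbitrary sample 4-vector
X_prime = [sum(T[m][n] * X[n] for n in range(4)) for m in range(4)]

print("defect of T^T g T = g:", metric_defect(T))
print("X.X before and after :", invariant(X), invariant(X_prime))
```

A generic linear map, for instance a uniform rescaling of all four coordinates, fails the same test, which is what singles out the Lorentz transformations.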


Aided by the symmetry criterion, we can attempt to conjecture the structure of the action 
from three basic assumptions, all independent of Maxwell’s equations: the action should 
be invariant under (a) Lorentz transformations and (b) gauge transformations, and (c) it 
should be simple! The most elementary choice compatible with these conditions is 

    $S[A] = \int d^4x \left( c_1 F_{\mu\nu} F^{\mu\nu} + c_2 A_\mu j^\mu \right),$    (1.22)

where $d^4x = \prod_\mu dx_\mu = dt\, dx_1\, dx_2\, dx_3$ denotes the measure, $j_\mu = (\rho, -\mathbf{j})$ the 4-current,
and $c_{1,2}$ are undetermined constants. Up to quadratic order in A, Eq. (1.22) in fact defines
the only possible structure consistent with gauge and Lorentz invariance.

EXERCISE Using the continuity equation $\partial_\mu j^\mu = 0$, verify that the $A_\mu j^\mu$-coupling is gauge
invariant. (Hint: Integrate by parts.) Verify that a contribution like $\int A_\mu A^\mu$ would not be
gauge invariant.
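
The steps of this exercise can be sketched as follows, under the gauge transformation $A_\mu \to A_\mu + \partial_\mu \Gamma$ and on the assumption (not stated explicitly in the text) that $\Gamma$ or the current vanishes at the boundary of the integration domain, so that surface terms may be dropped.

```latex
\begin{align*}
% The field tensor itself is gauge invariant
F_{\mu\nu} &\to F_{\mu\nu} + \partial_\mu\partial_\nu\Gamma - \partial_\nu\partial_\mu\Gamma = F_{\mu\nu}, \\
% Coupling term: integrate by parts, then use the continuity equation
\int d^4x\, A_\mu j^\mu &\to \int d^4x\, A_\mu j^\mu + \int d^4x\, (\partial_\mu\Gamma)\, j^\mu
   = \int d^4x\, A_\mu j^\mu - \int d^4x\, \Gamma\, \partial_\mu j^\mu
   = \int d^4x\, A_\mu j^\mu, \\
% A mass-like term picks up contributions that do not vanish for general \Gamma
\int d^4x\, A_\mu A^\mu &\to \int d^4x \left( A_\mu A^\mu + 2 A^\mu \partial_\mu\Gamma
   + \partial_\mu\Gamma\, \partial^\mu\Gamma \right).
\end{align*}
```

The extra terms in the last line have no reason to cancel for an arbitrary function $\Gamma$, which is why such a contribution is excluded from the action.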
Having defined a trial action, we can apply the variational principle Eq. (1.17) to compute
equations of motion. In the present context, the role of the field $\phi$ is taken by the four
components of A. Variation of the action with respect to $A_\mu$ gives four equations of motion,

    $\frac{\partial \mathcal{L}}{\partial A_\mu} - \partial_\nu \frac{\partial \mathcal{L}}{\partial(\partial_\nu A_\mu)} = 0, \qquad \mu = 0, \dots, 3,$    (1.23)

where the Lagrangian density is defined by $S = \int d^4x\, \mathcal{L}$. With the specific form of $\mathcal{L}$, it is
straightforward to verify that $\partial_{A_\mu} \mathcal{L} = c_2 j^\mu$ and $\partial_{\partial_\nu A_\mu} \mathcal{L} = -4 c_1 F^{\mu\nu}$. Substitution of these
building blocks into the equations of motion finally yields $4 c_1 \partial^\nu F_{\nu\mu} = c_2 j_\mu$. Comparing
this with the definition of the field tensor (1.20), and setting $c_1/c_2 = 1/4$, we arrive at
Maxwell’s equations (1.18). We finally fix the overall multiplicative constant $c_1$ ($= c_2/4$) by
requiring that the Hamiltonian density associated with the Lagrangian density $\mathcal{L}$ reproduce
the known energy density of the EM field (see Problem 1.8). This leads to $c_1 = -1/4$, so
that we have identified

    $\mathcal{L}(A_\mu, \partial_\nu A_\mu) = -\frac{1}{4} F_{\mu\nu} F^{\mu\nu} + A_\mu j^\mu,$    (1.24)

as the Lagrangian density of the electromagnetic field. The corresponding action is
given by $S[A] = \int d^4x\, \mathcal{L}(A_\mu, \partial_\nu A_\mu)$.
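
To make the comparison with Maxwell's equations explicit, the following sketch writes out the components of the equation of motion once $c_1/c_2 = 1/4$ has been set. It assumes the conventions $\partial_\mu = (\partial_t, \nabla)$ and $j_\mu = (\rho, -\mathbf{j})$ used in the text, together with $\partial^\mu = g^{\mu\nu}\partial_\nu = (\partial_t, -\nabla)$, which follows from the metric (1.21). The identification with Eq. (1.18) presumes that this equation has the standard vacuum form.

```latex
\begin{align*}
% Equation of motion with c_1/c_2 = 1/4
\partial^\nu F_{\nu\mu} &= j_\mu, \\
% \mu = 0: F_{i0} = -E_i and \partial^i = -\partial_i
\partial^\nu F_{\nu 0} = -\partial_i F_{i0} = \nabla\cdot\mathbf{E} &= \rho, \\
% \mu = 1: F_{01} = E_1, F_{21} = B_3, F_{31} = -B_2, and j_1 = -(\mathbf{j})_1
\partial^\nu F_{\nu 1} = \partial_t E_1 - \partial_2 B_3 + \partial_3 B_2
   = \partial_t E_1 - (\nabla\times\mathbf{B})_1 &= -(\mathbf{j})_1 .
\end{align*}
```

The components $\mu = 2, 3$ follow by cyclic relabeling, so that together the four equations read $\nabla\cdot\mathbf{E} = \rho$ and $\nabla\times\mathbf{B} - \partial_t\mathbf{E} = \mathbf{j}$.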

At first sight, this result does not look particularly surprising. After all, Maxwell’s equa- 
tions can be found on the first page of most textbooks on electrodynamics. However, further 
reflection will show that our achievement is actually quite remarkable. By invoking only 
symmetry, the algebraic structure of Maxwell’s equations has been established unambigu- 
ously. We have thus proven that Maxwell’s equations are relativistically invariant, a fact 
not obvious from the equations themselves. Further, we have shown that Eq. (1.18) are the 
only equations of motion linear in the current-density distribution and consistent with the 
invariance principle. One might object that, in addition to symmetry, we have also imposed 
an ad hoc “simplicity” criterion on the action S[A]. However, later we will see that this was
motivated by more than mere aesthetic principles. 

Finally, we note that the symmetry-oriented modeling that led to Eq. (1.22) is illustrative 
of a popular construction scheme in modern field theory. The symmetry-oriented approach 
stands complementary to the “microscopic” formulation exemplified in Section 1.1. Crudely 
speaking, these are the two principal approaches to constructing effective low-energy field 
theories: 


> Microscopic analysis: Starting from a microscopically defined system, one projects 
onto those degrees of freedom that one believes are relevant for the low-energy dynam- 
ics. Ideally, this “belief” is backed up by a small expansion parameter stabilizing the 
mathematical parts of the analysis. Advantages: The method is rigorous and fixes the 
resulting field theory completely. Disadvantages: The method is time-consuming and, for 
sufficiently complex systems, not even viable. 

> Symmetry considerations: One infers an effective low-energy theory on the basis of 
only fundamental symmetries of the physical system. Advantages: The method is fast and 
elegant. Disadvantages: It is less explicit than the microscopic approach. Most importantly,
it does not fix the coefficients of the different contributions to the action.

Thus far, we have introduced some basic concepts of field theoretical modeling in condensed 
matter physics. Starting from a microscopic model Hamiltonian, we have illustrated how 
principles of universality and symmetry can be applied to distill effective continuum field 
theories capturing the low-energy content of the system. We have formulated such theories in 
the language of Lagrangian and Hamiltonian continuum mechanics, respectively, and shown 
how variational principles can be applied to extract concrete physical information. Finally, 
we have seen that field theory provides a unifying framework whereby analogies between 
seemingly different physical systems can be uncovered. In the next section we discuss how 
the formalism of classical field theory can be elevated to the quantum level. 


1.4 Quantum chain 


Earlier we saw that, at low temperatures, the excitation profile of the classical atomic chain 
differs drastically from that observed in experiment. Generally, in condensed matter physics, 
low-energy phenomena with pronounced temperature sensitivity are indicative of a quantum 
mechanism at work. To introduce and exemplify a general procedure whereby quantum 
mechanics can be incorporated into continuum models, we next consider the low-energy 
physics of the quantum mechanical atomic chain. 

The first question to ask is conceptual: how can a model like Eq. (1.4) be quantized 
in general? Indeed, there exists a standard procedure for quantizing continuum theories, 
which closely resembles the quantization of Hamiltonian point mechanics. Consider the 
defining Eq. (1.9) and (1.10) for the canonical momentum and the Hamiltonian, respectively. 
Classically, the momentum $\pi(x)$ and the coordinate $\phi(x)$ are canonically conjugate variables:
$\{\pi(x), \phi(x')\} = -\delta(x - x')$, where $\{\,,\}$ is the Poisson bracket and the $\delta$-function arises through
continuum generalization of the discrete identity $\{P_I, R_{I'}\} = -\delta_{II'}$, $I, I' = 1, \dots, N$. The
theory is quantized by generalization of the canonical quantization procedure for the discrete
pair of conjugate coordinates $(R_I, P_I)$ to the continuum: (i) promote $\phi(x)$ and $\pi(x)$ to
operators, $\phi \mapsto \hat\phi$, $\pi \mapsto \hat\pi$, and (ii) generalize the canonical commutation relation
$[\hat P_I, \hat R_{I'}] = -i\hbar\,\delta_{II'}$ to 12

    $[\hat\pi(x), \hat\phi(x')] = -i\hbar\,\delta(x - x').$    (1.25)

Operator-valued functions like $\hat\phi$ and $\hat\pi$ are generally referred to as quantum fields. For
clarity, the relevant relations between canonically conjugate classical and quantum fields 
are summarized in Table 1.2. 


INFO By introducing quantum fields, we have departed from the conceptual framework laid 
out on page 6: being operator-valued, the quantized field no longer represents a mapping into an 


12 Note that the dimensionality of both the quantum and the classical continuum field is compatible with the
dimensionality of the Dirac $\delta$-function, $[\delta(x - x')] = [\text{length}]^{-1}$, i.e. $[\phi(x)] = [\phi_I] \cdot [\text{length}]^{-1/2}$ and similarly
for $\pi$.
Table 1.2 Relations between discrete and continuum canonically conjugate variables/operators.

             Classical                                   Quantum
Discrete     $\{P_I, R_{I'}\} = -\delta_{II'}$                   $[\hat P_I, \hat R_{I'}] = -i\hbar\,\delta_{II'}$
Continuum    $\{\pi(x), \phi(x')\} = -\delta(x - x')$            $[\hat\pi(x), \hat\phi(x')] = -i\hbar\,\delta(x - x')$


ordinary differentiable manifold. 13 It is thus legitimate to ask why we bothered to give a lengthy 
exposition of fields as “ordinary” functions. The reason is that, in the not too distant future, 
after the framework of functional field integration has been introduced, we will return to the 
comfortable ground of the definition of page 6. 


Employing these definitions, the classical Hamiltonian density (1.10) becomes the quantum 
operator 

    $\hat{\mathcal{H}}[\hat\phi, \hat\pi] = \frac{1}{2m}\hat\pi^2 + \frac{k_s a^2}{2}(\partial_x \hat\phi)^2.$    (1.26)

The Hamiltonian above represents a quantum field theoretical formulation of the problem 
but not yet a solution. In fact, the development of a spectrum of methods for the analysis of 
quantum field theoretical models will represent a major part of this text. At this point the 
objective is merely to exemplify the way physical information can be extracted from models 
like Eq. (1.26). As a word of caution, let us mention that the following manipulations, while 
mathematically straightforward, are conceptually deep. To disentangle different aspects of 
the problem, we will first concentrate on plain operational aspects. Later, in Section 1.4, 
we will reflect on “what has really happened.” 

As with any function, operator-valued functions can be represented in a variety of different 
ways. In particular, they can be subjected to Fourier transformation, 


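    $\hat\phi_k \equiv \frac{1}{L^{1/2}} \int_0^L dx\, e^{-ikx}\, \hat\phi(x), \qquad \hat\phi(x) = \frac{1}{L^{1/2}} \sum_k e^{+ikx}\, \hat\phi_k,$
    $\hat\pi_k \equiv \frac{1}{L^{1/2}} \int_0^L dx\, e^{+ikx}\, \hat\pi(x), \qquad \hat\pi(x) = \frac{1}{L^{1/2}} \sum_k e^{-ikx}\, \hat\pi_k,$    (1.27)

where $\sum_k$ represents the sum over all Fourier coefficients indexed by quantized momenta
$k = 2\pi m/L$, $m \in \mathbb{Z}$ (not to be confused with the “operator momentum” $\hat\pi$!). Note that
the real classical field $\phi(x)$ quantizes to a Hermitian quantum field $\hat\phi(x)$, implying that
$\hat\phi_k = \hat\phi_{-k}^\dagger$ (and similarly for $\hat\pi_k$). The corresponding Fourier representation of the canonical
commutation relations reads (exercise)

    $[\hat\pi_k, \hat\phi_{k'}] = -i\hbar\,\delta_{kk'}.$    (1.28)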


13 At least if we ignore the mathematical subtlety that a linear operator can also be interpreted as an element of
a certain manifold.
When expressed in the Fourier representation, making use of the identity

    $\int dx\, (\partial_x \hat\phi)^2 = \sum_{k,k'} (ik\hat\phi_k)(ik'\hat\phi_{k'}) \underbrace{\frac{1}{L}\int dx\, e^{i(k+k')x}}_{\delta_{k+k',0}} = \sum_k k^2\, \hat\phi_k \hat\phi_{-k} = \sum_k k^2 |\hat\phi_k|^2,$

together with a similar relation for $\int dx\, \hat\pi^2$, the Hamiltonian $\hat H = \int dx\, \hat{\mathcal{H}}$ assumes
the near diagonal form

    $\hat H = \sum_k \left[ \frac{1}{2m} \hat\pi_k \hat\pi_{-k} + \frac{m\omega_k^2}{2} \hat\phi_k \hat\phi_{-k} \right],$    (1.29)

where $\omega_k = v|k|$ and $v = a\sqrt{k_s/m}$ denotes the classical sound wave
velocity. In this form, the Hamiltonian can be identified as nothing but
a superposition of independent harmonic oscillators. 14 This result is actually not difficult
to understand (see figure): Classically, the system supports a discrete set of wave excitations,
each indexed by a wave number $k = 2\pi m/L$. (In fact, we could have performed a Fourier
transformation of the classical fields $\phi(x)$ and $\pi(x)$ to represent the Hamiltonian function
as a superposition of classical harmonic oscillators.) Within the quantum picture, each of
these excitations is described by an oscillator Hamiltonian operator with a k-dependent
frequency. However, it is important not to confuse the atomic constituents, also oscillators 
(albeit coupled), with the independent collective oscillator modes described by H. 
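
The Fourier identity underlying this diagonalization can be checked numerically for a classical real field. The sketch below is an illustration only: it assumes the convention $\phi(x) = L^{-1/2}\sum_k e^{ikx}\phi_k$ with $\phi_{-k} = \phi_k^*$, uses arbitrary sample coefficients and ring length, and compares $\int_0^L dx\,(\partial_x\phi)^2$ (evaluated as a Riemann sum, which is exact for a finite trigonometric sum) with $\sum_k k^2|\phi_k|^2$.

```python
import cmath
import math

L = 2.0  # ring circumference (arbitrary sample value)

# Sample Fourier coefficients phi_k, labelled by the integer m in k = 2*pi*m/L.
coeffs = {1: 0.3 + 0.1j, 2: -0.2 + 0.4j, 3: 0.05 - 0.25j}
# Reality of phi(x) requires phi_{-k} = conj(phi_k).
coeffs.update({-m: c.conjugate() for m, c in coeffs.items()})


def wave_number(m):
    return 2.0 * math.pi * m / L


def dphi(x):
    """Derivative of phi(x) = L^{-1/2} sum_k exp(ikx) phi_k."""
    total = sum(1j * wave_number(m) * c * cmath.exp(1j * wave_number(m) * x)
                for m, c in coeffs.items())
    return total / math.sqrt(L)


N = 64                      # grid points; enough to resolve all products of modes
dx = L / N
samples = [dphi(n * dx) for n in range(N)]

real_space = sum(s.real ** 2 for s in samples) * dx          # integral of (d_x phi)^2
fourier_space = sum(wave_number(m) ** 2 * abs(c) ** 2 for m, c in coeffs.items())
largest_imaginary_part = max(abs(s.imag) for s in samples)   # should vanish

print(real_space, fourier_space, largest_imaginary_part)
```

The two sums agree to machine precision, and the imaginary part of the derivative vanishes, confirming that the constraint on the coefficients indeed describes a real field.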

The description above, albeit perfectly valid, still suffers from a deficiency: our analy- 
sis amounts to explicitly describing the effective low-energy excitations of the system (the 
waves) in terms of their microscopic constituents (the atoms). Indeed the different con- 
tributions to H keep track of details of the microscopic oscillator dynamics of individual 
k-modes. However, it would be much more desirable to develop a picture where the relevant
excitations of the system, the waves, appear as fundamental units, without explicit account 
of underlying microscopic details. (As with hydrodynamics, information is encoded in terms 
of collective density variables rather than through individual molecules.) As preparation for 
the construction of this improved formulation of the system, let us temporarily focus on a 
single oscillator mode. 



Revision of the quantum harmonic oscillator 
Consider a standard harmonic oscillator (HO) Hamiltonian 


    $\hat H = \frac{\hat p^2}{2m} + \frac{m\omega^2}{2}\hat x^2.$


14 The only difference between Eq. (1.29) and the canonical form of an oscillator Hamiltonian $\hat H = \hat p^2/(2m) + m\omega^2 \hat x^2/2$
is the presence of the sub-indices $k$ and $-k$ (a consequence of $\hat\phi_k^\dagger = \hat\phi_{-k}$). As we will show shortly,
this difference is inessential.
Figure 1.6 Low-lying energy levels/states of the harmonic oscillator.


The first few energy levels $\epsilon_n = \omega(n + \frac{1}{2})$ and the associated Hermite polynomial
eigenfunctions are displayed schematically in Fig. 1.6. (To simplify the notation we henceforth
set $\hbar = 1$.)

The HO has, of course, the status of a single-particle problem. However, the equidistance
of its energy levels suggests an alternative interpretation. One can think of a given energy
state $\epsilon_n$ as an accumulation of n elementary entities, or quasi-particles, each having energy
$\omega$. What can be said about the features of these new objects? First, they are structureless,
i.e. the only “quantum number” identifying the quasi-particles is their energy $\omega$ (otherwise
n-particle states formed of the quasi-particles would not be equidistant). This implies that
the quasi-particles must be bosons. (The same state $\omega$ can be occupied by more than one
particle, see Fig. 1.6.)

This idea can be formulated in quantitative terms by employing the formalism of ladder
operators in which the operators $\hat p$ and $\hat x$ are traded for the pair of Hermitian adjoint
operators $\hat a \equiv \sqrt{\frac{m\omega}{2}}\,(\hat x + \frac{i}{m\omega}\hat p)$, $\hat a^\dagger \equiv \sqrt{\frac{m\omega}{2}}\,(\hat x - \frac{i}{m\omega}\hat p)$. Up to a factor of i, the
transformation $(\hat x, \hat p) \to (\hat a, \hat a^\dagger)$ is canonical, i.e. the new operators obey the canonical
commutation relation

    $[\hat a, \hat a^\dagger] = 1.$    (1.30)

More importantly, the a-representation of the Hamiltonian is very simple, namely

    $\hat H = \omega\left(\hat a^\dagger \hat a + \frac{1}{2}\right),$    (1.31)

as can be checked by direct substitution. Suppose, now, we had been given a zero eigenvalue
state $|0\rangle$ of the operator $\hat a$: $\hat a|0\rangle = 0$. As a direct consequence, $\hat H|0\rangle = (\omega/2)|0\rangle$, i.e. $|0\rangle$ is
identified as the ground state of the oscillator. 15 The complete hierarchy of higher energy
states can now be generated by setting $|n\rangle \equiv (n!)^{-1/2}(\hat a^\dagger)^n|0\rangle$.

EXERCISE Using the canonical commutation relation, verify that $\hat H|n\rangle = \omega(n + 1/2)|n\rangle$ and
$\langle n|n\rangle = 1$.
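
A quick numerical cross-check of these statements is possible in a truncated occupation-number basis. The sketch below is illustrative only and uses the standard matrix elements $a|n\rangle = \sqrt{n}\,|n-1\rangle$, which follow from the definition of $|n\rangle$; the basis size and the frequency are arbitrary choices. Because the basis is cut off, the commutator equals 1 on every diagonal entry except the last one, which is a truncation artifact.

```python
import math


def matmul(x, y):
    """Product of two square matrices stored as nested lists."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def ladder_matrices(dim):
    """Matrices of a and a^dagger in the truncated basis |0>, ..., |dim-1>."""
    a = [[0.0] * dim for _ in range(dim)]
    for n in range(1, dim):
        a[n - 1][n] = math.sqrt(n)          # a|n> = sqrt(n) |n-1>
    a_dag = [list(row) for row in zip(*a)]  # real matrix: adjoint = transpose
    return a, a_dag


dim, omega = 6, 2.0                         # arbitrary sample values
a, a_dag = ladder_matrices(dim)

number = matmul(a_dag, a)                   # a^dagger a, the number operator
aa_dag = matmul(a, a_dag)
commutator = [[aa_dag[i][j] - number[i][j] for j in range(dim)]
              for i in range(dim)]

# H = omega (a^dagger a + 1/2) is diagonal in this basis.
energies = [omega * (number[n][n] + 0.5) for n in range(dim)]

print("diagonal of [a, a^dagger]:", [round(commutator[n][n], 12) for n in range(dim)])
print("energies:", [round(e, 12) for e in energies])
```

The printed energies reproduce the equidistant ladder $\omega(n + 1/2)$, and the number operator comes out diagonal with entries $0, 1, 2, \dots$, in line with the exercise.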

Formally, the construction above represents yet another way of constructing eigenstates of 
the quantum HO. However, its “real” advantage is that it naturally affords a many-particle 
interpretation. To this end, let us declare |0) to represent a “vacuum” state, i.e. a state with 
zero particles present. Next, imagine that $\hat a^\dagger|0\rangle$ is a state with a single featureless particle


15 This can be verified by explicit construction. Switching to a real-space representation, the solution of the
equation $[x + \partial_x/(m\omega)]\langle x|0\rangle = 0$ obtains the familiar ground state wavefunction $\langle x|0\rangle = (m\omega/\pi)^{1/4} e^{-m\omega x^2/2}$.
Figure 1.7 Diagram visualizing an excited state of the chain. Here, the number of quasi-particles
decreases with increasing energy $\omega_k$.

(the operator $\hat a^\dagger$ does not carry any quantum number labels) of energy $\omega$. Similarly, $(\hat a^\dagger)^n|0\rangle$
is considered as a many-body state with n particles, i.e. within the new picture, $\hat a^\dagger$ is an
operator that creates particles. The total energy of these states is given by $\omega \times$ (occupation
number). Indeed, it is straightforward to verify (see exercise above) that $\hat a^\dagger \hat a|n\rangle = n|n\rangle$, i.e.
the Hamiltonian basically counts the number of particles. While, at first sight, this may
look unfamiliar, the new interpretation is internally consistent. Moreover, it achieves what 
we had asked for above, i.e. it allows an interpretation of the HO states as a superposition 
of independent structureless entities. 

INFO The representation above illustrates the capacity to think about individual quantum 
problems in complementary pictures. This principle finds innumerable applications in modern 
condensed matter physics. The existence of different interpretations of a given system is by no 
means heretical but, rather, reflects a principle of quantum mechanics: there is no “absolute” 
system that underpins the phenomenology. The only thing that matters is observable phenomena. 
For example, we will see later that the “fictitious” quasi-particle states of oscillator systems 
behave as “real” particles, i.e. they have dynamics, can interact, be detected experimentally, etc. 
From a quantum point of view these objects are, then, real particles.


Quasi-particle interpretation of the quantum chain 

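Returning to the oscillator chain, one can transform the Hamiltonian (1.29) to a form
analogous to (1.31) by defining the ladder operators 16

    $\hat a_k \equiv \sqrt{\frac{m\omega_k}{2}}\left(\hat\phi_k + \frac{i}{m\omega_k}\hat\pi_{-k}\right), \qquad \hat a_k^\dagger \equiv \sqrt{\frac{m\omega_k}{2}}\left(\hat\phi_{-k} - \frac{i}{m\omega_k}\hat\pi_k\right).$    (1.32)

With this definition, applying the commutation relations Eq. (1.28), one finds that the
ladder operators obey commutation relations generalizing Eq. (1.30):

    $[\hat a_k, \hat a_{k'}^\dagger] = \delta_{kk'}, \qquad [\hat a_k, \hat a_{k'}] = [\hat a_k^\dagger, \hat a_{k'}^\dagger] = 0.$    (1.33)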


16 As to the consistency of these definitions, recall that $\hat\phi_k^\dagger = \hat\phi_{-k}$ and $\hat\pi_k^\dagger = \hat\pi_{-k}$. Under these conditions the
second of the definitions in the text follows from the first upon taking the Hermitian adjoint.
Figure 1.8 Phonon spectra of the transition metal oxide Sr2RuO4 measured along different axes in
momentum space. Notice the approximate linearity of the low-energy branches (acoustic phonons)
at small momenta. Superimposed at high frequencies are various branches of optical phonons.
(Source: Courtesy of M. Braden, II. Physikalisches Institut, Universität zu Köln.)


Expressing the operators $(\hat\phi_k, \hat\pi_k)$ in terms of $(\hat a_k, \hat a_k^\dagger)$, it is now straightforward to bring the
Hamiltonian into the quasi-particle oscillator form (exercise)

    $\hat H = \sum_k \omega_k \left(\hat a_k^\dagger \hat a_k + \frac{1}{2}\right).$    (1.34)

Equations (1.34) and (1.33) represent the final result of our analysis. The Hamiltonian $\hat H$
takes the form of a sum of harmonic oscillators with characteristic frequencies $\omega_k$. In the
limit $k \to 0$ (i.e. long wavelength), one finds $\omega_k \to 0$; excitations with this property are said
to be massless.
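
The exercise leading to the oscillator form can be sketched as follows. The block assumes ladder operators of the form $\hat a_k = \sqrt{m\omega_k/2}\,(\hat\phi_k + \frac{i}{m\omega_k}\hat\pi_{-k})$ with $\hat a_k^\dagger$ its adjoint, $\hbar = 1$, $\omega_k = \omega_{-k}$, and $k \neq 0$ (for $\omega_k = 0$ the transformation is singular).

```latex
\begin{align*}
% Inverting the definitions of the ladder operators
\hat\phi_k &= \frac{1}{\sqrt{2m\omega_k}}\left(\hat a_k + \hat a_{-k}^\dagger\right), \qquad
\hat\pi_k = -i\sqrt{\frac{m\omega_k}{2}}\left(\hat a_{-k} - \hat a_k^\dagger\right), \\
% The two contributions of a single k-term of the Hamiltonian
\frac{1}{2m}\hat\pi_k\hat\pi_{-k} &= -\frac{\omega_k}{4}
   \left(\hat a_{-k} - \hat a_k^\dagger\right)\left(\hat a_k - \hat a_{-k}^\dagger\right), \qquad
\frac{m\omega_k^2}{2}\hat\phi_k\hat\phi_{-k} = \frac{\omega_k}{4}
   \left(\hat a_k + \hat a_{-k}^\dagger\right)\left(\hat a_{-k} + \hat a_k^\dagger\right), \\
% Terms with two creators or two annihilators cancel since they commute
\frac{1}{2m}\hat\pi_k\hat\pi_{-k} + \frac{m\omega_k^2}{2}\hat\phi_k\hat\phi_{-k}
  &= \frac{\omega_k}{4}\left(\hat a_k\hat a_k^\dagger + \hat a_k^\dagger\hat a_k
     + \hat a_{-k}\hat a_{-k}^\dagger + \hat a_{-k}^\dagger\hat a_{-k}\right), \\
% Summing over k and using [a_k, a_k^\dagger] = 1
\hat H &= \sum_k \frac{\omega_k}{2}\left(\hat a_k\hat a_k^\dagger + \hat a_k^\dagger\hat a_k\right)
   = \sum_k \omega_k\left(\hat a_k^\dagger\hat a_k + \frac{1}{2}\right).
\end{align*}
```

In the last step the contributions of $k$ and $-k$ have been combined under the sum, which is legitimate because the frequencies depend only on $|k|$.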

An excited state of the system is indexed by a set $\{n_k\} = (n_1, n_2, \dots)$ of quasi-particles
with energy $\{\omega_k\}$ (see Fig. 1.7). Physically, the quasi-particles of the harmonic chain are
identified with the phonon modes of the solid. A comparison with measured phonon
spectra (Fig. 1.8) reveals that, at low momenta, $\omega_k \sim |k|$ in agreement with our simplistic
model (even in spite of the fact that the spectrum was recorded for a three-dimensional 
solid with non-trivial unit cell - universality!). While the linear dispersion was already a 
feature of the classical sound wave spectrum, the low-temperature specific heat reflected 
non-classical behavior. It is left as an exercise (problem 1.8) to verify that the quantum 
nature of the phonons resolves the problem with the low-temperature specific heat discussed 
in Section 1.1. (For further discussion of phonon modes in atomic lattices we refer to Chapter 
2 of the text by Kittel. 17 ) 
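
As a rough numerical illustration of how quantization affects the specific heat, one may sum the standard heat capacity of independent bosonic modes, $c(\omega, T) = (\omega/T)^2 e^{\omega/T}/(e^{\omega/T} - 1)^2$ in units $k_B = \hbar = 1$, over the modes of the chain. The sketch below is not the solution intended by the problem set; the chain length, lattice spacing, sound velocity and the crude mode cutoff are assumptions made only for this example.

```python
import math


def mode_heat_capacity(omega, temperature):
    """Heat capacity of one bosonic mode of frequency omega (k_B = hbar = 1)."""
    x = omega / temperature
    # x^2 e^x / (e^x - 1)^2, rewritten in terms of e^{-x} to avoid overflow.
    return x * x * math.exp(-x) / math.expm1(-x) ** 2


def chain_heat_capacity(temperature, n_sites=2000, spacing=1.0, velocity=1.0):
    """Sum over modes k = 2*pi*m/L with linear dispersion omega_k = v|k|.

    The modes m = +-1, ..., +-n_sites/2 are included; this cutoff is a crude
    stand-in for the finite number of degrees of freedom of the lattice.
    """
    length = n_sites * spacing
    total = 0.0
    for m in range(1, n_sites // 2 + 1):
        omega_k = velocity * 2.0 * math.pi * m / length
        total += 2.0 * mode_heat_capacity(omega_k, temperature)  # modes +k and -k
    return total


for temperature in (0.05, 0.1, 0.2, 1000.0):
    print(temperature, chain_heat_capacity(temperature))
```

At temperatures far above the largest mode frequency every mode contributes one unit, which is the classical temperature-independent value. At low temperatures the high-frequency modes freeze out and, for this one-dimensional model with linear dispersion, the total grows approximately linearly with T.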


1.5 Quantum electrodynamics 

The generality of the procedure outlined above suggests that the quantization of the EM 
field Eq. (1.24) proceeds in a manner analogous to the phonon system. However, there are 
a number of practical differences that make quantization of the EM field a harder (but 
also more interesting!) enterprise. Firstly, the vectorial character of the vector potential, in 
combination with the condition of relativistic covariance, gives the problem a non-trivial 
internal geometry. Closely related, the gauge freedom of the vector potential introduces 
redundant degrees of freedom whose removal on the quantum level is not easily achieved. For 


17 C. Kittel, Quantum Theory of Solids, 2nd edition (Wiley, 1987).
example, quantization in a setting where only physical degrees of freedom are kept - i.e. the
two polarization directions of the transverse photon field - is technically cumbersome, the 
reason being that the relevant gauge condition is not relativistically covariant. In contrast, a 
manifestly covariant scheme, while technically more convenient, introduces spurious “ghost 
degrees of freedom” that are difficult to remove. To circumvent a discussion of these issues, 
we will not discuss the problem of EM field quantization in full detail. 18 On the other hand, 
the photon field plays a much too important role in condensed matter physics for us to 
drop the problem altogether. We will therefore aim at an intermediate exposition, largely 
insensitive to the problems outlined above but sufficiently general to illustrate the main 
principles. 


Field quantization 

Consider the Lagrangian of the matter-free EM field, $L = -\frac{1}{4}\int d^3x\, F_{\mu\nu} F^{\mu\nu}$. As a first step
towards quantization of this system we fix a gauge. In the absence of charge, a particularly
convenient choice is the Coulomb gauge, $\nabla \cdot \mathbf{A} = 0$, with the scalar component $\phi = 0$.
(Keep in mind that, once a gauge has been set, we cannot expect further results to
display “gauge invariance.”) Using the gauge conditions, one may verify that the Lagrangian
assumes the form

    $L = \frac{1}{2}\int d^3x \left[ (\partial_t \mathbf{A})^2 - (\nabla \times \mathbf{A})^2 \right].$    (1.35)

By analogy with our discussion of the atomic chain, we would now proceed to “decouple” 
the theory by expanding the action in terms of eigenfunctions of the Laplace operator. 
The difference with our previous discussion is that we are dealing (i) with the full three- 
dimensional Laplacian (instead of a simple second derivative) acting on (ii) the vectorial 
quantity A that is (iii) subject to the constraint $\nabla \cdot \mathbf{A} = 0$. It is these aspects which lead
to the complications outlined above. 

We can circumvent these difficulties by considering cases where the geometry of the system 
reduces the complexity of the eigenvalue problem. This restriction is less artificial than it 
might appear. For example, in anisotropic electromagnetic waveguides, the solutions of the 
eigenvalue equation can be formulated as 19 

    $-\nabla^2 \mathbf{R}_k(\mathbf{x}) = \lambda_k \mathbf{R}_k(\mathbf{x}),$    (1.36)

where $k \in \mathbb{R}$ is a one-dimensional index and the vector-valued functions $\mathbf{R}_k$ are real and
orthonormalized, $\int \mathbf{R}_k \cdot \mathbf{R}_{k'} = \delta_{kk'}$. How the eigenvalues $\lambda_k$ vary with k depends on
details of the geometry (see Eq. (1.38) below) and need not be specified for the moment.

INFO An electrodynamic waveguide is a quasi-one-dimensional cavity with metallic boundaries
(see Fig. 1.9). The practical advantage of waveguides is that they are good at confining EM


18 Readers interested in learning more about EM field quantization are referred to, e.g., L. H. Ryder, Quantum
Field Theory (Cambridge University Press, 1996).

19 More precisely, one should say that Eq. (1.36) defines the set of eigenfunctions relevant for the low-energy
dynamics of the waveguide. More-complex eigenfunctions of the Laplace operator exist but they carry much
higher energy.
Figure 1.9 EM waveguide with rectangular cross-section. The structure of the eigenmodes of the
EM field is determined by boundary conditions at the walls of the cavity. 


waves. At large frequencies, where the wavelengths are of order meters or less, radiation loss in 
conventional conductors is high. In these frequency domains, hollow conductors provide the only 
practical way of transmitting radiation. 

EM field propagation inside a waveguide is constrained by boundary conditions. Assuming 
the walls of the system to be perfectly conducting, 

    $E_\parallel(\mathbf{x}_b) = 0, \qquad B_\perp(\mathbf{x}_b) = 0,$    (1.37)

where $\mathbf{x}_b$ is a point at the system boundary and $E_\parallel$ ($B_\perp$) is the parallel (perpendicular)
component of the electric (magnetic) field.

For concreteness, returning to the problem of field quantization, let us consider a cavity with
uniform rectangular cross-section $L_y \times L_z$. To conveniently represent the Lagrangian of the
system, we wish to express the vector potential in terms of eigenfunctions $\mathbf{R}_k$ that are consistent
with the boundary conditions (1.37). A complete set of functions fulfilling this condition is given
by

    $\mathbf{R}_k = \mathcal{N}_k \begin{pmatrix} c_1 \cos(k_x x) \sin(k_y y) \sin(k_z z) \\ c_2 \sin(k_x x) \cos(k_y y) \sin(k_z z) \\ c_3 \sin(k_x x) \sin(k_y y) \cos(k_z z) \end{pmatrix}.$

Here, $k_i = n_i \pi / L_i$, $n_i \in \mathbb{N}$, $i = x, y, z$, $\mathcal{N}_k$ is a factor normalizing $\mathbf{R}_k$ to unit modulus, and the
coefficients $c_i$ are subject to the condition $c_1 k_x + c_2 k_y + c_3 k_z = 0$. Indeed, it is straightforward
to verify that a general superposition of the type $\mathbf{A}(\mathbf{x}, t) \equiv \sum_k \alpha_k(t) \mathbf{R}_k(\mathbf{x})$, $\alpha_k(t) \in \mathbb{R}$, is
divergenceless, and generates an EM field compatible with (1.37). Substitution of $\mathbf{R}_k$ into Eq. (1.36)
identifies the eigenvalues as $\lambda_k = k_x^2 + k_y^2 + k_z^2$. In the physics and electronic engineering
literature, eigenfunctions of the Laplace operator in a quasi-one-dimensional geometry are commonly
described as modes. As we will see shortly, the energy of a mode (i.e. the Hamiltonian evaluated
on a specific mode configuration) grows with $|\lambda_k|$. In cases where one is interested in the
low-energy dynamics of the EM field, only configurations with small $|\lambda_k|$ are relevant. For example,
let us consider a massively anisotropic waveguide with $L_z < L_y \ll L_x$. In this case the modes
with smallest $|\lambda_k|$ are those with $k_z = 0$, $k_y = \pi/L_y$, and $k_x \equiv k \ll L_y^{-1}$. (Why is it not possible
to set both $k_y$ and $k_z$ to zero?) With this choice,

    $\lambda_k = k^2 + \left(\frac{\pi}{L_y}\right)^2,$    (1.38)

and a scalar index k suffices to label both eigenvalues and eigenfunctions $\mathbf{R}_k$. A caricature of
the spatial structure of the functions $\mathbf{R}_k$ is shown in Fig. 1.9. The dynamical properties of these
configurations will be discussed in the text.
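
The properties claimed for these mode functions can be spot-checked numerically. The following sketch is an illustration under stated assumptions: the cavity dimensions, mode numbers, coefficients and the sample point are hypothetical values, the normalization factor is omitted (it affects none of the checks), and derivatives are approximated by central finite differences. It tests that a mode with $c_1 k_x + c_2 k_y + c_3 k_z = 0$ is divergenceless, that it solves the eigenvalue equation with $\lambda = k_x^2 + k_y^2 + k_z^2$, and that its tangential components vanish on a wall.

```python
import math

# Hypothetical cavity dimensions and mode numbers, chosen only for illustration.
LX, LY, LZ = 5.0, 1.0, 0.5
KX, KY, KZ = (n * math.pi / length for n, length in ((2, LX), (1, LY), (1, LZ)))

# Coefficients obeying c1*kx + c2*ky + c3*kz = 0 (normalization factor omitted).
C1, C2 = 1.0, 0.5
C3 = -(C1 * KX + C2 * KY) / KZ
EIGENVALUE = KX ** 2 + KY ** 2 + KZ ** 2


def mode(x, y, z):
    """Unnormalized mode function R_k(x, y, z) as a 3-tuple."""
    return (C1 * math.cos(KX * x) * math.sin(KY * y) * math.sin(KZ * z),
            C2 * math.sin(KX * x) * math.cos(KY * y) * math.sin(KZ * z),
            C3 * math.sin(KX * x) * math.sin(KY * y) * math.cos(KZ * z))


def shifted(point, axis, step):
    """Copy of point displaced by step along the given axis."""
    p = list(point)
    p[axis] += step
    return p


def divergence(point, h=1e-3):
    """Central-difference estimate of div R at the given point."""
    return sum((mode(*shifted(point, i, h))[i] - mode(*shifted(point, i, -h))[i])
               / (2.0 * h) for i in range(3))


def laplacian_residual(point, h=1e-3):
    """Largest component of |-laplacian(R) - eigenvalue * R| at the given point."""
    center = mode(*point)
    worst = 0.0
    for comp in range(3):
        lap = sum((mode(*shifted(point, i, h))[comp] - 2.0 * center[comp]
                   + mode(*shifted(point, i, -h))[comp]) / h ** 2
                  for i in range(3))
        worst = max(worst, abs(-lap - EIGENVALUE * center[comp]))
    return worst


P = (1.3, 0.4, 0.2)                 # arbitrary interior point
wall = mode(1.3, 0.0, 0.2)          # point on the wall y = 0

print("div R            :", divergence(P))
print("eigenvalue defect:", laplacian_residual(P))
print("tangential components on the wall y = 0:", wall[0], wall[2])
```

Both residuals are of the size expected from the finite-difference error, while the tangential components on the wall vanish exactly. Since the electric field is proportional to $\partial_t \mathbf{A}$ in the chosen gauge, this is the condition $E_\parallel = 0$.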





Returning to the problem posed by Eq. (1.35) and (1.36), one can expand the vector potential
in terms of eigenfunctions $\mathbf{R}_k$ as $\mathbf{A}(\mathbf{x}, t) = \sum_k \alpha_k(t) \mathbf{R}_k(\mathbf{x})$, where the sum runs over
all allowed values of the index parameter k. (In a waveguide, $k = \pi n/L$, $n \in \mathbb{N}$, where
L is the length of the guide.) Substituting this expansion into Eq. (1.35) and using the
normalization properties of $\mathbf{R}_k$, we obtain $L = \frac{1}{2}\sum_k (\dot\alpha_k^2 - \lambda_k \alpha_k^2)$, i.e. a decoupled
representation where the system is described in terms of independent dynamical systems with
coordinates $\alpha_k$. From this point on, quantization proceeds along the lines of the standard
algorithm. Firstly, define momenta through the relation $\pi_k = \partial_{\dot\alpha_k} L = \dot\alpha_k$. This produces
the Hamiltonian $H = \frac{1}{2}\sum_k (\pi_k \pi_k + \lambda_k \alpha_k \alpha_k)$. Next quantize the theory by promoting fields
to operators $\alpha_k \to \hat\alpha_k$ and $\pi_k \to \hat\pi_k$, and declaring $[\hat\pi_k, \hat\alpha_{k'}] = -i\delta_{kk'}$. The quantum
Hamiltonian operator, again of harmonic oscillator type, then reads

    $\hat H = \frac{1}{2}\sum_k \left(\hat\pi_k \hat\pi_k + \omega_k^2\, \hat\alpha_k \hat\alpha_k\right),$

where $\omega_k^2 = \lambda_k$. Following the same logic as marshaled in Section 1.4, we then define ladder
operators $\hat a_k \equiv \sqrt{\frac{\omega_k}{2}}\,(\hat\alpha_k + \frac{i}{\omega_k}\hat\pi_k)$, $\hat a_k^\dagger \equiv \sqrt{\frac{\omega_k}{2}}\,(\hat\alpha_k - \frac{i}{\omega_k}\hat\pi_k)$, whereupon the Hamiltonian
assumes the now familiar form

    $\hat H = \sum_k \omega_k \left(\hat a_k^\dagger \hat a_k + \frac{1}{2}\right).$    (1.39)

For the specific problem of the first excited mode in a waveguide of width $L_y$, $\omega_k =$
$[k^2 + (\pi/L_y)^2]^{1/2}$. Equation (1.39) represents our final result for the quantum Hamiltonian
of the EM waveguide. Before concluding this section let us make a few comments on the 
structure of the result: 

> Firstly, notice that the construction above almost completely paralleled our previous
discussion of the harmonic chain. 20 The structural similarity between the two systems finds
its origin in the fact that the free field Lagrangian (1.35) is quadratic and, therefore, bound 
to map onto an oscillator-type Hamiltonian. That we obtained a simple one- dimensional 
superposition of oscillators is due to the boundary conditions specific to a narrow waveg- 
uide. For less restrictive geometries, e.g. free space, a more complex superposition of 
vectorial degrees of freedom in three-dimensional space would have been obtained. How- 
ever, the principal mapping of the free EM field onto a superposition of oscillators is 
independent of geometry. 

> Physically, the quantum excitations described by Eq. (1.39) are, of course, the photons of
the EM field. The unfamiliar appearance of the dispersion relation $\omega_k$ is again a peculiarity
of the waveguide. However, in the limit of large longitudinal wave numbers $k \gg L_y^{-1}$, the
dispersion approaches $\omega_k \sim |k|$, i.e. the relativistic dispersion of the photon field. Also
notice that, due to the structural equality of the Hamiltonians (1.34) and (1.39), all that has
been said about the behavior of the phonon modes of the atomic chain carries over to the
photon modes of the waveguide.


20 Technically, the only difference is that instead of index pairs $(k, -k)$ all indices $(k, k)$ are equal and positive.
This can be traced back to the fact that we have expanded in terms of the real eigenfunctions of the closed 
waveguide instead of the complex eigenfunctions of the circular oscillator chain. 
> As with their phononic analogue, the oscillators described by Eq. (1.39) exhibit zero-point
fluctuations. It is a fascinating aspect of quantum electrodynamics that these oscillations,
caused by quantization of the most relativistic field, surface at various points of
non-relativistic physics. In the section below two prominent manifestations of zero-point 
fluctuations in condensed matter physics will be briefly discussed. 
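
The two limits of the waveguide dispersion mentioned in the comments above are easy to tabulate. The sketch below, an illustration in units $c = \hbar = 1$ with a hypothetical transverse width $L_y = 1$, evaluates $\omega_k = [k^2 + (\pi/L_y)^2]^{1/2}$ together with the zero-point energy $\omega_k/2$ that each mode contributes to the ground state.

```python
import math


def omega(k, width):
    """Dispersion of the lowest waveguide mode: sqrt(k^2 + (pi / L_y)^2)."""
    return math.sqrt(k * k + (math.pi / width) ** 2)


LY = 1.0  # hypothetical transverse width of the guide

for k in (0.0, 1.0, 10.0, 100.0, 1000.0):
    w = omega(k, LY)
    print(f"k = {k:7.1f}   omega = {w:10.4f}   "
          f"omega - |k| = {w - abs(k):.6f}   zero-point energy = {w / 2:.4f}")
```

At $k = 0$ the frequency stays at the finite value $\pi/L_y$, whereas for $k \gg L_y^{-1}$ the difference $\omega_k - |k|$ shrinks towards zero and the linear photon dispersion is recovered.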


Vacuum fluctuations in matter 

The quantum oscillation of the electromagnetic field manifests itself in a multitude of physi- 
cal phenomena. In the following, we shall briefly discuss some of the more prominent effects 
caused by the dynamics of the quantum vacuum. 

One of the most important phenomena induced by vacuum fluctuations is the Casimir 
effect: 21 two parallel conducting plates embedded into the vacuum exert an attractive
force on each other. This phenomenon is not only of conceptual importance - better than 
anything else it demonstrates that the vacuum is “alive” - but also of applied relevance. 
For example, the force balance of hydrophobic suspensions of particles of size 0.1-1 μm in
electrolytes is believed to be strongly influenced by Casimir forces. At least it is applica- 
tions in colloidal chemistry which prompted Casimir to his famous analysis of the idealized 
vacuum problem. Qualitatively, the origin of the Casimir force is readily understood. Like 
their classical analog, quantum photons exert a certain radiation pressure on macroscopic 
media. The difference with the classical case is that, due to zero-point oscillations, even the 
quantum vacuum is capable of creating radiation pressure. For a single conducting body 
embedded into the infinite vacuum, the net pressure vanishes by symmetry. However, for 
two parallel plates, the situation is different. Mode quantization arguments similar to the 
ones used in the previous section show that the density of quantum modes between the 
plates is lower than in the semi-infinite outer spaces. Hence, the force (density) created by 
outer space exceeds the counter-pressure from the inside; the plates “attract” each other. 
The unambiguous measurement of the Casimir force required advanced nano-technological 
instrumentation and has succeeded only recently. 22 
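
To get a feeling for the magnitude of the effect, the following minimal sketch evaluates the standard ideal-plate result $P = \pi^2\hbar c/(240\,a^4)$ for the attractive pressure between two perfectly conducting plates at separation $a$. This closed formula is the textbook outcome of Casimir's analysis and is not derived in the present section; the function name and the sample separations are illustrative assumptions.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant in J s
C = 2.99792458e8        # speed of light in m/s


def casimir_pressure(a):
    """Magnitude of the attractive pressure (in Pa) between ideal plates at separation a (in m).

    Assumes perfectly conducting, perfectly parallel plates at zero temperature.
    """
    return math.pi ** 2 * HBAR * C / (240.0 * a ** 4)


if __name__ == "__main__":
    # Illustrative separations spanning the colloidal length scales mentioned in the text.
    for a in (1e-7, 1e-6, 1e-5):
        print(f"a = {a:.0e} m -> P = {casimir_pressure(a):.3e} Pa")
```

The steep $a^{-4}$ dependence suggests why the force matters on sub-micron scales yet is hard to detect between macroscopically separated bodies.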

A second important phenomenon where vacuum fluctuations play a role is van der Waals 
forces: atoms or molecules attract each other by a potential that, at small separation r, 
scales as $r^{-6}$. Early attempts to explain the phenomenon classically produced results for the 
force that, in conflict with experimental findings, were strongly temperature dependent. It 
was considered a major breakthrough of the new quantum mechanics when London proposed 
a model whereby a temperature-independent $r^{-6}$ law was obtained. The essence of London’s 


21 H. B. G. Casimir and D. Polder, The influence of retardation on the London-van der Waals forces, Phys. Rev. 
73, 360-72 (1948); H. B. G. Casimir, On the attraction between two perfectly conducting plates, Proc. Kon. 
Nederland. Akad. Wetensch. 51, 793-6 (1948). 

22 S. K. Lamoreaux, Demonstration of the Casimir force in the 0.6 to 6 μm range, Phys. Rev. Lett. 78, 5-9 (1997); 
G. Bressi et al., Measurement of the Casimir force between parallel metallic surfaces, Phys. Rev. Lett. 88, 
41804-9 (2002). For a review of recent developments see M. Bordag et al., New developments in the Casimir 
effect, Phys. Rep. 353, 1-205 (2001). 
Figure 1.10 On the creation of van der Waals forces by EM vacuum fluctuations. An atom or 
molecule B is exposed to fluctuations of the EM quantum vacuum (depicted by wavy lines). This 
leads to the creation of a dipole element $\mathbf{d}_B$. A nearby partner atom A sees both the vacuum 
amplitude and the dipole field created by $\mathbf{d}_B$. The cooperation of the two field strengths leads to 
a net lowering of the energy of A. 


idea is easily explained: imagine each of the atoms/molecules as an oscillator. The zero- 
point motion of these oscillators, measured in terms of some coordinate x, creates a dipole 
moment of strength ~ x. At close distances, the systems interact through a dipole-dipole 
interaction. 23 Writing down coupled oscillator equations (see Problem 1.8), one readily 
obtains a lowering of the ground state energy by a contribution $V(r) \sim r^{-6}$. 

However, there is a subtle point 
about the seemingly innocuous cou- 
pling of the quantum oscillator to the 
classical Coulomb potential. Close 
inspection (see Milonni’s text for 
details 24 ) shows that, to maintain 
the quantum commutation relations 
of the oscillator degrees of freedom, 
the EM field must not be assumed 
entirely classical. In fact, once the 
quantum nature of the EM field has 
entered the stage, it becomes possible to explain the force without need for phenomenolog- 
ical introduction of atomic oscillator degrees of freedom. 

Consider an atom A exposed to an electric field $\mathbf{E}$ (see Fig. 1.10). The field gives rise to 
the atomic Stark effect, i.e. a level shift of order $W \sim \alpha\langle\mathbf{E}^2(\mathbf{x}_A)\rangle$, where $\alpha$ is the atomic 
polarizability, $\mathbf{E} = \partial_t\mathbf{A}$ is the operator representing the EM field, and $\langle\ldots\rangle$ a quantum 
expectation value. For an atom in empty space, $\mathbf{E}(\mathbf{x}_A) = \mathbf{E}_0(\mathbf{x}_A)$ is an operator measuring 
the vacuum fluctuations of the free field. However, in the neighborhood of a second atom 
B, an induced dipole contribution adds to $\mathbf{E}_0$. The point is that the zero-point amplitude 
$\mathbf{E}_0$ induces a dipole element $\mathbf{d}_B \propto \alpha\mathbf{E}_0(\mathbf{x}_B)$ which, in turn, creates a dipole field $\mathbf{E}_B$ 


Johannes Diderik van der Waals 
1837-1923 

1910 Nobel Laureate in physics 
in recognition of his work on the 
state equation of non-ideal gases. 

Van der Waals was one of the 
first to postulate inter-molecular 
forces, at a time when the exis- 
tence of atoms and molecules was still disputed. 
Other important work includes the derivation of first 
variants of thermodynamic scaling relations. 



23 In passing, we mention that various animal species benefit from the short-distance efficiency of the van der 
Waals force. For instance, geckos and spiders owe their ability to climb up planar surfaces of basically any 
material to the presence of bushels of ultra-fine hair (about three orders of magnitude thinner than human hair) 
on their feet. The tips of these hairs come close enough to the atoms of the substrate material to make the van 
der Waals force sizeable. Impressively, this mechanism provides a force of about two orders of magnitude larger 
than that required to support a spider’s full body weight. Both spiders and geckos have to “roll” their feet off 
the surface to prevent getting stuck by the enormous power of the forces acting on their many body hairs! 

24 P. W. Milonni, The Quantum Vacuum (Academic Press, 1994). 
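contributing to the total field amplitude felt by atom A: $\mathbf{E}(\mathbf{x}_A) = (\mathbf{E}_0 + \mathbf{E}_B)(\mathbf{x}_A)$ (see 
Fig. 1.10). The dominant contribution to the level shift due to the proximity of B is then 
given by 

$$V = W - \alpha\langle\mathbf{E}_0^2(\mathbf{x}_A)\rangle \sim \alpha\langle\mathbf{E}_0(\mathbf{x}_A)\cdot\mathbf{E}_B(\mathbf{x}_A)\rangle \sim \alpha^2\langle\mathbf{E}_0^T(\mathbf{x}_A)\,K(\mathbf{x}_A - \mathbf{x}_B)\,\mathbf{E}_0(\mathbf{x}_B)\rangle, \qquad (1.40)$$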

where K is a matrix kernel describing the geometric details of the dipole interaction. (Here, 
we have subtracted the vacuum level shift because it represents an undetectable offset.) 
Notice that Eq. (1.40) predicts the van der Waals interaction to be proportional to the 
square of the zero-point field amplitude. 

Unfortunately, the quantitative 
evaluation of the matrix element, 
involving a geometric average over 
all polarization directions of the 
quantum field amplitude, is some- 
what involved. As a final result of a 
calculation 25 one obtains 

$$V(r) = -\frac{3\omega_0\alpha^2}{4r^6}, \qquad (1.41)$$

where $r = |\mathbf{x}_A - \mathbf{x}_B|$ is the distance between the atoms and $\omega_0$ is the transition frequency 
between the ground state and the first excited state of the atom. (This parameter enters 
the result through the dependence of the microscopic polarizability on the transition fre- 
quencies.) 


Johannes Stark 1874-1957 

Nobel laureate 1919 for his dis- 
covery of the Stark effect in 1913. 

Later, Stark became infamous for 
the role he played during the 
Third Reich. He attacked theoretical 
physics as “Jewish” and 
stressed that scientific positions in Nazi Germany 
should only be held by pure-blooded Germans. Image 
©The Nobel Foundation 
1.6 Noether’s theorem 

According to a basic paradigm of physics every continuous symmetry of a system entails a 
conservation law. 26 Conservation laws, in turn, greatly simplify the solution of any problem, 
which is why one gets acquainted with the correspondence (symmetry ↔ conservation 
law) at a very early stage of the physics curriculum, e.g. the connection between rotational 
symmetry and the conservation of angular momentum. However, it is not all that trivial 
to see (at least within the framework of Newtonian mechanics) that the former entails 
the latter. One needs to know what to look for (angular momentum) to identify the con- 
served quantity corresponding to rotational invariance. It is one of the major advantages of 
Lagrangian over Newtonian mechanics that it provides one with a tool - Noether’s theorem 
- to automatically identify the conservation laws generated by the symmetries of classical 
mechanics. 
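
The point-particle correspondence between rotational symmetry and angular momentum can be illustrated numerically. The sketch below is an illustration under stated assumptions, not part of the formal development: it integrates a unit-mass particle in the plane with a velocity-Verlet scheme, once in a rotationally invariant potential ($-1/r$, chosen for convenience) and once with an added uniform force that breaks the symmetry. All function names, the initial condition, and the strength 0.1 of the symmetry-breaking field are hypothetical choices.

```python
def simulate(force, x, v, dt=1e-3, steps=2000):
    """Velocity-Verlet integration of a unit-mass particle in the plane."""
    ax, ay = force(x)
    for _ in range(steps):
        # half kick, full drift, half kick
        vx_half, vy_half = v[0] + 0.5 * dt * ax, v[1] + 0.5 * dt * ay
        x = (x[0] + dt * vx_half, x[1] + dt * vy_half)
        ax, ay = force(x)
        v = (vx_half + 0.5 * dt * ax, vy_half + 0.5 * dt * ay)
    return x, v


def angular_momentum(x, v):
    """z-component of x cross v for unit mass."""
    return x[0] * v[1] - x[1] * v[0]


def central_force(x):
    """Force derived from the rotationally invariant potential -1/r."""
    r3 = (x[0] ** 2 + x[1] ** 2) ** 1.5
    return (-x[0] / r3, -x[1] / r3)


def tilted_force(x):
    """Central force plus a uniform field along x, which breaks rotational symmetry."""
    fx, fy = central_force(x)
    return (fx + 0.1, fy)


if __name__ == "__main__":
    x0, v0 = (1.0, 0.0), (0.0, 0.8)
    for name, force in (("central", central_force), ("tilted", tilted_force)):
        x, v = simulate(force, x0, v0)
        print(name, angular_momentum(x0, v0), angular_momentum(x, v))
```

In the symmetric case the angular momentum stays fixed up to rounding errors, whereas the uniform field exerts a torque and the quantity drifts. Note that one had to know in advance which quantity to monitor, which is precisely the shortcoming that Noether's theorem removes.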


25 Detailed, e.g., in Milonni, The Quantum Vacuum, Section 3.11. 

26 Before exploring the ramifications of symmetries and conservation laws for fields, it may be instructive to 
recapitulate Noether’s theorem in the context of classical point-particle mechanics - see, e.g., L. D. Landau and 
E. M. Lifshitz, Classical Mechanics (Pergamon, 1960). 
What happens when one advances 
from point to continuum mechan- 
ics? 27 Clearly, multidimensional con- 
tinuum theories leave more room for 
the emergence of complex symme- 
tries; even more so than in classical 
mechanics, we are in need of a tool 
identifying the corresponding con- 
servation laws. Fortunately, it turns 
out that Noether’s theorem of point 
mechanics affords a more or less 
straightforward generalization to 
higher dimensions. Starting from 
the general form of the action of a 
continuum system, Eq. (1.16), the 
continuum version of Noether’s the- 
orem will be derived below. In not 
referring to a specific physical problem, our discussion will be somewhat dry. This lack 
of physical context is, however, more than outweighed by the general applicability of the 
result: the generalized form of Noether’s theorem can be - without much further thought - 
applied to generate the conservation laws of practically any physical symmetry. In this 
section, we will illustrate the application of the formalism on the simple (yet important) 
example of space-time translational invariance. A much more intriguing case study will be 
presented in Section 4.3 after some further background of quantum field theory has been 
introduced. 


Amalie E. Noether 1882-1935 

German mathematician known 
for her groundbreaking contri- 
butions to abstract algebra and 
theoretical physics. Alive at a 
time when women were not supposed to attend 
college preparatory schools, she was often forbidden 
from lecturing under her own name. Despite these 
obstacles, Noether became one of the greatest 
algebraists of the century. Described by Albert Ein- 
stein as the most significant creative mathematical 
genius thus far produced since the higher education 
of women began, she revolutionized the theories 
of rings, fields, and algebras. In physics, Noether’s 
theorem explains the fundamental connection 
between symmetry and conservation laws. In 1933, 
she lost her teaching position due to being a Jew 
and a woman, and was forced out of Germany by 
the Nazis. 



Symmetry transformations 

The symmetries of a physical system manifest themselves in the invariance of its action 
under certain transformations. Mathematically, symmetry transformations are described by 
two pieces of input data: firstly, a mapping $M \to M$, $x \mapsto x'(x)$ that assigns to any point of 
the base manifold some “transformed” point; secondly, the field configurations themselves 
may undergo some change, i.e. there may be a mapping $(\phi: M \to T) \mapsto (\phi': M \to T)$ that 
defines transformed values $\phi'(x') = F[\{\phi(x)\}]$ in terms of the “old” field. It is important to 
understand that these two operations may, in general, be independent of each other. 


27 Later on we shall see that all we are discussing here carries over to the quantum level. 
For example, suppose we want to explore the 
invariance properties of a theory under transla- 
tions in space-time. In this case, we can consider 
a mapping $x' = x + a$, $a \in \mathbb{R}^m$, $\phi'(x') = \phi(x)$. This 
describes the translation of a field by a fixed offset 
$a$ in space-time (see figure). The system is trans- 
lationally invariant if and only if $S[\phi] = S[\phi']$. As 
a second example, let us probe rotational sym- 
metry: $x' = Rx$, where $R \in \mathrm{O}(m)$ is a rotation 
of Euclidean space-time. In this case it would, in 
general, be unphysical to define $\phi'(x') = \phi(x)$. 

To illustrate this point, consider the example of 
a vector field in two dimensions $n = m = 2$ (see 
the figure). A properly rotated field configuration 
is defined by $\phi'(x') = R\phi(x)$, i.e. the field ampli- 
tude actively participates in the operation. In fact, one often considers symmetry operations 
where only the fields are transformed while the base manifold is left untouched. 28 For exam- 
ple, the intrinsic 29 rotational invariance of a magnet is probed by $x' = x$, $\mathbf{m}'(x) = R\cdot\mathbf{m}(x)$, 
where the vector field $\mathbf{m}$ describes the local magnetization. 
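
The difference between a properly rotated vector field and a naive relabeling of its argument can be made concrete with a small sketch. It assumes a radial ("hedgehog") field $\phi(x) = x$ in two dimensions, which is manifestly rotationally symmetric; the helper names below are hypothetical and introduced only for illustration.

```python
import math


def rotate(theta, vec):
    """Apply the 2x2 rotation matrix R(theta) to a two-component vector."""
    c, s = math.cos(theta), math.sin(theta)
    return (c * vec[0] - s * vec[1], s * vec[0] + c * vec[1])


def properly_rotated(field, theta):
    """phi'(x') = R phi(x) with x = R^{-1} x': base point and amplitude both rotate."""
    return lambda xp: rotate(theta, field(rotate(-theta, xp)))


def naively_rotated(field, theta):
    """phi'(x') = phi(x): only the base point is rotated, the amplitude is not."""
    return lambda xp: field(rotate(-theta, xp))


def radial_field(x):
    """Hedgehog configuration phi(x) = x (an illustrative, rotationally symmetric field)."""
    return (x[0], x[1])


if __name__ == "__main__":
    theta, point = 0.7, (0.3, -1.2)
    print("original:", radial_field(point))
    print("proper  :", properly_rotated(radial_field, theta)(point))
    print("naive   :", naively_rotated(radial_field, theta)(point))
```

Only the proper prescription maps the symmetric configuration onto itself; the naive one produces arrows that no longer point radially outwards.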

To understand the impact of a symmetry transformation, it is fully sufficient to con- 
sider its infinitesimal version. (Any finite transformation can be generated by successive 
application of infinitesimal ones.) Consider, thus, the two mappings 


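$$\begin{aligned} x_\mu &\to x'_\mu = x_\mu + \left.\frac{\partial x_\mu}{\partial\omega_a}\right|_{\omega=0}\omega_a(x), \\ \phi^i(x) &\to \phi'^i(x') = \phi^i(x) + \omega_a(x)F^i_a[\phi], \end{aligned} \qquad (1.42)$$
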


expressing the change of both fields and coordinates to linear order in a set of param- 
eter functions $\{\omega_a\}$ characterizing the transformation. (For a three-dimensional rotation, 
$(\omega_1, \omega_2, \omega_3) = (\phi, \theta, \psi)$ would be the rotation angles, etc.) The functionals $\{F^i_a\}$ - which 
need not depend linearly on the field $\phi$, and may explicitly depend on the coordinate $x$ - 
define the incremental change $\phi'(x') - \phi(x)$. 

We now ask how the action Eq. (1.16) changes under the transformation Eq. (1.42), i.e. 
we wish to compute the difference 


$$\Delta S = \int d^mx'\,\mathcal{L}\big(\phi'^i(x'), \partial_{x'_\mu}\phi'^i(x')\big) - \int d^mx\,\mathcal{L}\big(\phi^i(x), \partial_{x_\mu}\phi^i(x)\big).$$


28 For example, the standard symmetry transformations of classical mechanics, $q(t) \to q'(t)$, belong to this class: 
the coordinate vector of a point particle, $q$ (a “field” in 0 + 1 space-time dimensions) changes while the “base” 
(time $t$) does not transform. 

29 “Intrinsic” means that we rotate just the spins but not the entire magnet (as in our second example). 
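Inserting Eq. (1.42), using the fact that $\partial x'_\mu/\partial x_\nu = \delta_{\mu\nu} + (\partial/\partial x_\nu)(\omega_a\,\partial x_\mu/\partial\omega_a)$, together 
with 30 $|\partial x'_\mu/\partial x_\nu| \simeq 1 + (\partial/\partial x_\mu)(\omega_a\,\partial x_\mu/\partial\omega_a)$, one obtains 

$$\Delta S \simeq \int d^mx\,\big(1 + \partial_{x_\mu}(\omega_a\partial_{\omega_a}x_\mu)\big)\,\mathcal{L}\big(\phi^i + F^i_a\omega_a,\ (\delta_{\mu\nu} - \partial_{x_\mu}(\omega_a\partial_{\omega_a}x_\nu))\,\partial_{x_\nu}(\phi^i + F^i_a\omega_a)\big)$$

$$\qquad - \int d^mx\,\mathcal{L}\big(\phi^i(x), \partial_{x_\mu}\phi^i(x)\big).$$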
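
The first-order expansion of the Jacobian used here rests on the fact that $\det(1 + \epsilon M) = 1 + \epsilon\,\mathrm{tr}\,M + O(\epsilon^2)$. The following sketch checks this for a $2\times 2$ example, where $M$ stands in for the matrix of derivatives $\partial_{x_\nu}(\omega_a\partial_{\omega_a}x_\mu)$; the specific entries and function names are arbitrary illustrative assumptions.

```python
def det2(m):
    """Determinant of a 2x2 matrix given as nested lists."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def trace2(m):
    """Trace of a 2x2 matrix given as nested lists."""
    return m[0][0] + m[1][1]


def jacobian_check(m, eps):
    """Return (exact det(1 + eps*m), first-order estimate 1 + eps*tr m)."""
    one_plus = [[1.0 + eps * m[0][0], eps * m[0][1]],
                [eps * m[1][0], 1.0 + eps * m[1][1]]]
    return det2(one_plus), 1.0 + eps * trace2(m)


if __name__ == "__main__":
    m = [[0.3, -1.2], [0.7, 0.5]]  # arbitrary illustrative matrix
    for eps in (1e-1, 1e-2, 1e-3):
        exact, linear = jacobian_check(m, eps)
        print(eps, exact, linear, exact - linear)
```

For a $2\times 2$ matrix the remainder is exactly $\epsilon^2\det M$, so the discrepancy shrinks quadratically with $\epsilon$, as expected of a linear-order approximation.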


So far, we did not use the fact that the transformation was actually meant to be a symmetry 
transformation. By definition, we are dealing with a symmetry if, for constant parameters 
$\omega_a$ - a uniform rotation, global translation, etc. - the action difference $\Delta S$ vanishes. In 
other words, the leading contribution to the action difference of a symmetry transformation 
must be linear in the derivatives $\partial_{x_\mu}\omega_a$. A straightforward expansion of the formula above 
for $\Delta S$ shows that these terms are given by 

$$\Delta S = -\int d^mx\, j^a_\mu(x)\,\partial_\mu\omega_a,$$

where the so-called Noether current is given by 


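$$j^a_\mu = \left(\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i)}\partial_\nu\phi^i - \mathcal{L}\delta_{\mu\nu}\right)\left.\frac{\partial x_\nu}{\partial\omega_a}\right|_{\omega=0} - \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i)}F^i_a. \qquad (1.43)$$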


For a completely general field configuration, there is not much that can be said about the Noether 
current (whether or not the theory possesses the symmetry). However, if the field $\phi$ obeys 
the classical equations of motion and the theory is symmetric, the Noether current is locally 
conserved, $\partial_\mu j^a_\mu = 0$. This follows from the fact that, for a solution $\phi$ of the Euler-Lagrange 
equations, the linear variation of the action in any parameter must vanish. Specifically, 
the vanishing of $\Delta S[\phi]$ for arbitrary symmetry transformations $\{\omega_a\}$ enforces the condition 
$\partial_\mu j^a_\mu = 0$. (But keep in mind the fact that the conservation law holds only on the classical 
level!) Therefore, in summary, we have Noether’s theorem: 


A continuous symmetry entails a classically conserved current. 


The local conservation of the current entails the existence of a globally conserved “charge.” 
For example, for a base manifold $x = (x_0, x_1, \ldots, x_d)$ in Euclidean $(1 + d)$-dimensional 
space-time, integration over the space-like directions, and application of Stokes’ theorem, 
gives $\partial_0Q^a = 0$, where 31 

$$Q^a \equiv \int d^dx\, j^a_0 \qquad (1.44)$$


is the conserved charge and we have assumed that the current density vanishes at spatial 
infinity. 


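30 This follows from $\left|\frac{\partial x'_\mu}{\partial x_\nu}\right| = \det\frac{\partial x'}{\partial x} = \exp\,\mathrm{tr}\ln\frac{\partial x'}{\partial x} \simeq \exp\left[\frac{\partial}{\partial x_\mu}\left(\omega_a\frac{\partial x_\mu}{\partial\omega_a}\right)\right] \simeq 1 + \frac{\partial}{\partial x_\mu}\left(\omega_a\frac{\partial x_\mu}{\partial\omega_a}\right)$. 

(Exercise: Show that $\det A = \exp\,\mathrm{tr}\ln A$, where $A$ is a linear operator. Hint: Use the eigenbasis.) 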

31 Notice that the integral involved in the definition of Q runs only over spatial coordinates. 
Notice that nowhere in the discussion above have we made assumptions about the internal 
structure of the Lagrangian. In particular, all results apply equally to the Minkowskian and 
the Euclidean formulation of the theory. 


Example: translational invariance 


As an example that is as elementary as it is important, we consider the case of space-time 
translational invariance. The corresponding symmetry transformation is defined by $x'_\mu = 
x_\mu + a_\mu$, $\phi'(x') = \phi(x)$. The infinitesimal version of this transformation reads as $x_\mu \to x_\mu + \omega_\mu$, 
where we have identified the parameter index $a$ with the space-time index $\mu$. Noether’s 
current, which in the case of translational invariance is called the energy-momentum 
tensor, is given by 

$$T_{\mu\nu}(x) = \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i)}\partial_\nu\phi^i - \delta_{\mu\nu}\mathcal{L}.$$


The conserved “charges” corresponding to this quantity are 

$$P_\nu \equiv \int d^dx\, T_{0\nu}(x).$$

Specifically, $P_0$ is the energy and $P_l$, $l = 1, \ldots, d$, the total momentum carried by the system. 
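
As a concrete check of local energy conservation, consider the sketch below. It is an illustration under explicit assumptions rather than part of the derivation: we take the Minkowskian Lagrangian density $\mathcal{L} = \frac{1}{2}(\partial_t\phi)^2 - \frac{v^2}{2}(\partial_x\phi)^2$ of a one-dimensional elastic medium (unit mass density), for which the energy density is $e = \frac{1}{2}(\partial_t\phi)^2 + \frac{v^2}{2}(\partial_x\phi)^2$ and the energy flux is $j = -v^2\,\partial_t\phi\,\partial_x\phi$. The field is a superposition of a right- and a left-moving Gaussian pulse, which solves the wave equation; the velocity value, pulse shapes, and function names are hypothetical choices.

```python
import math

V = 1.5  # wave velocity (illustrative value)


def df(u):
    """Derivative of the Gaussian pulse profile f(u) = exp(-u^2)."""
    return -2.0 * u * math.exp(-u * u)


# Field: phi(x, t) = f(x - V t) + 0.5 f(x + V t - 1), a solution of the wave equation.
def phi_t(x, t):
    return -V * df(x - V * t) + 0.5 * V * df(x + V * t - 1.0)


def phi_x(x, t):
    return df(x - V * t) + 0.5 * df(x + V * t - 1.0)


def energy_density(x, t):
    return 0.5 * phi_t(x, t) ** 2 + 0.5 * V ** 2 * phi_x(x, t) ** 2


def energy_flux(x, t):
    return -V ** 2 * phi_t(x, t) * phi_x(x, t)


def continuity_residual(x, t, h=1e-5):
    """Central-difference estimate of d_t e + d_x j, which should vanish on-shell."""
    de_dt = (energy_density(x, t + h) - energy_density(x, t - h)) / (2.0 * h)
    dj_dx = (energy_flux(x + h, t) - energy_flux(x - h, t)) / (2.0 * h)
    return de_dt + dj_dx


if __name__ == "__main__":
    for x, t in ((0.2, 0.1), (-0.5, 0.3), (1.0, 0.7)):
        print(x, t, continuity_residual(x, t))
```

The residual vanishes up to discretization error because $\partial_te + \partial_xj = \partial_t\phi\,(\partial_t^2\phi - v^2\partial_x^2\phi)$, which is zero precisely when the equation of motion holds.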


EXERCISE Consider the one-dimensional elastic chain discussed in Section 1.1. Convince yourself 
that the continuity equation of the system assumes the form $\partial_t\int_{x_1}^{x_2}dx\,\rho(x,t) = \partial_t(\phi(x_2,t) - \phi(x_1,t))$, 
where $\rho(x,t)$ is the local density of the medium. Use this result to show that $\rho = \partial_x\phi$. 
Show that the momentum $\int dx\,(\text{particle density}) \times (\text{velocity}) = \int dx\,\rho\,\partial_t\phi$ carried by the system 
coincides with the Noether momentum. 


EXERCISE Compute the energy-momentum tensor for the Lagrangian of the free electromag- 
netic field. Derive the corresponding energy-momentum vector and convince yourself that you 
obtain results familiar from electrodynamics. 

Admittedly, translational invariance is not a particularly exciting symmetry and, if this 
were the only example of relevance, the general discussion above would not have been worth 
the effort. However, later in the text, we shall exemplify (see, for example, Section 4.3) how 
concepts hinging on Noether’s theorem can be applied to obtain far-reaching results that 
are not readily accessible by different means. 


1.7 Summary and outlook 

In this chapter we have introduced the general procedure whereby classical continuum 
theories are quantized. Employing the elementary harmonic oscillator as an example, we have 
seen that the Hilbert spaces of these theories afford different interpretations. Of particular 
use was a quasi-particle picture whereby the collective excitations of the continuum theories 
acquired the status of elementary particles. 
Suspiciously, both examples discussed in this text, the harmonic quantum chain and free 
quantum electrodynamics, led to exactly solvable free field theories. However, it takes 
only little imagination to foresee that only few continuum theories will be as simple. Indeed, 
the exact solvability of the atomic chain would have been lost had we included higher-order 
contributions in the expansion in powers of the lattice displacement. Such terms would 
hinder the free wave-like propagation of the phonon modes. Put differently, phonons would 
begin to scatter off each other (i.e. interact). Similarly, the free status of electrodynamics is 
lost once the EM field is allowed to interact with a matter field. Needless to say, interacting 
field theories are infinitely more complex, but also more interesting, than the systems 
considered so far. 

Technically, we have seen that the phonon/photon interpretation of the field theories 
discussed in this chapter could conveniently be formulated in terms of ladder operators. 
However, the applications discussed so far provide only a glimpse of the advantages of 
this language. In fact, the formalism of ladder operators, commonly described as “second 
quantization,” represents a central, and historically the oldest, element of quantum field 
theory. The next chapter is devoted to a more comprehensive discussion of both the formal 
aspects and applications of this formulation. 


1.8 Problems 

Electrodynamics from a variational principle 

Choosing the Lorentz-gauged components of the vector potential as generalized coordinates, the aim 
of this problem is to show how the wave equations of electrodynamics can be obtained as a variational 
principle. 

Electrodynamics can be described by Maxwell’s equations or, equivalently, by wave-like 
equations for the vector potential. Working in the Lorentz gauge, $\partial_t\phi = \nabla\cdot\mathbf{A}$, these 
equations read $(\partial_t^2 - \nabla^2)\phi = \rho$, $(\partial_t^2 - \nabla^2)\mathbf{A} = \mathbf{j}$. Using relativistically covariant notation, 
the form of the equations can be compressed further to $\partial_\mu\partial^\mu A^\nu = j^\nu$. Starting from the 
Lagrangian action, $S[A] = -\int d^4x\,\left(\frac{1}{4}F_{\mu\nu}F^{\mu\nu} + A_\mu j^\mu\right)$, obtain these equations by applying 
the variational principle. Compare the Lorentz gauge representation of the action of the 
field with the action of the elastic chain. What are the differences/parallels? 

Answer: 

Using the definition of the electromagnetic field tensor $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$, and integrating 
by parts, the action assumes the form 

$$S[A] = -\int d^4x\,\left(-\frac{1}{2}A_\nu\left[\partial_\mu\partial^\mu A^\nu - \partial_\mu\partial^\nu A^\mu\right] + j_\mu A^\mu\right).$$

Owing to the Lorentz gauge condition, the second contribution in the square brackets van- 
ishes and we obtain $S[A] = -\int d^4x\,\left(\frac{1}{2}\partial_\mu A_\nu\partial^\mu A^\nu + j_\mu A^\mu\right)$, where we have again integrated 
by parts. Applying the general variational Eq. (1.17) one finally obtains the wave equation. 
Hamiltonian of electromagnetic field 


Here it is shown that the Hamiltonian canonically conjugate to the Lagrangian of the electromagnetic 
field indeed coincides with the energy density familiar from elementary electrodynamics. 

Consider the electromagnetic field in the absence of matter, j = 0. Verify that the total 
energy stored in the field is given by $H = \int d^3x\,\mathcal{H}(\mathbf{x})$, where $\mathcal{H}(\mathbf{x}) = [\mathbf{E}^2(\mathbf{x}) + \mathbf{B}^2(\mathbf{x})]/2$ is the 
familiar expression for the EM energy density. (Hint: Use the vacuum form of Maxwell’s 
equations and the fact that, for an infinite system, the energy is defined only up to surface 
terms.) 

Answer: 

Following the canonical prescription, let us first consider the Lagrangian density, 

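$$\mathcal{L} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} = \frac{1}{2}\sum_{i=1}^{3}(\partial_0A_i - \partial_iA_0)(\partial_0A_i - \partial_iA_0) - \frac{1}{4}\sum_{i,j=1}^{3}(\partial_iA_j - \partial_jA_i)(\partial_iA_j - \partial_jA_i).$$

We next determine the components of the canonical momentum through the relation $\pi_\mu = 
\partial_{\partial_0A_\mu}\mathcal{L}$: $\pi_0 = 0$, $\pi_i = \partial_0A_i - \partial_iA_0 = E_i$. Using the fact that $\partial_iA_j - \partial_jA_i$ is a component 
of the magnetic field, the Hamiltonian density can now be written as 

$$\mathcal{H} = \pi_\mu\partial_0A_\mu - \mathcal{L} = \frac{1}{2}\left(2\mathbf{E}\cdot\partial_0\mathbf{A} - \mathbf{E}^2 + \mathbf{B}^2\right) \overset{(1)}{=} \frac{1}{2}\left(2\mathbf{E}\cdot\nabla\phi + \mathbf{E}^2 + \mathbf{B}^2\right) \overset{(2)}{=} \frac{1}{2}\left(2\nabla\cdot(\mathbf{E}\phi) + \mathbf{E}^2 + \mathbf{B}^2\right),$$

where (1) is based on addition and subtraction of a term $2\mathbf{E}\cdot\nabla\phi$, and (2) on $\nabla\cdot\mathbf{E} = 0$ 
combined with the identity $\nabla\cdot(\mathbf{a}f) = (\nabla\cdot\mathbf{a})f + \mathbf{a}\cdot\nabla f$ (valid for general vector [scalar] 
functions $\mathbf{a}$ ($f$)). Substitution of this expression into the definition of the Hamiltonian yields 

$$H = \frac{1}{2}\int d^3x\,\left(2\nabla\cdot(\mathbf{E}\phi) + \mathbf{E}^2 + \mathbf{B}^2\right) = \frac{1}{2}\int d^3x\,\left(\mathbf{E}^2 + \mathbf{B}^2\right),$$

where we have used the fact that the contribution $\nabla\cdot(\mathbf{E}\phi)$ is a surface term that vanishes 
upon integration by parts. 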


Phonon specific heat 

In the text, we have stated that the mode quantization of elastic media manifests itself in low- 
temperature anomalies of the specific heat. In this problem, concepts of elementary quantum statistical 
mechanics are applied to determine the temperature profile of the specific heat. 

Compute the energy density $u = -(1/L)\partial_\beta\ln Z$ of one-dimensional longitudinal phonons 
with dispersion $\omega_k = v|k|$, where $Z = \mathrm{tr}\,e^{-\beta H}$ denotes the quantum partition function. As 
an intermediate result, show that the thermal expectation value of the energy density of 
the system can be represented as 

$$u = \frac{1}{L}\sum_k\left(\frac{\omega_k}{2} + \omega_kn_B(\omega_k)\right), \qquad (1.45)$$
where $n_B(\epsilon) = (e^{\beta\epsilon} - 1)^{-1}$ is the Bose-Einstein distribution function. Approximate the sum 
over $k$ by an integral and show that $c_v \sim T$. At what temperature $T_{\mathrm{cl}}$ does the specific heat 
cross over to the classical result $c_v = \mathrm{const}$? (Remember that the linear dispersion $\omega_k = v|k|$ 
is based on a quadratic approximation to the Hamiltonian of the system and, therefore, 
holds only for $|k| < \Lambda$, where $\Lambda$ is some cutoff momentum.) Reiterate the discussion of 
Section 1.4 for a $d$-dimensional isotropic solid of volume $L^d$ (i.e. assume that the atomic 
exchange constants remain the same in all directions). Show that the dispersion generalizes 
to $\omega_{\mathbf{k}} = v|\mathbf{k}|$, where $\mathbf{k} = (n_1, \ldots, n_d)2\pi/L$ and $n_i \in \mathbb{Z}$. Show that the specific heat shows 
the temperature dependence $c_v \sim T^d$. 

Answer: 

As discussed in the text, the quantum eigenstates of the system are given by $|n_1, n_2, \ldots\rangle$, 
where $n_m$ is the number of phonons of wavenumber $k_m = 2\pi m/L$, $E_{|n_1,n_2,\ldots\rangle} = 
\sum_m\omega_m(n_m + 1/2) \equiv \sum_m\epsilon_m$ the eigenenergy, and $\omega_m = v|k_m|$. In the energy represen- 
tation, the quantum partition function then takes the form 

$$Z = \mathrm{tr}\,e^{-\beta H} = \sum_{\mathrm{states}}e^{-\beta E_{\mathrm{state}}} = \prod_m\sum_{n_m=0}^{\infty}e^{-\beta\omega_m(n_m+1/2)} = \prod_m\frac{e^{-\beta\omega_m/2}}{1 - e^{-\beta\omega_m}},$$

where $n_m$ is the occupation number of the state with wavenumber $k_m$. Hence, 

$$\ln Z = -\sum_m\left[\frac{\beta\omega_m}{2} + \ln\left(1 - e^{-\beta\omega_m}\right)\right].$$

Differentiation with respect to $\beta$ yields Eq. (1.45) and, replacing $\sum_m \to (L/2\pi)\int dk$, we 
arrive at 

$$u = C_1 + \frac{1}{2\pi}\int_{|k|<\Lambda}dk\,\frac{v|k|}{e^{\beta v|k|} - 1} \simeq C_1 + \beta^{-2}C_2.$$

Here, $C_1$ is the temperature-independent constant accounting for the “zero-point energies” 
$\omega_m/2$. In the second equality, we have scaled $k \to \beta k$. This produces a prefactor $\beta^{-2}$ multi- 
plied by a temperature-independent (up to the temperature dependence of the boundaries 
$\Lambda \to \beta\Lambda$) integral that we denoted by $C_2$. Differentiation with respect to $T$ then leads to the 
relation $c_v = \partial_Tu \sim T$. However, for temperatures $T > v\Lambda$ higher than the highest frequen- 
cies stored in the phonon modes, the procedure above no longer makes sense (formally, due 
to the now non-negligible temperature dependence of the boundaries). Yet in this regime 
we may expand $e^{\beta v|k|} - 1 \simeq \beta v|k|$, which brings us back to the classical result $c_v = \mathrm{const}$. 

Consider now a $d$-dimensional solid with “isotropic” exchange coupling $(k_s/2)\sum_{\mathbf{R}}\sum_{i=1}^{d}(\phi_{\mathbf{R}+\mathbf{e}_i} - 
\phi_{\mathbf{R}})^2$ ($\mathbf{e}_i$: unit vector in the $i$-direction). Taking a continuum limit leads to a contribu- 
tion $(k_sa^2/2)(\nabla\phi(\mathbf{x}))^2$. Proceeding along the lines of the text, we find that the relevant 
excitations are now waves with wave vector $\mathbf{k} = 2\pi(n_1, \ldots, n_d)/L$ and energy $\omega_{\mathbf{k}} = v|\mathbf{k}|$. 
Accordingly, the sum in our formula for the specific heat becomes $\sum_{\mathbf{k}} \sim \int d^dk$. Scaling 
$k_i \to \beta k_i$ then generates a prefactor $\beta^{-d-1}$ and we arrive at the relation $c_v \sim T^d$. 
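
The crossover between the two regimes of the one-dimensional problem can also be seen numerically. The following sketch is an illustration in units $\hbar = k_B = 1$, with hypothetical function names and arbitrarily chosen values of $v$ and $\Lambda$: it evaluates the thermal part of the energy density by a midpoint rule and differentiates it with respect to $T$ by central differences.

```python
import math


def bose(x):
    """Bose-Einstein factor 1/(e^x - 1); set to zero where e^x would overflow."""
    return 0.0 if x > 700.0 else 1.0 / math.expm1(x)


def thermal_energy_density(T, v=1.0, cutoff=10.0, n=100000):
    """(1/2pi) * integral over |k| < cutoff of v|k| n_B(v|k|), without the zero-point part."""
    dk = cutoff / n
    total = 0.0
    for i in range(n):
        k = (i + 0.5) * dk  # midpoint rule avoids the point k = 0
        total += v * k * bose(v * k / T)
    # The factor 2 from the symmetry k -> -k turns 1/(2 pi) into 1/pi.
    return total * dk / math.pi


def specific_heat(T, v=1.0, cutoff=10.0):
    """c_v = du/dT from a central difference with relative step 1e-3."""
    h = 1e-3 * T
    upper = thermal_energy_density(T + h, v, cutoff)
    lower = thermal_energy_density(T - h, v, cutoff)
    return (upper - lower) / (2.0 * h)


if __name__ == "__main__":
    for T in (0.05, 0.1, 100.0, 1000.0):
        print(T, specific_heat(T))
```

For $T \ll v\Lambda$ the output follows the linear law $c_v \simeq \pi T/3v$, the prefactor arising from $\int_0^\infty dx\,x/(e^x - 1) = \pi^2/6$, while for $T \gg v\Lambda$ it saturates at the classical value $\Lambda/\pi$, i.e. one unit of $k_B$ per mode.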

Van der Waals force 


We here discuss a semi-phenomenological approach to the van der Waals force: modelling the con- 
stituents participating in the interaction as quantum oscillators, we find that the ground state energy 
of the ensuing two-body system is lower than that of the isolated compounds. 


London’s phenomenological approach to explaining the van der Waals force starts out 
from the Hamiltonian 


$$H = \frac{p_1^2}{2m} + \frac{p_2^2}{2m} + \frac{m\omega_0^2}{2}\left(x_1^2 + x_2^2\right) + mKx_1x_2,$$

where $x_i$ is the coordinate of an oscillator phenomenologically representing one of two 
atoms $i = 1, 2$. The atoms are coupled through a dipole-dipole interaction, where the 
dipole operators are proportional to $x_i$ and $K(r) = qe^2/(mr^3)$ encapsulates the details of 
the interaction. Here $r$ denotes the distance between the atoms and $q$ is a dipole-dipole 
orientation factor. 

Compute the spectrum of the two-particle system described by $H$ and verify that the 
dipole coupling leads to a lowering of the ground state energy by an amount $V(r) = K^2/8\omega_0^3$. 
Using the fact that the classical polarizability of an oscillator is given by $\alpha = e^2/m\omega_0^2$ this 
becomes $V = q^2\omega_0\alpha^2/8r^6$. Finally, using the fact that the directional average of the geo- 
metric factor $q^2$ equals 2, and multiplying by three to take account of the three-dimensional 
character of a “real” atomic oscillator, we recover the result Eq. (1.41). 


Answer: 

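We begin by formulating the potential $U$ of the oscillator system in a matrix notation, 
$U = \mathbf{x}^TA\mathbf{x}$, where $\mathbf{x}^T = (x_1, x_2)$ and 

$$A = \frac{m}{2}\begin{pmatrix}\omega_0^2 & K \\ K & \omega_0^2\end{pmatrix}.$$

Diagonalizing the matrix, $A = UDU^T$, and transforming coordinates, $\mathbf{x} \to U^T\mathbf{x} = \mathbf{x}'$, the 
system decouples into two independent oscillators, 

$$H = \sum_{i=1,2}\left[\frac{p_i'^2}{2m} + \frac{m\omega_i^2}{2}x_i'^2\right],$$

where the new characteristic frequencies $\omega_{1/2} = (\omega_0^2 \pm K)^{1/2}$ are determined by the eigen- 
values of the matrix $A$. The ground state energy of this system is given by $E_0 = (\omega_1 + \omega_2)/2 \simeq 
\omega_0 - K^2/8\omega_0^3$, which lies lower than the energy of the isolated atoms by an amount 
$K^2/(8\omega_0^3)$, i.e. $V = -K^2/(8\omega_0^3)$. 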
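
The quality of the small-$K$ expansion, and the resulting $r^{-6}$ law, can be checked with a few lines of code. The sketch below works in units $\hbar = 1$; the function names and the default values $q = e = m = 1$ are illustrative assumptions.

```python
import math


def ground_state_shift(omega0, K):
    """Exact change of the zero-point energy caused by the coupling m*K*x1*x2."""
    w_plus = math.sqrt(omega0 ** 2 + K)
    w_minus = math.sqrt(omega0 ** 2 - K)
    return 0.5 * (w_plus + w_minus) - omega0


def perturbative_shift(omega0, K):
    """Leading term of the expansion in K: -K^2 / (8 omega0^3)."""
    return -K ** 2 / (8.0 * omega0 ** 3)


def coupling(r, q=1.0, e=1.0, m=1.0):
    """Dipole-dipole coupling K(r) = q e^2 / (m r^3)."""
    return q * e ** 2 / (m * r ** 3)


if __name__ == "__main__":
    omega0 = 1.0
    for r in (3.0, 5.0, 10.0):
        K = coupling(r)
        print(r, ground_state_shift(omega0, K), perturbative_shift(omega0, K))
```

Doubling the separation reduces the energy lowering by a factor close to $2^6 = 64$, as expected for a potential that scales as $r^{-6}$.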
Second quantization 


The purpose of this chapter is to introduce and apply the method of second quantization, a tech- 
nique that underpins the formulation of quantum many-particle theories. The first part of the chapter 
focuses on methodology and notation, while the remainder is devoted to the development of applications 
designed to engender familiarity with and fluency in the technique. Specifically, we will investigate the 
physics of the interacting electron gas, charge density wave propagation in one-dimensional quantum 
wires, and spin waves in a quantum Heisenberg (anti)ferromagnet. Indeed, many of these examples and 
their descendants will reappear as applications in our discussion of methods of quantum field theory in 
subsequent chapters. 


In the previous chapter we encountered two field theories that could conveniently be rep- 
resented in the language of “second quantization,” i.e. a formulation based on the algebra 
of certain ladder operators $a_k$. 1 There were two remarkable facts about this formulation: 
firstly, second quantization provides a compact way of representing the many-body space 
of excitations; secondly, the properties of the ladder operators were encoded in a simple 
set of commutation relations (cf. Eq. (1.33)) rather than in some explicit Hilbert space 
representation. 

Apart from a certain aesthetic appeal, these observations would not be of much relevance 
if it were not for the fact that the formulation can be generalized to a comprehensive and 
highly efficient formulation of many-body quantum mechanics in general. In fact, second 
quantization can be considered the first major cornerstone on which the theoretical frame- 
work of quantum field theory was built. This being so, extensive introductions to the concept 
can be found throughout the literature. We will therefore not develop the formalism in full 
mathematical rigor but rather proceed pragmatically by first motivating and introducing 
its basic elements, followed by a discussion of the “second quantized” version of standard 
operations of quantum mechanics (taking matrix elements, changing bases, representing 
operators, etc.). The second part of the chapter is concerned with developing fluency in the 
method by addressing a number of applications. Readers familiar with the formalism may 
therefore proceed directly to these sections. 


1 The term “second quantization” is unfortunate. Historically, this terminology was motivated by the observation 
that the ladder operator algebra fosters an interpretation of quantum excitations as discrete “quantized” units. 
Fundamentally, however, there is nothing like “two” superimposed quantization steps in single- or many-particle 
quantum mechanics. Rather, one is dealing with a particular representation of the “first and only quantized” 
theory tailored to the particular problem at hand. 

2.1 Introduction to second quantization 
Motivation 


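We begin our discussion by recapitulating some fundamental notions of many-body quantum 
mechanics, as formulated in the traditional language of symmetrized/anti-symmetrized wavefunctions. 
Consider the (normalized) set of wavefunctions $|\lambda\rangle$ of some single-particle Hamiltonian $\hat H$: 
$\hat H|\lambda\rangle = \epsilon_\lambda|\lambda\rangle$, where $\epsilon_\lambda$ are the eigenvalues. With this definition, the normalized 
two-particle wavefunction $\psi_F$ ($\psi_B$) of two fermions (bosons) populating levels $\lambda_1$ and $\lambda_2$ is 
given by the anti-symmetrized (symmetrized) product 

$$\psi_F(x_1, x_2) = \frac{1}{\sqrt{2}}\left(\langle x_1|\lambda_1\rangle\langle x_2|\lambda_2\rangle - \langle x_1|\lambda_2\rangle\langle x_2|\lambda_1\rangle\right),$$

$$\psi_B(x_1, x_2) = \frac{1}{\sqrt{2}}\left(\langle x_1|\lambda_1\rangle\langle x_2|\lambda_2\rangle + \langle x_1|\lambda_2\rangle\langle x_2|\lambda_1\rangle\right).$$

[Figure: single-particle levels $\epsilon_0, \ldots, \epsilon_4$ with occupation numbers $n_\lambda$, shown for a bosonic and a fermionic configuration.] 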


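In the Dirac bracket representation, the two-body states $|\lambda_1, \lambda_2\rangle_{F(B)}$
corresponding to the wave functions
$\psi_{F(B)}(x_1, x_2) = (\langle x_1| \otimes \langle x_2|)\, |\lambda_1, \lambda_2\rangle_{F(B)}$
above can be presented as

$$ |\lambda_1, \lambda_2\rangle_{F(B)} \equiv \frac{1}{\sqrt{2}} \left( |\lambda_1\rangle \otimes |\lambda_2\rangle + \zeta\, |\lambda_2\rangle \otimes |\lambda_1\rangle \right), $$

where $\zeta = -1$ for fermions while $\zeta = 1$ for bosons.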


Note that the explicit symmetrization of the wavefunctions is necessitated by quantum mechanical
indistinguishability: for fermions (bosons) the wave function has to be anti-symmetric (symmetric)
under particle exchange.$^2$ More generally, an appropriately symmetrized N-particle wavefunction
can be expressed in the form

$$ |\lambda_1, \lambda_2, \ldots, \lambda_N\rangle \equiv \frac{1}{\sqrt{N! \prod_{\lambda=0}^{\infty} (n_\lambda!)}} \sum_{\mathcal P} \zeta^{(1 - \mathrm{sgn}\,\mathcal P)/2}\, |\lambda_{\mathcal P 1}\rangle \otimes |\lambda_{\mathcal P 2}\rangle \otimes \cdots \otimes |\lambda_{\mathcal P N}\rangle, \qquad (2.1) $$

where $n_\lambda$ represents the total number of particles in state $\lambda$ (for fermions, Pauli
exclusion enforces the constraint $n_\lambda \le 1$) - see the schematic figure above. The
summation runs over all $N!$ permutations of the set of quantum numbers
$\{\lambda_1, \ldots, \lambda_N\}$, and $\mathrm{sgn}\,\mathcal P$ denotes the sign of the
permutation $\mathcal P$. ($\mathrm{sgn}\,\mathcal P = 1$ [$-1$] if the number of transpositions of
two elements which brings the permutation $(\mathcal P 1, \mathcal P 2, \ldots, \mathcal P N)$ back
to its original form $(1, 2, \ldots, N)$ is even [odd].) The prefactor
$1/\sqrt{N! \prod_\lambda (n_\lambda!)}$ normalizes the many-body wavefunction. In the fermionic
case, wave functions corresponding to the states above are known as Slater determinants.

2 Notice, however, that in two-dimensional systems the standard doctrine of fully symmetric/anti-symmetric many
particle wave functions is too narrow and more general types of exchange statistics can be realized, cf. our
discussion on page 41.

Enrico Fermi 1901-54
Nobel Laureate in Physics in 1938 for his demonstrations of the existence of new radioactive
elements produced by neutron irradiation, and for his related discovery of nuclear reactions
brought about by slow neutrons. (Image © The Nobel Foundation.)

Finally, notice that it will be useful to assume that the quantum numbers $\{\lambda_i\}$ defining
the state $|\lambda_1, \lambda_2, \ldots, \lambda_N\rangle$ are ordered according to some
convention. (For example, for $\lambda_i = x_i$ a one-dimensional coordinate representation, we
might order according to the rule $x_1 < x_2 < \cdots < x_N$.) Once an ordered sequence of states
has been fixed we may - for notational convenience - label our quantum states by integers,
$\lambda_i = 1, 2, \ldots$. Any initially non-ordered state (e.g. $|2, 1, 3\rangle$) can be brought
into an ordered form ($|1, 2, 3\rangle$) at the cost of, at most, a change of sign.
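To make Eq. (2.1) concrete, the following is a minimal numerical sketch that builds the (anti)symmetrized state as an explicit vector in the N-fold product space and checks its normalization and its behavior under reordering. The function names, the use of integer labels starting at 0, and the choice of a finite single-particle dimension `dim` are illustrative assumptions, not part of the text.

```python
import itertools
import math
import numpy as np

def perm_sign(perm):
    """Sign of a permutation of (0, ..., n-1): +1 if even, -1 if odd."""
    perm = list(perm)
    sign = 1
    for i in range(len(perm)):
        while perm[i] != i:          # sort by transpositions, flipping the sign each time
            j = perm[i]
            perm[i], perm[j] = perm[j], perm[i]
            sign = -sign
    return sign

def many_body_state(levels, dim, zeta):
    """Vector of Eq. (2.1) for single-particle labels `levels` (ints in range(dim)).

    zeta = +1 for bosons, zeta = -1 for fermions.
    """
    n = len(levels)
    basis = np.eye(dim)              # basis[k] plays the role of |lambda = k>
    state = np.zeros(dim ** n)
    for perm in itertools.permutations(range(n)):
        factor = zeta ** ((1 - perm_sign(perm)) // 2)
        vec = np.array([1.0])
        for p in perm:               # tensor product |lambda_P1> x ... x |lambda_PN>
            vec = np.kron(vec, basis[levels[p]])
        state += factor * vec
    occupations = np.bincount(levels, minlength=dim)
    norm = math.sqrt(math.factorial(n) * math.prod(math.factorial(int(k)) for k in occupations))
    return state / norm

bosons = many_body_state((0, 0, 1), dim=2, zeta=+1)
fermions = many_body_state((0, 1, 2), dim=3, zeta=-1)
print(np.linalg.norm(bosons), np.linalg.norm(fermions))
```

Note that the loop visits all N! permutations, which is exactly the kind of cost that makes this representation impractical beyond a handful of particles.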

David Hilbert 1862-1943
He contributed to many branches of mathematics, including invariants, algebraic number fields,
functional analysis, integral equations, mathematical physics, and the calculus of variations. His
work in geometry had the greatest influence in that area after Euclid. A systematic study of the
axioms of Euclidean geometry led Hilbert to propose twenty one such axioms and he analyzed their
significance.

INFO For the sake of completeness, let us spell out the connection between the permutation
group and many-body quantum mechanics in a more mathematical language. The basic
arena wherein N-body quantum mechanics takes place is the product space,

$$ \mathcal H^N \equiv \underbrace{\mathcal H \otimes \cdots \otimes \mathcal H}_{N\ \mathrm{copies}}, $$

of N single-particle Hilbert spaces. In this space, we have a linear representation of the
permutation group, $S^N$,$^3$ assigning to each $\mathcal P \in S^N$ the permutation (no ordering
of the $\lambda$s implied at this stage),

$$ \mathcal P : \mathcal H^N \to \mathcal H^N, \qquad |\lambda_1\rangle \otimes \cdots \otimes |\lambda_N\rangle \mapsto |\lambda_{\mathcal P 1}\rangle \otimes \cdots \otimes |\lambda_{\mathcal P N}\rangle. $$


3 Recall that a linear representation of a group $G$ is a mapping that assigns to each $g \in G$ a linear mapping
$\rho_g : V \to V$ of some vector space $V$. For notational convenience one usually writes $g : V \to V$ instead of
$\rho_g : V \to V$. Conceptually, however, it is often important to carefully discriminate between the abstract group
elements $g$ and the matrices (also $g$) assigned to them by a given representation. (Consider, for example, the
symmetry group $G = \mathrm{SU}(2)$ of quantum mechanical spin. SU(2) is the group of two-dimensional unitary matrices
with determinant one. However, when acting in the Hilbert space of a quantum spin $S = 5$, say, elements of
SU(2) are represented by $(2S + 1 = 11)$-dimensional matrices.) Two representations $\rho$ and $\rho'$ that differ only by
a unitary transformation, $\forall g \in G : \rho_g = U \rho'_g U^{-1}$, are called unitary equivalent. If a transformation $U$ can be
found such that all representation matrices $\rho_g$ assume a block structure, the representation is called reducible,
and otherwise irreducible. Notice that the different sub-blocks of a reducible representation by themselves form
irreducible representation spaces. The identification of all distinct irreducible representations of a given group
is one of the most important objectives of group theory.

The identification of all irreducible subspaces of this representation is a formidable task that,
thanks to a fundamental axiom of quantum mechanics, we need not address in full. All we need
to know is that $S^N$ has two particularly simple one-dimensional irreducible representations: one
wherein each $\mathcal P \in S^N$ acts as the identity transform, $\mathcal P(\Psi) \equiv \Psi$, and another, the
alternating representation, $\mathcal P(\Psi) = \mathrm{sgn}\,\mathcal P \cdot \Psi$. According to a basic postulate of quantum
mechanics, the state vectors $\Psi \in \mathcal H^N$ describing bosons/fermions must transform according to the
identity/alternating representation. The subset $\mathcal F^N \subset \mathcal H^N$ of all states showing this transformation
behavior defines the physical N-body Hilbert space. To construct a basis of $\mathcal F^N$, one may apply
the symmetrization operator $P^s \equiv \sum_{\mathcal P} \mathcal P$ (anti-symmetrization operator
$P^a \equiv \sum_{\mathcal P} (\mathrm{sgn}\,\mathcal P)\, \mathcal P$) to the basis vectors
$|\lambda_1\rangle \otimes \cdots \otimes |\lambda_N\rangle$ of $\mathcal H^N$. Up to normalization, this operation obtains the states
(2.1).

Some readers may wonder why we mention these representation-theoretic aspects. Being prag- 
matic, all we really need to know is the symmetrization/anti-symmetrization postulate, and its 
implementation through Eq. (2.1). Notice, however, that one may justly question what we actu- 
ally mean when we talk about the permutation exchange of quantum numbers. For example, 
when we compare wavefunctions that differ by an exchange of coordinates, we should, at least 
in principle, be able to tell by what physical operation we effect this exchange (for, otherwise, 
we cannot really compare them other than in a formal and, in fact, in an ambiguous sense). 

Oddly enough, decades passed before this crucial issue in quantum mechanics was critically 
addressed. In a now seminal work by Leinaas and Myrheim 4 it was shown that the standard 
paradigm of permutation exchange is far from innocent. In particular, it turned out that its appli- 
cability is tied to the dimensionality of space! Specifically, in two-dimensional spaces (in a sense, 
also in d = 1) a more elaborate scheme is needed. (Still one may use representation-theoretic 
concepts to describe particle exchange. However, the relevant group - the braid group - now 
differs from the permutation group.) Physically, these phenomena manifest themselves in the 
emergence of quantum particles different from both bosons and fermions. For a further discussion 
of these “anyons” we refer to Chapter 9. 

While representations like Eq. (2.1) can be used to represent the full Hilbert space of many-body
quantum mechanics, a moment’s thought shows that this formulation is not at all convenient:

▷ It takes little imagination to anticipate that practical computation in the language of
Eq. (2.1) will be cumbersome. For example, to compute the overlap of two wavefunctions
one needs to form no less than $(N!)^2$ different products.
▷ The representation is tailor-made for problems with fixed particle number $N$. However,
we know from statistical mechanics that for $N = O(10^{23})$ it is much more convenient to
work in a grand canonical formulation where $N$ is allowed to fluctuate.
▷ Closely related to the above, in applications one will often ask questions such as, “What is
the amplitude for injection of a particle into the system at a certain space-time coordinate
$(x_1, t_1)$ followed by annihilation at some later time $(x_2, t_2)$?” Ideally, one would work
with a representation that supports the intuition afforded by thinking in terms of such
processes: i.e. a representation where the quantum numbers of individual quasi-particles
rather than the entangled set of quantum numbers of all constituents are fundamental.

4 J. M. Leinaas and J. Myrheim, On the theory of identical particles, Il Nuovo Cimento B 37 (1977), 1-23.

The “second quantized” formulation of many-body quantum mechanics, as introduced in 
the next subsection, will remove all these difficulties in an elegant and efficient manner. 


The apparatus of second quantization 

Occupation number representation and Fock space 
Some of the disadvantages of the representation (2.1) can be avoided with relatively little
effort. In our present notation, quantum states are represented by “N-letter words” of the
form $|1, 1, 1, 1, 2, 2, 3, 3, 3, 4, 6, 6, \ldots\rangle$. Obviously, this notation contains a lot of redundancy.
A more efficient encoding of the state above might read $|4, 2, 3, 1, 0, 2, \ldots\rangle$, where the $i$th
number signals how many particles occupy state number $i$; no more information is needed
to characterize a symmetrized state. (For fermions, these occupation numbers take a value
of either zero or one.) This defines the “occupation number representation.” In the
new representation, the basis states of $\mathcal F^N$ are specified by $|n_1, n_2, \ldots\rangle$, where $\sum_i n_i = N$.
Any state $|\Psi\rangle$ in $\mathcal F^N$ can be obtained by a linear superposition

$$ |\Psi\rangle = \sum_{n_1, n_2, \ldots,\ \sum_i n_i = N} c_{n_1, n_2, \ldots}\, |n_1, n_2, \ldots\rangle. $$
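The passage from an “N-letter word” to occupation numbers is just a counting operation. As a small illustration (the function name and the explicit number of levels are assumptions made for this sketch), the example state quoted above can be converted as follows:

```python
from collections import Counter

def occupation_numbers(word, n_levels):
    """Count how many particles occupy each level 1..n_levels in an ordered N-letter word."""
    counts = Counter(word)
    return [counts.get(level, 0) for level in range(1, n_levels + 1)]

word = [1, 1, 1, 1, 2, 2, 3, 3, 3, 4, 6, 6]
print(occupation_numbers(word, n_levels=6))  # [4, 2, 3, 1, 0, 2]
```

The sum of the returned list recovers the particle number N, here 12.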

As pointed out above, eventually we will want to emancipate ourselves from the condition 
of a fixed particle number N. A Hilbert space large enough to accommodate a state with 
an undetermined number of particles is given by 

$$ \mathcal F \equiv \bigoplus_{N=0}^{\infty} \mathcal F^N. \qquad (2.2) $$

Notice that the direct sum contains a curious contribution $\mathcal F^0$, the “vacuum space.” This
is a one-dimensional Hilbert space which describes the sector of the theory with no particles
present. Its single normalized basis state, the vacuum state, is denoted by $|0\rangle$. We will soon
see why it is convenient to add this strange animal to our family of basis states. The space
$\mathcal F$ is called Fock space and it defines the principal arena of quantum many-body theory.

To obtain a basis of $\mathcal F$, we need only take the totality of our previous basis states
$\{|n_1, n_2, \ldots\rangle\}$, and drop the condition $\sum_i n_i = N$ on the occupation numbers. A general
many-body state $|\Psi\rangle$ can then be represented by a linear superposition
$|\Psi\rangle = \sum_{n_1, n_2, \ldots} c_{n_1, n_2, \ldots}\, |n_1, n_2, \ldots\rangle$. Notice that states of different particle
numbers may contribute to the linear superposition forming $|\Psi\rangle$. We shall see that such
mixtures play an important role, for example in the theory of superconductivity.

Vladimir Aleksandrovich Fock 1898-1974
One of the main participants in the history of the general theory of relativity in Russia. His
groundbreaking contributions to many-body theory include the introduction of Fock space and the
development of perhaps the most important many-particle approximation scheme, the Hartree-Fock
approximation (see Chapter 5).


Foundations of second quantization 

The occupation number representation introduced above provides a step in the right direc- 
tion, but it does not yet solve our main problem, the need for explicit symmetrization/anti- 
symmetrization of a large number of states in each quantum operation. 

As a first step towards the construction of a more powerful representation, let us recall
an elementary fact of linear algebra: a linear map $A : V \to V$ of a vector space into itself is
fully determined by defining the images $w_i \equiv A v_i$ of the action of $A$ on a basis $\{v_i\}$. Now
let us use this scheme to introduce a set of linear operators acting in Fock space. For every
$i = 1, 2, \ldots$, we define operators $a_i^\dagger : \mathcal F \to \mathcal F$ through

$$ a_i^\dagger |n_1, \ldots, n_i, \ldots\rangle \equiv (n_i + 1)^{1/2}\, \zeta^{s_i}\, |n_1, \ldots, n_i + 1, \ldots\rangle, \qquad (2.3) $$

where $s_i = \sum_{j=1}^{i-1} n_j$. In the fermionic case, the occupation numbers $n_i$ have to be understood
mod 2. Specifically, $(1 + 1) = 0$ mod 2, i.e. the application of $a_i^\dagger$ to a state with $n_i = 1$
annihilates this state.

Notice that by virtue of this definition we are able to generate every basis state of $\mathcal F$ by
repeated application of $a_i^\dagger$s to the vacuum state. (From a formal point of view, this fact
alone is motivation enough to add the vacuum space to the definition of Fock space.) Indeed,
repeated application of Eq. (2.3) leads to the important relation

$$ |n_1, n_2, \ldots\rangle = \prod_i \frac{1}{(n_i!)^{1/2}}\, (a_i^\dagger)^{n_i}\, |0\rangle. \qquad (2.4) $$


Notice that Eq. (2.4) presents a strong statement: the complicated permutation “entanglement”
implied in the definition (2.1) of the Fock states can be generated by straightforward
application of a set of linear operators to a single reference state. Physically, N-fold
application of operators $a^\dagger$ to the empty vacuum state generates an N-particle state, which is why
the $a^\dagger$s are commonly called creation operators. Of course, the introduction of creation
operators might still turn out to be useless, i.e. consistency with the properties of the Fock
states (such as the fact that, in the fermionic case, the numbers $n_i = 0, 1$ are defined only
mod 2), might invalidate the simple relation (2.3) with its ($n_i$-independent!) operator $a_i^\dagger$.
However, as we shall demonstrate below, this is not the case.
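For a single bosonic level, relation (2.4) can be checked directly with small matrices. The sketch below represents $a^\dagger$ in the basis $|0\rangle, |1\rangle, \ldots$ using Eq. (2.3) with $\zeta = 1$; the truncation of the occupation number at `cutoff` is an assumption needed to obtain finite matrices and is harmless as long as one creates no more than `cutoff` particles.

```python
import math
import numpy as np

def boson_creator(cutoff):
    """Matrix of a^dagger for one bosonic level, restricted to n = 0, ..., cutoff."""
    mat = np.zeros((cutoff + 1, cutoff + 1))
    for n in range(cutoff):
        mat[n + 1, n] = math.sqrt(n + 1)   # a^dagger |n> = (n + 1)^(1/2) |n + 1>
    return mat

def number_state(n, cutoff):
    """Eq. (2.4) for a single level: |n> = (n!)^(-1/2) (a^dagger)^n |0>."""
    a_dag = boson_creator(cutoff)
    state = np.zeros(cutoff + 1)
    state[0] = 1.0                          # vacuum |0>
    for _ in range(n):
        state = a_dag @ state
    return state / math.sqrt(math.factorial(n))

# Up to rounding, this is the unit vector with a single 1 at index 3.
print(number_state(3, cutoff=5))
```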

Consider two operators $a_i^\dagger$ and $a_j^\dagger$ for $i \ne j$. From the definition (2.3), one may readily
verify that $(a_i^\dagger a_j^\dagger - \zeta a_j^\dagger a_i^\dagger)|n_1, n_2, \ldots\rangle = 0$. Holding for every basis vector, this
relation implies that $[a_i^\dagger, a_j^\dagger]_\zeta = 0$, where

$$ [\hat A, \hat B]_\zeta \equiv \hat A \hat B - \zeta \hat B \hat A, $$

i.e. $[\,,]_{\zeta=1} \equiv [\,,]$ is the commutator and $[\,,]_{\zeta=-1} \equiv \{\,,\} \equiv [\,,]_+$ the anti-commutator.
Turning to the case $i = j$, we note that, for fermions, the two-fold application of $a_i^\dagger$ to any
state leads to its annihilation. Thus, $(a_i^\dagger)^2 = 0$ is nilpotent, a fact that can be formulated as
$[a_i^\dagger, a_i^\dagger]_+ = 0$. For bosons we have, of course, $[a_i^\dagger, a_i^\dagger] = 0$ (identical operators commute!).
Summarizing, we have found that the creation operators obey the commutation relation

$$ \forall i, j : \quad [a_i^\dagger, a_j^\dagger]_\zeta = 0. \qquad (2.5) $$

Figure 2.1 Visualization of the generation of the Fock subspaces $\mathcal F^N$ by repeated action of creation
operators on the vacuum space $\mathcal F^0$.

Now, quantum mechanics is a unitary theory so, whenever one meets a new operator $\hat A$,
one should determine its Hermitian adjoint $\hat A^\dagger$. To understand the action of the Hermitian
adjoints $(a_i^\dagger)^\dagger = a_i$ of the creation operators we may take the complex conjugates of all
basis matrix elements of Eq. (2.3):

$$ \langle n_1, \ldots, n_i, \ldots | a_i^\dagger | n_1', \ldots, n_i', \ldots\rangle = (n_i' + 1)^{1/2}\, \zeta^{s_i'}\, \delta_{n_1, n_1'} \cdots \delta_{n_i, n_i' + 1} \cdots $$

$$ \Rightarrow \quad \langle n_1', \ldots, n_i', \ldots | a_i | n_1, \ldots, n_i, \ldots\rangle^* = n_i^{1/2}\, \zeta^{s_i}\, \delta_{n_1', n_1} \cdots \delta_{n_i', n_i - 1} \cdots $$

Holding for every bra $\langle n_1', \ldots, n_i', \ldots|$, the last line tells us that

$$ a_i |n_1, \ldots, n_i, \ldots\rangle = n_i^{1/2}\, \zeta^{s_i}\, |n_1, \ldots, n_i - 1, \ldots\rangle, \qquad (2.6) $$

a relation that identifies $a_i$ as an operator that “annihilates” particles. The action of creation
and annihilation operators in Fock space is illustrated in Fig. 2.1. Creation operators
$a^\dagger : \mathcal F^N \to \mathcal F^{N+1}$ increase the particle number by one, while annihilation operators
$a : \mathcal F^N \to \mathcal F^{N-1}$ lower it by one; the application of an annihilation operator to the vacuum
state, $a_i|0\rangle = 0$, annihilates it. (Do not confuse the vector $|0\rangle$ with the number zero.)

Taking the Hermitian adjoint of Eq. (2.5) we obtain $[a_i, a_j]_\zeta = 0$. Further, a straightforward
calculation based on the definitions (2.3) and (2.6) shows that $[a_i, a_j^\dagger]_\zeta = \delta_{ij}$.
Altogether, we have shown that the creation and annihilation operators satisfy the algebraic
closure relation

$$ [a_i, a_j^\dagger]_\zeta = \delta_{ij}, \qquad [a_i, a_j]_\zeta = 0, \qquad [a_i^\dagger, a_j^\dagger]_\zeta = 0. \qquad (2.7) $$

Given that the full complexity of Fock space is generated by application of $a_i^\dagger$s to a single
reference state, the simplicity of the relations obeyed by these operators seems remarkable
and surprising.
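The relations (2.7) can be verified by brute force for a few fermionic levels. The following sketch constructs the matrices of $a_i^\dagger$ directly from the definition (2.3) with $\zeta = -1$ in the occupation number basis and then tests all three relations; the function names and the restriction to three levels are illustrative assumptions.

```python
import itertools
import numpy as np

def fermion_creators(n_levels):
    """Matrices of a_i^dagger in the basis |n_1, ..., n_M>, built from Eq. (2.3) with zeta = -1."""
    states = list(itertools.product((0, 1), repeat=n_levels))
    index = {state: k for k, state in enumerate(states)}
    creators = []
    for i in range(n_levels):
        mat = np.zeros((len(states), len(states)))
        for state in states:
            if state[i] == 1:
                continue                    # occupied level: a_i^dagger gives zero (n_i mod 2)
            sign = (-1) ** sum(state[:i])   # zeta^{s_i} with s_i = n_1 + ... + n_{i-1}
            target = state[:i] + (1,) + state[i + 1:]
            mat[index[target], index[state]] = sign
        creators.append(mat)
    return creators

def graded_commutator(a, b, zeta):
    """[A, B]_zeta = AB - zeta BA; zeta = -1 gives the anti-commutator."""
    return a @ b - zeta * b @ a

n_levels = 3
a_dag = fermion_creators(n_levels)
a = [m.T for m in a_dag]                    # real matrices, so the adjoint is the transpose
identity = np.eye(2 ** n_levels)

for i in range(n_levels):
    for j in range(n_levels):
        assert np.allclose(graded_commutator(a[i], a_dag[j], -1), identity * (i == j))
        assert np.allclose(graded_commutator(a[i], a[j], -1), 0)
        assert np.allclose(graded_commutator(a_dag[i], a_dag[j], -1), 0)
print("relations (2.7) hold for", n_levels, "fermionic levels")
```

The sign factor $\zeta^{s_i}$ is what makes operators on different levels anti-commute; dropping it would leave them commuting.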


INFO Perhaps less surprising is that, behind this phenomenon, there lingers some mathematical
structure. Suppose we are given an abstract algebra $\mathcal A$ of objects $a_i, a_i^\dagger$ satisfying the relation
(2.7). (Recall that an algebra is a vector space whose elements can be multiplied by each other.)
Further suppose that $\mathcal A$ is irreducibly represented in some vector space $V$, i.e. that there is a
mapping assigning to each $a_i \in \mathcal A$ a linear mapping $a_i : V \to V$, such that every vector $|v\rangle \in V$
can be reached from any other $|w\rangle \in V$ by (possibly iterated) application of operators $a_i$ and $a_i^\dagger$
(irreducibility).$^5$ According to the Stone-von Neumann theorem (a) such a representation
is unique (up to unitary equivalence), and (b) there is a unique state $|0\rangle \in V$ that is annihilated
by every $a_i$. All other states can then be reached by repeated application of $a_i^\dagger$s. The
precise formulation of this theorem, and its proof - a good practical exercise in working with
creation/annihilation operators - are left as Problem 2.4. From the Stone-von Neumann theorem,
we can infer that the Fock space basis could have been constructed in reverse. Not knowing
the basis $\{|n_1, n_2, \ldots\rangle\}$, we could have started from a set of operators obeying the commutation
relations (2.7) acting in some a priori unknown space $\mathcal F$. Starting from the unique state $|0\rangle$,
the prescription (2.4) would then have yielded an equally unique basis of the entire space $\mathcal F$ (up
to unitary transformations). In other words, the algebra (2.7) fully characterizes the operator
action and provides information equivalent to the definitions (2.3) and (2.6).

Practical aspects 

Our next task will be to promote the characterization of Fock space bases introduced above
to a full reformulation of many-body quantum mechanics. To this end, we need to find
out how changes from one single-particle basis $\{|\lambda\rangle\}$ to another $\{|\tilde\lambda\rangle\}$ affect the operator
algebra $\{a_\lambda\}$. (In this section we shall no longer use integers to identify different elements
of a given single-particle basis. Rather, we use Greek labels $\lambda$, i.e. $a_\lambda^\dagger$ creates a particle in
state $\lambda$.) Equally important, we need to understand in what way generic operators acting
in many-particle Hilbert spaces can be represented in terms of creation and annihilation
operators.

▷ Change of basis: Using the resolution of identity $\mathrm{id} = \sum_{\lambda=0}^{\infty} |\lambda\rangle\langle\lambda|$, the relations
$|\tilde\lambda\rangle = \sum_\lambda |\lambda\rangle\langle\lambda|\tilde\lambda\rangle$, $|\lambda\rangle = a_\lambda^\dagger|0\rangle$, and $|\tilde\lambda\rangle = a_{\tilde\lambda}^\dagger|0\rangle$ immediately give rise to the
transformation law

$$ a_{\tilde\lambda}^\dagger = \sum_\lambda \langle\lambda|\tilde\lambda\rangle\, a_\lambda^\dagger, \qquad a_{\tilde\lambda} = \sum_\lambda \langle\tilde\lambda|\lambda\rangle\, a_\lambda. \qquad (2.8) $$

In many applications, we will be dealing with continuous sets of quantum numbers (such
as position coordinates). In these cases, the quantum numbers are commonly denoted
by a bracket notation $a_\lambda \to a(x) = \sum_\lambda \langle x|\lambda\rangle a_\lambda$ and the summations appearing in the
transformation formula above translate to integrals: $a_\lambda = \int dx\, \langle\lambda|x\rangle a(x)$.

EXERCISE The transformation from the coordinate to the Fourier momentum representation
in a finite one-dimensional system of length $L$ would read

$$ a_k = \int_0^L dx\, \langle k|x\rangle\, a(x), \qquad a(x) = \sum_k \langle x|k\rangle\, a_k, \qquad (2.9) $$

where $\langle k|x\rangle \equiv \langle x|k\rangle^* = e^{-ikx}/\sqrt{L}$.
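As a quick consistency check of Eq. (2.9), one can verify that the Fourier-transformed operators inherit the algebra (2.7). The sketch below relies on two assumptions that are not spelled out in the text: periodic boundary conditions, so that $k = 2\pi m/L$ with integer $m$, and the continuum form $[a(x), a^\dagger(x')]_\zeta = \delta(x - x')$ of the relations (2.7).

```latex
\begin{align*}
[a_k, a_{k'}^\dagger]_\zeta
  &= \int_0^L dx \int_0^L dx'\, \langle k|x\rangle \langle x'|k'\rangle\,
     [a(x), a^\dagger(x')]_\zeta \\
  &= \int_0^L dx\, \langle k|x\rangle \langle x|k'\rangle
   = \frac{1}{L} \int_0^L dx\, e^{-i(k-k')x}
   = \delta_{kk'} .
\end{align*}
```

The remaining relations, $[a_k, a_{k'}]_\zeta = 0$ and its adjoint, follow in the same way because the transformation is linear in the $a(x)$.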

5 To appropriately characterize the representation, we need to be a bit more precise. Within $\mathcal A$, $a_i$ and $a_i^\dagger$ are
independent objects, i.e. in general there exists no notion of Hermitian adjointness in $\mathcal A$. We require, though,
that the representation assigns to $a_i^\dagger$ the Hermitian adjoint (in $V$) of the image of $a_i$. Also, we have to require
that $[a_i, a_j^\dagger] \in \mathcal A$ be mapped onto $[a_i, a_j^\dagger] : V \to V$ where, in the latter expression, the commutator involves the
ordinary product of matrices $a_i, a_j^\dagger : V \to V$.

▷ Representation of operators (one-body): Single-particle or one-body operators $\hat{\mathcal O}_1$
acting in the N-particle Hilbert space $\mathcal F^N$ generally take the form $\hat{\mathcal O}_1 = \sum_{n=1}^{N} \hat o_n$, where
$\hat o_n$ is an ordinary single-particle operator acting on the $n$th particle. A typical example
is the kinetic energy operator $\hat T = \sum_n \hat p_n^2/2m$, where $\hat p_n$ is the momentum operator
acting on the $n$th particle. Other examples include the one-particle potential operator
$\hat V = \sum_n V(\hat x_n)$, where $V(x)$ is a scalar potential, the total spin operator $\sum_n \hat{\mathbf S}_n$, etc. Since
we have seen that, by applying field operators to the vacuum space, we can generate the
Fock space in general and any N-particle Hilbert space in particular, it must be possible
to represent any operator $\hat{\mathcal O}_1$ in an $a$-representation.

Now, although the representation of n-body operators is, after all, quite straightforward,
the construction can, at first sight, seem daunting. A convenient way of finding such
a representation is to express the operator in terms of a basis in which it is diagonal,
and only later transform to an arbitrary basis. For this purpose it is useful to define the
occupation number operator

$$ \hat n_\lambda = a_\lambda^\dagger a_\lambda, \qquad (2.10) $$

with the property that, for bosons or fermions (exercise), $\hat n_\lambda (a_\lambda^\dagger)^n|0\rangle = n\, (a_\lambda^\dagger)^n|0\rangle$.
Since $\hat n_\lambda$ commutes with all $a_{\lambda' \ne \lambda}^\dagger$, Eq. (2.4) readily implies that
$\hat n_{\lambda_j}|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle = n_{\lambda_j}|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle$, i.e., $\hat n_\lambda$ simply counts the number of
particles in state $\lambda$ (hence the name “occupation number operator”). Let us now consider a
one-body operator, $\hat{\mathcal O}_1$, which is diagonal in the basis $|\lambda\rangle$, with
$\hat o = \sum_i o_{\lambda_i} |\lambda_i\rangle\langle\lambda_i|$, $o_{\lambda_i} = \langle\lambda_i|\hat o|\lambda_i\rangle$. With this definition,
one finds that

$$ \langle n'_{\lambda_1}, n'_{\lambda_2}, \ldots | \hat{\mathcal O}_1 | n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle = \sum_i o_{\lambda_i} n_{\lambda_i}\, \langle n'_{\lambda_1}, n'_{\lambda_2}, \ldots | n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle $$

$$ = \langle n'_{\lambda_1}, n'_{\lambda_2}, \ldots | \sum_i o_{\lambda_i} \hat n_{\lambda_i} | n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle. $$

Since this equality holds for any set of states, one can infer the second quantized
representation of the operator $\hat{\mathcal O}_1$,

$$ \hat{\mathcal O}_1 = \sum_{\lambda=0}^{\infty} o_\lambda \hat n_\lambda = \sum_{\lambda=0}^{\infty} \langle\lambda|\hat o|\lambda\rangle\, a_\lambda^\dagger a_\lambda. $$

The result is straightforward: a one-body operator engages a single particle at a time - the
others are just spectators. In the diagonal representation, one simply counts the number
of particles in a state $\lambda$ and multiplies by the corresponding eigenvalue of the one-body
operator. Finally, by transforming from the diagonal representation to a general basis,
one obtains the general result,

$$ \hat{\mathcal O}_1 = \sum_{\mu\nu} \langle\mu|\hat o|\nu\rangle\, a_\mu^\dagger a_\nu. \qquad (2.11) $$

To cement these ideas, let us consider some specific examples: representing the matrix
elements of the single-particle spin operator as $(S_i)_{\alpha\alpha'} = \frac{1}{2}(\sigma_i)_{\alpha\alpha'}$, where $\alpha, \alpha'$ is a
two-component spin index and $\sigma_i$ are the Pauli spin matrices

$$ \sigma_1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma_2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \sigma_3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \qquad (2.12) $$

the spin operator of a many-body system assumes the form

$$ \hat{\mathbf S} = \sum_\lambda a_{\lambda\alpha'}^\dagger\, \mathbf S_{\alpha'\alpha}\, a_{\lambda\alpha}. \qquad (2.13) $$

(Here, $\lambda$ denotes the set of additional quantum numbers, e.g. a lattice site index.) When
second quantized in the position representation, one can show that the one-body Hamiltonian
for a free particle is given as a sum of kinetic and potential energy as

$$ \hat H = \int d^d r\; a^\dagger(\mathbf r) \left[ \frac{\hat{\mathbf p}^2}{2m} + V(\mathbf r) \right] a(\mathbf r), \qquad (2.14) $$

where $\hat{\mathbf p} = -i\hbar\partial$.

Wolfgang Ernst Pauli 1900-58
Nobel Laureate in Physics in 1945 “for the discovery of the Exclusion Principle, also called the
Pauli Principle.” The exclusion principle crystallized the existing knowledge of atomic structure
at the time it was postulated and it led to the recognition of the two-valued variable required
to characterize the state of an electron. Pauli was the first to recognize the existence of the
neutrino. (Image © The Nobel Foundation.)
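The general one-body formula (2.11) lends itself to a simple numerical test for fermions: if $\hat{\mathcal O}_1 = \sum_{\mu\nu} o_{\mu\nu} a_\mu^\dagger a_\nu$, its eigenvalues in Fock space should be all sums $\sum_\lambda n_\lambda \epsilon_\lambda$ with $n_\lambda \in \{0, 1\}$, where $\epsilon_\lambda$ are the eigenvalues of the single-particle matrix $o$. The sketch below is one possible check under stated assumptions: the annihilation operators are built as Kronecker products (a construction equivalent to (2.6) with $\zeta = -1$ for this basis ordering), and the Hermitian matrix `o` is a random, purely hypothetical example.

```python
import itertools
import numpy as np

def fermion_annihilators(n_levels):
    """Annihilation operators a_i as Kronecker products; the parity factors supply zeta^{s_i}."""
    lower = np.array([[0.0, 1.0], [0.0, 0.0]])   # maps |1> to |0> on one level
    parity = np.diag([1.0, -1.0])                # (-1)^n on one level
    identity = np.eye(2)
    ops = []
    for i in range(n_levels):
        factors = [parity] * i + [lower] + [identity] * (n_levels - i - 1)
        mat = np.array([[1.0]])
        for factor in factors:
            mat = np.kron(mat, factor)
        ops.append(mat)
    return ops

def one_body_operator(o, a):
    """Eq. (2.11): sum over mu, nu of o[mu, nu] a_mu^dagger a_nu."""
    dim = a[0].shape[0]
    op = np.zeros((dim, dim), dtype=complex)
    for mu, nu in itertools.product(range(len(a)), repeat=2):
        op += o[mu, nu] * (a[mu].conj().T @ a[nu])
    return op

rng = np.random.default_rng(0)
m = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
o = (m + m.conj().T) / 2                         # hypothetical Hermitian single-particle matrix

a = fermion_annihilators(3)
big_o = one_body_operator(o, a)

eps = np.linalg.eigvalsh(o)
expected = sorted(float(np.dot(occ, eps)) for occ in itertools.product((0, 1), repeat=3))
print(np.max(np.abs(np.linalg.eigvalsh(big_o) - expected)))   # should be at rounding level
```

The agreement of the two spectra reflects the statement in the text that, in the diagonal basis, the operator simply counts particles and weights them with the corresponding eigenvalue.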

EXERCISE Starting with the momentum representation (in which the kinetic energy is diagonal),
transform to the position representation and thereby establish Eq. (2.14).

The local density operator $\hat\rho(\mathbf r)$, measuring the particle density at a certain coordinate
$\mathbf r$, is simply given by

$$ \hat\rho(\mathbf r) = a^\dagger(\mathbf r)\, a(\mathbf r). \qquad (2.15) $$

Finally, the total occupation number operator, obtained by integrating over the particle
density, is defined by $\hat N = \int d^d r\; a^\dagger(\mathbf r) a(\mathbf r)$. In a theory with discrete quantum numbers, this
operator assumes the form $\hat N = \sum_\lambda a_\lambda^\dagger a_\lambda$.

▷ Representation of operators (two-body): Two-body operators $\hat{\mathcal O}_2$ are needed to
describe pairwise interactions between particles. Although pair-interaction potentials are
straightforwardly included in classical many-body theories, their embedding into conventional
many-body quantum mechanics is made cumbersome by particle indistinguishability.
The formulation of interaction processes within the language of second quantization
is considerably more straightforward:

Initially, let us consider particles subject to the symmetric two-body potential
$V(\mathbf r_m, \mathbf r_n) \equiv V(\mathbf r_n, \mathbf r_m)$ between two particles at position $\mathbf r_m$ and $\mathbf r_n$. Our aim is to
find an operator $\hat V$ in second quantized form whose action on a many-body state gives
(presently, it is more convenient to use the original representation Eq. (2.1) rather than
the occupation number representation)

$$ \hat V |\mathbf r_1, \mathbf r_2, \ldots, \mathbf r_N\rangle = \sum_{n < m}^{N} V(\mathbf r_n, \mathbf r_m)\, |\mathbf r_1, \mathbf r_2, \ldots, \mathbf r_N\rangle = \frac{1}{2} \sum_{n \ne m}^{N} V(\mathbf r_n, \mathbf r_m)\, |\mathbf r_1, \mathbf r_2, \ldots, \mathbf r_N\rangle. $$

When this is compared with the one-point function, one might immediately guess that

$$ \hat V = \frac{1}{2} \int d^d r \int d^d r'\; a^\dagger(\mathbf r)\, a^\dagger(\mathbf r')\, V(\mathbf r, \mathbf r')\, a(\mathbf r')\, a(\mathbf r). $$

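That this is the correct answer can be confirmed by applying the operator to a many-body
state:

$$ a^\dagger(\mathbf r) a^\dagger(\mathbf r') a(\mathbf r') a(\mathbf r)\, |\mathbf r_1, \mathbf r_2, \ldots, \mathbf r_N\rangle = a^\dagger(\mathbf r) a^\dagger(\mathbf r') a(\mathbf r') a(\mathbf r)\, a^\dagger(\mathbf r_1) \cdots a^\dagger(\mathbf r_N)|0\rangle $$

$$ = \sum_{n=1}^{N} \zeta^{n-1}\, \delta(\mathbf r - \mathbf r_n)\, a^\dagger(\mathbf r_n)\, a^\dagger(\mathbf r') a(\mathbf r')\, a^\dagger(\mathbf r_1) \cdots a^\dagger(\mathbf r_{n-1}) a^\dagger(\mathbf r_{n+1}) \cdots a^\dagger(\mathbf r_N)|0\rangle $$

$$ = \sum_{n=1}^{N} \zeta^{n-1}\, \delta(\mathbf r - \mathbf r_n) \sum_{m (\ne n)}^{N} \delta(\mathbf r' - \mathbf r_m)\, a^\dagger(\mathbf r_n)\, a^\dagger(\mathbf r_1) \cdots a^\dagger(\mathbf r_{n-1}) a^\dagger(\mathbf r_{n+1}) \cdots a^\dagger(\mathbf r_N)|0\rangle $$

$$ = \sum_{n, m \ne n}^{N} \delta(\mathbf r - \mathbf r_n)\, \delta(\mathbf r' - \mathbf r_m)\, |\mathbf r_1, \mathbf r_2, \ldots, \mathbf r_N\rangle. $$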


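Multiplying by $V(\mathbf r, \mathbf r')/2$, and integrating over $\mathbf r$ and $\mathbf r'$, one confirms the validity
of the expression. It is left as an exercise to confirm that the naive expression
$\frac{1}{2} \int d^d r \int d^d r'\; V(\mathbf r, \mathbf r')\, \hat\rho(\mathbf r) \hat\rho(\mathbf r')$ does not reproduce the two-body operator. More generally,
turning to a non-diagonal basis, it is straightforward to confirm that a general two-body
operator can be expressed in the form

$$ \hat{\mathcal O}_2 = \sum_{\lambda\lambda'\mu\mu'} \mathcal O_{\mu,\mu',\lambda,\lambda'}\, a_{\mu'}^\dagger a_\mu^\dagger a_\lambda a_{\lambda'}, \qquad (2.16) $$

where $\mathcal O_{\mu,\mu',\lambda,\lambda'} \equiv \langle\mu, \mu'|\hat{\mathcal O}_2|\lambda, \lambda'\rangle$.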
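The exercise on the naive density-density expression above can be approached with a single line of operator algebra. The following is a sketch rather than a full solution; it assumes the continuum form of the relations (2.7), i.e. $a(\mathbf r) a^\dagger(\mathbf r') = \delta(\mathbf r - \mathbf r') + \zeta\, a^\dagger(\mathbf r') a(\mathbf r)$ and $a(\mathbf r) a(\mathbf r') = \zeta\, a(\mathbf r') a(\mathbf r)$, together with the definition (2.15) of $\hat\rho$.

```latex
\begin{align*}
\hat\rho(\mathbf r)\,\hat\rho(\mathbf r')
  &= a^\dagger(\mathbf r)\, a(\mathbf r)\, a^\dagger(\mathbf r')\, a(\mathbf r') \\
  &= a^\dagger(\mathbf r) \left[ \delta(\mathbf r - \mathbf r')
     + \zeta\, a^\dagger(\mathbf r')\, a(\mathbf r) \right] a(\mathbf r') \\
  &= \delta(\mathbf r - \mathbf r')\, \hat\rho(\mathbf r)
     + \zeta^2\, a^\dagger(\mathbf r)\, a^\dagger(\mathbf r')\, a(\mathbf r')\, a(\mathbf r), \\[4pt]
\frac{1}{2} \int d^d r \int d^d r'\; V(\mathbf r, \mathbf r')\,
  \hat\rho(\mathbf r)\,\hat\rho(\mathbf r')
  &= \hat V + \frac{1}{2} \int d^d r\; V(\mathbf r, \mathbf r)\, \hat\rho(\mathbf r).
\end{align*}
```

Since $\zeta^2 = 1$ for both statistics, the naive expression differs from $\hat V$ by a one-body term in which each particle interacts with itself; the normal-ordered form of $\hat V$ excludes exactly this contribution.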

As well as the pairwise Coulomb interaction formulated above, another important
interaction, frequently encountered in problems of quantum magnetism, is the spin-spin
interaction. From our discussion of the second-quantized representation of spin $\hat{\mathbf S}$ above,
we can infer that the general spin-spin interaction can be presented in second-quantized
form as

$$ \hat V = \frac{1}{2} \int d^d r \int d^d r' \sum_{\alpha\alpha'\beta\beta'} J(\mathbf r, \mathbf r')\; \mathbf S_{\alpha\beta} \cdot \mathbf S_{\alpha'\beta'}\; a_\alpha^\dagger(\mathbf r)\, a_{\alpha'}^\dagger(\mathbf r')\, a_{\beta'}(\mathbf r')\, a_\beta(\mathbf r), $$

where $J(\mathbf r, \mathbf r')$ denotes the exchange interaction.

In principle, one may proceed in the same manner and represent general n-body interactions
in terms of second-quantized operators. However, as n > 2 interactions appear infrequently, 
we refer to the literature for discussion. 

This completes our formal introduction to the method of second quantization. To develop 
fluency in the operation of the method, we will continue by addressing a number of problems 
chosen from the realm of condensed matter. In doing so, we will see that second quantization 
often leads to considerable simplification of the analysis of many-particle systems. The effec- 
tive model Hamiltonians that appear below provide the input for subsequent applications 
considered in this text. Readers not wishing to get distracted from our main focus - the 
development of modern methods of quantum field theory in the condensed matter setting 
- may safely skip the next sections and turn directly to Chapter 3 below. It is worthwhile 
keeping in mind, however, that the physical motivation for the study of various prototypical 
model systems considered later in the text is given in Section 2.2. 


2.2 Applications of second quantization 

Starting from the prototype Hamiltonian (1.1) introduced in Chapter 1, we have already 
explored generic aspects of lattice dynamics in condensed matter systems. In much of the 
remaining text we will explore examples from the complementary sector focusing on the 
electronic degrees of freedom. Drawing on the first of the principles discussed in Chapter 1, 
we will begin our discussion by reducing the full Hamiltonian to a form that contains the 
essential elements of the electron dynamics. As well as the pure electron sub-Hamiltonian 
H e , the reduced Hamiltonian will involve the interaction between the electrons and the 
positively charged ionic background lattice. However, typically, lattice distortions due to 
both the motion of the ions and the ion-ion interaction couple only indirectly. (Exercise: 
Try to think of a prominent example where the electron sector is crucially influenced by 
the dynamics of the host lattice.) To a first approximation, we may, therefore, describe the 
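electron system through the simplified Hamiltonian, $\hat H = \hat H_0 + \hat V_{ee}$, where

$$ \hat H_0 = \int d^d r\; a_\sigma^\dagger(\mathbf r) \left[ \frac{\hat{\mathbf p}^2}{2m} + V(\mathbf r) \right] a_\sigma(\mathbf r), $$

$$ \hat V_{ee} = \frac{1}{2} \int d^d r \int d^d r'\; V_{ee}(\mathbf r - \mathbf r')\, a_\sigma^\dagger(\mathbf r)\, a_{\sigma'}^\dagger(\mathbf r')\, a_{\sigma'}(\mathbf r')\, a_\sigma(\mathbf r), \qquad (2.17) $$

$V(\mathbf r) = \sum_I V_{ei}(\mathbf R_I - \mathbf r)$ denotes the lattice potential experienced by the electrons, and
the coordinates of the lattice ions $\mathbf R_I$ are assumed fixed. For completeness, we have also
endowed the electrons with a spin index, $\sigma = \uparrow/\downarrow$. The Hamiltonian defines the problem of
the interacting electron gas embedded in a solid state system.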

Despite its seemingly innocuous structure, the interacting electron Hamiltonian (2.17)
accommodates a wide variety of electron phases from metals and magnets to insulators. To
classify the phase behavior of the model, it is helpful to divide our considerations, focusing
first on the properties of the non-interacting single-particle system $\hat H_0$ and then, later,
restoring the electron interaction $\hat V_{ee}$.

As we know from Bloch’s theorem, eigenstates of a periodic Hamiltonian can be presented in the
form of Bloch waves$^6$ $\psi_{\mathbf k n}(\mathbf r) = e^{i\mathbf k \cdot \mathbf r} u_{\mathbf k n}(\mathbf r)$, where the components of the crystal
momentum $\mathbf k$ take values inside the Brillouin zone, $k_i \in [-\pi/a, \pi/a]$, and we have assumed
that the periodicity of the lattice potential is the same in all directions, i.e.
$V(\mathbf r + a\mathbf e_i) = V(\mathbf r)$. The index $n$ labels the separate energy bands of the solid, and the
functions $u_{\mathbf k n}(\mathbf r + a\mathbf e_i) = u_{\mathbf k n}(\mathbf r)$ are periodic on the lattice. Now, depending on the nature
of the bonding, there are two complementary classes of materials where the general structure of
the Bloch functions can be simplified significantly.


Felix Bloch 1905-83 

Nobel Laureate in Physics in 
1952 "for the development [with 
Edward M. Purcell] of new meth- 
ods for nuclear magnetic precision 
measurements and discoveries in 
connection therewith.” (Image © 
The Nobel Foundation.) 



Nearly free electron systems 

For certain materials, notably the elemental metals drawn from groups I-IV of the periodic 
table, the outermost itinerant conduction electrons behave as if they were “nearly free,” i.e. 
their dynamics are largely oblivious to both the Coulomb potential created by the positively
charged ion background and their mutual interaction. 

INFO Loosely speaking, Pauli exclusion of the bound state inner core electrons prevents the 
conduction electrons from exploring the region close to the ion core, thereby effectively screening 
the nuclear charge. In practice, the conduction electrons experience a renormalized pseudopo- 
tential, which accommodates the effect of the lattice ions and core electrons. Moreover, the high 
mobility of the conduction electrons provides an efficient method of screening their own mutual 
Coulomb interaction. In nearly free electron compounds, complete neglect of the lattice potential 
is usually a good approximation (as long as one considers crystal momenta remote from the 
boundaries of the Brillouin zone, $k_i = \pm\pi/a$).

In practice, this means that we are at liberty to set the Bloch function to unity, $u_{\mathbf{k}n} = 1$,
and regard the eigenstates of the non-interacting Hamiltonian as plane waves. This in turn
motivates the representation of the field operators in momentum space Eq. (2.9), whereupon
the non-interacting part of the Hamiltonian assumes the free particle form (once again, we
have set $\hbar = 1$)

$$\hat H_0 = \sum_{\mathbf{k}} \frac{\mathbf{k}^2}{2m}\, a^\dagger_{\mathbf{k}\sigma} a_{\mathbf{k}\sigma}, \qquad (2.18)$$


where the sum extends over all wavevectors $\mathbf{k}$ and, as usual, summation convention of the
spin indices is assumed. Turning to the Coulomb potential between conduction electrons,
in the Fourier representation, the two-body interaction takes the form

$$\hat V_{ee} = \frac{1}{2L^d} \sum_{\mathbf{k},\mathbf{k}',\mathbf{q}} V_{ee}(\mathbf{q})\, a^\dagger_{\mathbf{k}-\mathbf{q}\sigma} a^\dagger_{\mathbf{k}'+\mathbf{q}\sigma'} a_{\mathbf{k}'\sigma'} a_{\mathbf{k}\sigma}, \qquad (2.19)$$

where (choosing units such that $4\pi\epsilon_0 = 1$), $V_{ee}(\mathbf{q}) = 4\pi e^2/q^2$ represents the
Fourier transform of the Coulomb interaction potential $V_{ee}(\mathbf{r}) = e^2/|\mathbf{r}|$. Now, as
written, this expression neglects the fact that, in ionized solids, the negative charge density of
the electron cloud will be compensated by the charge density of the positively ionized
background. The latter can be incorporated into Eq. (2.19) by placing on the sum over
$\mathbf{q}$ the restriction that $\mathbf{q} \neq 0$ (exercise). Taken together, the free electron
Hamiltonian $\hat H_0$ and the Coulomb interaction potential $\hat V_{ee}$ are known as the Jellium model.

6 For a more detailed discussion one may refer to one of the many texts on the basic elements of solid state physics,
e.g., Ashcroft and Mermin, Solid State Physics (Holt-Saunders International, 1995).

The interaction described by Eq. (2.19) can be illustrated graphically, as shown in the figure
(for a more elaborate discussion of such diagrams, see Chapter 5): an electron of momentum
$\mathbf{k}$ is scattered into a new momentum state $\mathbf{k} - \mathbf{q}$ while another electron is
scattered from $\mathbf{k}' \to \mathbf{k}' + \mathbf{q}$.

In concrete applications of low-temperature condensed matter physics, one will typically
consider low excitation energies. The solution of such problems is naturally organized around
the zero-temperature ground state as a reference platform. However, the accurate calculation of
the ground state energy of the system is a complicated problem of many-body physics that
cannot be solved in closed form. Therefore, assuming that interactions will not substantially
alter the ground state of the free particle problem Eq. (2.18) - which is often not the case! -
one usually uses the ground state of the latter as a reference state.

INFO Deferring a more qualified discussion to later, a preliminary justification for this assump-
tion can be given as follows. Suppose that the density of the electron gas is such that each of
its $N$ constituent particles occupies an average volume of $O(a^d)$. The average kinetic energy
per particle is then estimated to be $T \sim 1/ma^2$, while the Coulomb interaction potential will
scale as $V \sim e^2/a$. Thus, for $a$ much smaller than the Bohr radius $a_0 = 1/e^2 m$, the interac-
tion contribution is much smaller than the average kinetic energy. In other words, for the dense
electron gas, the interaction energy can indeed be treated as a perturbation. Unfortunately, for
most metals one finds that $a \sim a_0$ and neither high- nor low-density approximations are strictly
justified.
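The scaling argument above can be condensed into a single ratio. The following is only a sketch built from the order-of-magnitude estimates quoted in the text (with $\hbar = 1$ and all numerical prefactors dropped); it is not a quantitative result.

```latex
\frac{V}{T} \;\sim\; \frac{e^2/a}{1/(m a^2)} \;=\; m e^2 a \;=\; \frac{a}{a_0},
\qquad a_0 = \frac{1}{e^2 m}.
```

On this estimate the interaction is parametrically small only when $a \ll a_0$, which is the dense limit referred to above.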


Niels Henrik David Bohr 1885-1962

Recipient of the 1922 Nobel Prize in Physics “for his services in the investigation of the
structure of atoms and of the radiation emanating from them.” (Image © The Nobel Foundation.)

The ground state of the system occupied by $N$ non-interacting particles can be readily inferred
from Eq. (2.18). The Pauli principle implies that all energy states $\epsilon_{\mathbf{k}} = k^2/2m$
will be uniformly occupied up to a cutoff Fermi energy, $E_F$. Specifically, for a system of size
$L$, allowed momentum states $\mathbf{k}$ have components $k_i = 2\pi n_i/L$, $n_i \in \mathbb{Z}$.
The summation extends up to momenta with $|\mathbf{k}| \le k_F$, where the Fermi momentum $k_F$ is
defined through the relation $k_F^2/2m = E_F$ (see figure at right). The relation between the Fermi
momentum and the occupation number can be established by dividing the volume of the Fermi
sphere $\sim k_F^d$ by the momentum space volume per mode $(2\pi/L)^d$,
viz. $N = C(k_F L)^d$ where $C$ denotes a dimensionless geometry-dependent constant.
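The counting argument can be checked numerically. The sketch below is an illustration, not part of the original text: it counts the allowed momenta inside the Fermi sphere in $d = 3$ and compares $N/(k_F L)^d$ with the continuum value. The function names, the spin factor of 2 and the choice $k_F L = 200$ are assumptions made for the example.

```python
import itertools
import math


def count_states(kF_L: float, d: int = 3) -> int:
    """Count spin-1/2 single-particle states with |k| <= k_F.

    Momenta are k_i = 2*pi*n_i/L with integer n_i, so |k| <= k_F
    is equivalent to |n| <= k_F*L/(2*pi).
    """
    radius = kF_L / (2.0 * math.pi)
    n_max = int(math.floor(radius))
    count = 0
    for n in itertools.product(range(-n_max, n_max + 1), repeat=d):
        if sum(x * x for x in n) <= radius * radius:
            count += 1
    return 2 * count  # two spin orientations per momentum state


def continuum_constant(d: int) -> float:
    """C in N = C (k_F L)^d: spin factor times unit-ball volume over (2*pi)^d."""
    unit_ball = math.pi ** (d / 2) / math.gamma(d / 2 + 1)
    return 2.0 * unit_ball / (2.0 * math.pi) ** d


kF_L = 200.0  # hypothetical value of k_F * L
N = count_states(kF_L, d=3)
C_est = N / kF_L ** 3
print(C_est, continuum_constant(3))
```

In three dimensions the continuum constant evaluates to $1/(3\pi^2)$; the lattice count approaches it as $k_F L$ grows, with deviations coming from the discreteness of the modes near the Fermi surface.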

In the language of second quantization, the ground state can now be represented as

$$|\Omega\rangle \equiv \mathcal{N} \prod_{|\mathbf{k}| \le k_F,\, \sigma} a^\dagger_{\mathbf{k}\sigma} |0\rangle, \qquad (2.20)$$



where $|0\rangle$ denotes the state with zero electrons present. When the interaction is weak,
one may anticipate that low-temperature properties will be governed by energetically low-
lying excitations superimposed upon the state $|\Omega\rangle$. Therefore, remembering the philosophy
whereby excitations rather than microscopic constituents play a prime role, one would like
to declare the filled Fermi sea, $|\Omega\rangle$ (rather than the empty state $|0\rangle$), to be the
“physical vacuum” of the theory. To make this compatible with the language of second
quantization, we need to identify a new operator algebra $c_{\mathbf{k}\sigma}, c^\dagger_{\mathbf{k}\sigma}$ such
that the new operators $c_{\mathbf{k}\sigma}$ annihilate the Fermi sea. This can be easily engineered by defining

$$c^\dagger_{\mathbf{k}\sigma} = \begin{cases} a^\dagger_{\mathbf{k}\sigma}, & k > k_F, \\ a_{\mathbf{k}\sigma}, & k \le k_F, \end{cases} \qquad c_{\mathbf{k}\sigma} = \begin{cases} a_{\mathbf{k}\sigma}, & k > k_F, \\ a^\dagger_{\mathbf{k}\sigma}, & k \le k_F. \end{cases} \qquad (2.21)$$


It is then a straightforward matter to verify that $c_{\mathbf{k}\sigma}|\Omega\rangle = 0$, and that the
canonical commutation relations are preserved.
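The verification can also be carried out numerically for a handful of modes. The sketch below is an illustration under stated assumptions: it represents the fermion operators as explicit matrices via a Jordan-Wigner construction (a choice made here, not taken from the text), uses four spinless modes ordered by energy with the lowest two filled, and checks that the redefined operators annihilate the filled state and obey fermionic anticommutation relations.

```python
import numpy as np


def annihilators(n_modes):
    """Jordan-Wigner matrices a_0 .. a_{M-1} on the 2^M-dimensional Fock space."""
    lower = np.array([[0.0, 1.0], [0.0, 0.0]])  # maps occupied -> empty
    sign = np.diag([1.0, -1.0])                 # (-1)^n string enforcing anticommutation
    one = np.eye(2)
    ops = []
    for j in range(n_modes):
        op = np.eye(1)
        for factor in [sign] * j + [lower] + [one] * (n_modes - j - 1):
            op = np.kron(op, factor)
        ops.append(op)
    return ops


M, n_F = 4, 2                        # hypothetical: 4 modes, lowest 2 below k_F
a = annihilators(M)
vacuum = np.zeros(2 ** M)
vacuum[0] = 1.0                      # |0>: every mode empty

fermi_sea = vacuum.copy()
for k in range(n_F):
    fermi_sea = a[k].T @ fermi_sea   # apply a_k^dagger (real matrices, so .T suffices)

# Redefinition in the spirit of Eq. (2.21): a_k^dagger below k_F, a_k above
c = [a[k].T if k < n_F else a[k] for k in range(M)]

annihilated = all(np.allclose(c[k] @ fermi_sea, 0.0) for k in range(M))
anticommute = all(
    np.allclose(c[k] @ c[l].T + c[l].T @ c[k], np.eye(2 ** M) * (k == l))
    for k in range(M)
    for l in range(M)
)
print(annihilated, anticommute)
```

The same check works for any number of modes and any filling; only the matrix dimension $2^M$ limits it.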

The Hamiltonian defined through Eq. (2.18) and (2.19), represented in terms of the 
operator algebra (2.21) and the vacuum (2.20), forms the basis of the theory of interactions 
in highly mobile electron compounds. The investigation of the role of Coulomb interactions 
in such systems will provide a useful arena to apply the methods of quantum field theory 
formulated in subsequent chapters. Following our classification of electron systems, let us 
now turn our attention to a complementary class of materials where the lattice potential 
presents a strong perturbation on the conduction electrons. In such situations, realized, 
for example, in transition metal oxides, a description based on “almost localized” electron 
states will be used to represent the Hamiltonian (2.17).

Let us consider a “rarefied” lattice in which the constituent ion cores are separated by a
distance in excess of the typical Bohr radius of the valence band electrons. In this “atomic 
limit” the weight of the electron wavefunctions is “tightly bound” to the lattice centers. 
Here, to formulate a microscopic theory of interactions, it is convenient to expand the 
Hamiltonian in a local basis that reflects the atomic orbital states of the isolated ion. Such 
a representation is presented by the basis of Wannier states defined by 

$$|\psi_{\mathbf{R}n}\rangle \equiv \frac{1}{\sqrt{N}} \sum_{\mathbf{k}}^{\rm B.Z.} e^{-i\mathbf{k}\cdot\mathbf{R}} |\psi_{\mathbf{k}n}\rangle, \qquad |\psi_{\mathbf{k}n}\rangle = \frac{1}{\sqrt{N}} \sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}} |\psi_{\mathbf{R}n}\rangle, \qquad (2.22)$$

where $\mathbf{R}$ denote the coordinates of the lattice centers, and $\sum_{\mathbf{k}}^{\rm B.Z.}$ represents a
summation over all momenta $\mathbf{k}$ in the first Brillouin zone. For a system with a vanishingly
weak interatomic overlap, i.e. a lattice where $V$ approaches a superposition of independent atomic
potentials, the Wannier function $\psi_{\mathbf{R}n}(\mathbf{r}) \equiv \langle \mathbf{r}|\psi_{\mathbf{R}n}\rangle$
converges on the $n$th orbital of an isolated atom centered at coordinate $\mathbf{R}$. However, when
the interatomic coupling is non-zero, i.e. in a “real” solid, the $N$ formerly degenerate states
labeled by $n$ split to form an energy band (see figure below). Here, the Wannier functions (which,
note, are not eigenfunctions of the Hamiltonian) differ from those of the atomic orbitals through
residual oscillations in the region of overlap to ensure orthogonality of the basis. Significantly,
in cases where the Fermi energy lies between two energetically separated bands, the system
presents insulating behavior. Conversely, when the Fermi energy is located within a band, one may
expect metallic behavior. Ignoring the complications that arise when bands begin to overlap, we
will henceforth focus on metallic systems where the Fermi energy is located within a definite
band $n_0$.

How can the Wannier basis be exploited to obtain a simplified representation of the general
Hamiltonian (2.17)? The first thing to notice is that the Wannier states $\{|\psi_{\mathbf{R}n}\rangle\}$
form an orthonormal basis of the single-particle Hilbert space, i.e. the transformation between
the real space and the Wannier representation is unitary,
$|\mathbf{r}\rangle = \sum_{\mathbf{R}} |\psi_{\mathbf{R}}\rangle\langle\psi_{\mathbf{R}}|\mathbf{r}\rangle = \sum_{\mathbf{R}} \psi^*_{\mathbf{R}}(\mathbf{r}) |\psi_{\mathbf{R}}\rangle$.
(Here, since we are interested only in contributions arising from the particular “metallic” band
$n_0$ in which the Fermi energy lies, we have dropped the remaining set of bands $n \neq n_0$ and,
with them, reference to the specific band index.) (Exercise: By focusing on just a single band
$n_0$, in what sense is the Wannier basis now complete?) As such, it induces a transformation

$$a^\dagger_\sigma(\mathbf{r}) = \sum_{\mathbf{R}} \psi^*_{\mathbf{R}}(\mathbf{r})\, a^\dagger_{\mathbf{R}\sigma} \equiv \sum_i \psi^*_{\mathbf{R}_i}(\mathbf{r})\, a^\dagger_{i\sigma} \qquad (2.23)$$

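between the real space and the Wannier space operator basis, respectively. In the second
representation, following a convention commonly used in the literature, we have labeled
the lattice center coordinates $\mathbf{R} \equiv \mathbf{R}_i$ by a counting index $i = 1, \ldots, N$.
Similarly, the unitary transformation between Bloch and Wannier states Eq. (2.22) induces an
operator transformation

$$a^\dagger_{\mathbf{k}\sigma} = \frac{1}{\sqrt{N}} \sum_i e^{i\mathbf{k}\cdot\mathbf{R}_i} a^\dagger_{i\sigma}, \qquad a^\dagger_{i\sigma} = \frac{1}{\sqrt{N}} \sum_{\mathbf{k}}^{\rm B.Z.} e^{-i\mathbf{k}\cdot\mathbf{R}_i} a^\dagger_{\mathbf{k}\sigma}. \qquad (2.24)$$

[Figure: energy bands; panel labels: atomic limit, insulator, metal.]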


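We can now use the transformation formulae (2.23) and (2.24) to formulate a Wannier
representation of the Hamiltonian (2.17). Using the fact that the Bloch states diagonalize
the single-particle component $\hat H_0$, we obtain

$$\hat H_0 = \sum_{\mathbf{k}} \epsilon_{\mathbf{k}}\, a^\dagger_{\mathbf{k}\sigma} a_{\mathbf{k}\sigma} = \frac{1}{N} \sum_{ii'} \sum_{\mathbf{k}} e^{i\mathbf{k}\cdot(\mathbf{R}_i - \mathbf{R}_{i'})} \epsilon_{\mathbf{k}}\, a^\dagger_{i\sigma} a_{i'\sigma} \equiv \sum_{ii'} a^\dagger_{i\sigma} t_{ii'} a_{i'\sigma},$$

where we have defined $t_{ii'} = N^{-1} \sum_{\mathbf{k}} e^{i\mathbf{k}\cdot(\mathbf{R}_i - \mathbf{R}_{i'})} \epsilon_{\mathbf{k}}$.
The new representation of $\hat H_0$ describes electrons hopping from one lattice center $i'$ to
another, $i$. The strength of the hopping matrix element $t_{ii'}$ is controlled by the effective
overlap of neighboring atoms. In the extreme atomic limit, where the levels
$\epsilon_{\mathbf{k}} = \text{const.}$ are degenerate, $t_{ii'} \propto \delta_{ii'}$ and no interatomic
transport is possible. The tight-binding representation becomes useful when $t_{i \neq i'}$ is
non-vanishing, but the orbital overlap is so weak that only nearest neighbor hopping effectively
contributes.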
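To see how a band dispersion translates into hopping amplitudes, one can evaluate $t_{ii'} = N^{-1}\sum_{\mathbf{k}} e^{i\mathbf{k}\cdot(\mathbf{R}_i-\mathbf{R}_{i'})}\epsilon_{\mathbf{k}}$ on a small periodic chain. The sketch below is illustrative only: the one-dimensional geometry, the cosine band $\epsilon_k = -2t\cos(ka)$, the flat-band value 0.3 and the function name are all assumptions chosen for the example.

```python
import numpy as np


def hopping_matrix(eps_k, k, R):
    """t_{ii'} = (1/N) sum_k exp(i k (R_i - R_i')) eps_k for a 1D chain."""
    N = len(R)
    dR = R[:, None] - R[None, :]                        # R_i - R_i'
    phases = np.exp(1j * k[None, None, :] * dR[:, :, None])
    return (phases * eps_k[None, None, :]).sum(axis=2) / N


N, a, t = 12, 1.0, 1.0                                  # hypothetical chain parameters
R = a * np.arange(N)
k = 2 * np.pi * np.arange(N) / (N * a)                  # N momenta spanning one Brillouin zone

t_band = hopping_matrix(-2 * t * np.cos(k * a), k, R)   # assumed cosine band
t_flat = hopping_matrix(np.full(N, 0.3), k, R)          # atomic limit: eps_k = const.

print(np.round(t_band[0].real, 6))
print(np.round(t_flat[0].real, 6))
```

For the cosine band only the nearest neighbor entries survive (equal to $-t$, including the periodic wrap-around), while the flat band gives a matrix proportional to the identity, in line with the two limits described above.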


EXERCISE Taking a square lattice geometry, and setting $t_{ii'} = -t$ for $i, i'$ nearest neighbors
and zero otherwise, diagonalize the two-dimensional tight-binding Hamiltonian $\hat H_0$. Show that
the eigenvalues are given by $\epsilon_{\mathbf{k}} = -2t(\cos(k_x a) + \cos(k_y a))$. Sketch contours
of constant energy in the Brillouin zone and note the geometry at half-filling.
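A numerical cross-check of the exercise is straightforward. The following sketch (an illustration, assuming a finite $L \times L$ lattice with periodic boundary conditions and lattice constant $a = 1$; the function name is hypothetical) builds the real-space hopping matrix and compares its eigenvalues with the cosine dispersion evaluated at the allowed momenta.

```python
import numpy as np


def square_lattice_hamiltonian(L: int, t: float = 1.0) -> np.ndarray:
    """Nearest-neighbor hopping matrix on an L x L square lattice, periodic boundaries."""
    N = L * L
    H = np.zeros((N, N))
    for x in range(L):
        for y in range(L):
            i = x * L + y
            # link each site to its +x and +y neighbors; the transpose adds -x, -y
            for dx, dy in ((1, 0), (0, 1)):
                j = ((x + dx) % L) * L + (y + dy) % L
                H[i, j] -= t
                H[j, i] -= t
    return H


L, t = 6, 1.0                                   # hypothetical system size and hopping
numeric = np.sort(np.linalg.eigvalsh(square_lattice_hamiltonian(L, t)))

k = 2 * np.pi * np.arange(L) / L                # allowed momenta (a = 1)
analytic = np.sort((-2 * t * (np.cos(k)[:, None] + np.cos(k)[None, :])).ravel())

print(np.allclose(numeric, analytic))
```

The comparison is of sorted spectra only; it does not by itself establish the form of the eigenvectors, which the exercise asks to be derived analytically.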


To assess the potency of the tight-binding approximation, let us consider its application to 
two prominent realizations of carbon-based lattice systems, graphene and carbon nanotubes. 

INFO Graphene is a single layer of graphite: a planar hexagonal lattice of $sp^2$-hybridized⁷
carbon atoms connected by strong covalent bonds of their three planar σ-orbitals. (See
Fig. 2.2 and the figure overleaf for a schematic.) The remaining $p_z$ orbitals - oriented vertically to
the lattice plane - overlap weakly to form a band of mobile electrons. For a long time, it was
thought that graphene sheets in isolation would inevitably be destabilized by thermal fluctuations;
only layered stacks of graphene would mutually stabilize to form a stable compound - graphite.
It thus came as a surprise when in 2004 a team of researchers⁸ succeeded in the isolation of
large (micron-sized) graphene flakes on an SiO₂ substrate. (Meanwhile, the isolation of even free
standing graphene layers has become possible. In fact, our whole conception of the stability of
the compound has changed. It is now believed that whenever you draw a pencil line, a trail of
graphene flakes will be left behind.)

Immediately after its discovery, it became clear that graphene possesses unconventional con-
duction properties. Nominally a gapless semiconductor, it has an electron mobility
$\sim 2 \times 10^5\,\mathrm{cm}^2/\mathrm{Vs}$, far higher than that of even the purest silicon based
semiconductors; it shows manifestations of the integer quantum Hall effect qualitatively different
from those of conventional
two-dimensional electron compounds (cf. Chapter 9 for a general discussion of the quantum Hall


7 Although this will not be necessary to follow the text, readers may find it rewarding to recapitulate the quantum
chemistry of $sp^2$-hybridized carbon and its covalent bonds.

8 K. S. Novoselov, et al., Electric field effect in atomically thin carbon films, Science 306 (2004), 666-9.

Figure 2.2 Left: optical microscopy image of graphene flakes. Regions labeled by ‘I’ define mono-
layer graphene sheets. Right: STM image of the graphene samples shown in the left part. Images
taken from E. Stolyarova et al., High-resolution scanning tunneling microscopy imaging of meso-
scopic graphene sheets on an insulating surface, PNAS 104 (2007), 9210-12.


effect), etc. Although an in-depth discussion of graphene is beyond the scope of this text, we note 
that it owes most of its fascinating properties to its band structure: electrons in graphene show 
a linear dispersion and behave like two-dimensional Dirac fermions! By way of an illustration of 
the concepts discussed above, we here derive this unconventional band dispersion from a
tight-binding formulation of the system.


To a first approximation, graphene's π-electron system can be modeled as a tight-binding
Hamiltonian characterized by a single hopping matrix element between neighboring atoms, $-t$,
and the energy offset $\epsilon$ of the $p$-electron states. To determine the spectrum of the system,
one may introduce a system of bi-atomic unit cells (see the ovals in the schematic figure right)
and two (non-orthogonal) unit vectors of the hexagonal lattice
$\mathbf{a}_1 = (\sqrt{3}, 1)a/2$ and $\mathbf{a}_2 = (\sqrt{3}, -1)a/2$, where
$a = |\mathbf{a}_1| = |\mathbf{a}_2| \simeq 2.46$ Å denotes the lattice constant. The tight-binding
Hamiltonian can then be represented in the form
$\hat H = -t \sum_{\langle \mathbf{r}, \mathbf{r}' \rangle} (a_1^\dagger(\mathbf{r}) a_2(\mathbf{r}') + \text{h.c.})$,
where the sum runs over all nearest neighbor basis vectors of the unit cells (the coordinate
vectors of the bottom left atom) and $a^\dagger_{1(2)}(\mathbf{r})$ creates a state in the first (second)
atom of the cell at vector $\mathbf{r}$. Switching to a Fourier representation, the Hamiltonian
assumes the form



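$$\hat H = \sum_{\mathbf{k}\sigma} \begin{pmatrix} a^\dagger_{1\sigma\mathbf{k}} & a^\dagger_{2\sigma\mathbf{k}} \end{pmatrix} \begin{pmatrix} 0 & -t f(\mathbf{k}) \\ -t f^*(\mathbf{k}) & 0 \end{pmatrix} \begin{pmatrix} a_{1\sigma\mathbf{k}} \\ a_{2\sigma\mathbf{k}} \end{pmatrix}, \qquad (2.25)$$

where $f(\mathbf{k}) = 1 + e^{-ik_1 a} + e^{i(-k_1 + k_2)a}$.

Figure 2.3 (a) Spectrum of the tight-binding Hamiltonian (2.25) showing the point-like structure
of the Fermi surface when $E_F = 0$. (b) A contour plot of the same.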

EXERCISE (Recall the concept of the reciprocal lattice in solid state theory.) To derive the
Fourier representation above, show that a system of two reciprocal lattice vectors conjugate to the
unit vectors above is given by $\mathbf{G}_{1/2} = \frac{2\pi}{\sqrt{3}a}(1, \pm\sqrt{3})$. Next, show that
the Fourier decomposition of a field operator reads as
$a_a(\mathbf{r}) = \frac{1}{\sqrt{N}} \sum_{\mathbf{k}} e^{-i\frac{a}{2\pi}(k_1\mathbf{G}_1 + k_2\mathbf{G}_2)\cdot\mathbf{r}}\, a_{a\mathbf{k}}$,
where $k_i \in [0, 2\pi/a]$ is quantized in units $2\pi/L_i$. ($L_i$ is the extension of the system in
the direction of $\mathbf{a}_i$ and $N$ its total number of unit cells.) Substitute this decomposition
into the real space representation of the Hamiltonian to arrive at the Fourier representation.
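The first step of the exercise amounts to checking the duality condition $\mathbf{G}_i \cdot \mathbf{a}_j = 2\pi\delta_{ij}$. A minimal numerical check follows; it assumes the unit vectors $\mathbf{a}_{1/2} = (\sqrt{3}, \pm 1)a/2$ and the candidate reciprocal vectors $\mathbf{G}_{1/2} = \frac{2\pi}{\sqrt{3}a}(1, \pm\sqrt{3})$, and the variable names are chosen for this example only.

```python
import numpy as np

a = 2.46                                            # lattice constant in angstrom
a1 = np.array([np.sqrt(3.0), 1.0]) * a / 2
a2 = np.array([np.sqrt(3.0), -1.0]) * a / 2

prefactor = 2 * np.pi / (np.sqrt(3.0) * a)
G1 = prefactor * np.array([1.0, np.sqrt(3.0)])      # candidate reciprocal vectors
G2 = prefactor * np.array([1.0, -np.sqrt(3.0)])

# Matrix of scalar products G_i . a_j; duality requires 2*pi times the identity
gram = np.array([[G1 @ a1, G1 @ a2],
                 [G2 @ a1, G2 @ a2]])
print(gram / (2 * np.pi))
```

With this normalization the phase $\frac{a}{2\pi}(k_1\mathbf{G}_1 + k_2\mathbf{G}_2)\cdot\mathbf{a}_i$ reduces to $k_i a$, which is what makes the components $k_1, k_2$ convenient coordinates for the Brillouin zone.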

Diagonalizing the Hamiltonian, one obtains the dispersion relation,⁹

$$\epsilon_{\mathbf{k}} = \pm t|f(\mathbf{k})| = \pm t\sqrt{3 + 2\cos(k_1 a) + 2\cos((k_1 - k_2)a) + 2\cos(k_2 a)}. \qquad (2.26)$$
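As a consistency check between the $2 \times 2$ Bloch matrix and the closed-form dispersion, one can diagonalize the matrix numerically. The sketch below is illustrative: it assumes $f(\mathbf{k}) = 1 + e^{-ik_1 a} + e^{i(k_2 - k_1)a}$, works with the dimensionless variables $k_1 a$ and $k_2 a$, and uses function names introduced only for this example. The two zero-energy points quoted in the code are obtained by requiring the three unit phasors in $f$ to cancel.

```python
import numpy as np


def f(k1a, k2a):
    """Structure factor f(k) in terms of the dimensionless components k_1 a, k_2 a."""
    return 1 + np.exp(-1j * k1a) + np.exp(1j * (k2a - k1a))


def bloch_hamiltonian(k1a, k2a, t=1.0):
    """2x2 matrix acting on the two sublattice amplitudes."""
    fk = f(k1a, k2a)
    return np.array([[0.0, -t * fk], [-t * np.conj(fk), 0.0]])


def dispersion(k1a, k2a, t=1.0):
    """Positive branch of the closed-form dispersion."""
    arg = 3 + 2 * np.cos(k1a) + 2 * np.cos(k1a - k2a) + 2 * np.cos(k2a)
    return t * np.sqrt(np.maximum(arg, 0.0))   # clip tiny negative rounding errors


rng = np.random.default_rng(0)
samples = rng.uniform(0.0, 2 * np.pi, size=(5, 2))
match = all(
    np.allclose(np.linalg.eigvalsh(bloch_hamiltonian(k1a, k2a)),
                [-dispersion(k1a, k2a), dispersion(k1a, k2a)])
    for k1a, k2a in samples
)

# Two inequivalent zeros of f: (k_1 a, k_2 a) = (4*pi/3, 2*pi/3) and (2*pi/3, 4*pi/3)
dirac_points = [(4 * np.pi / 3, 2 * np.pi / 3), (2 * np.pi / 3, 4 * np.pi / 3)]
gaps = [dispersion(k1a, k2a) for k1a, k2a in dirac_points]
print(match, gaps)
```

At both points the two branches touch, which is the point-like Fermi surface of the half-filled system.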

Here, in contrast to the square lattice tight-binding Hamiltonian, the half-filled system is
characterized by a point-like Fermi surface (see Figure 2.3). When lightly doped away from
half-filling, the spectrum divides into Dirac-like spectra with a linear dispersion. Fig. 2.3
shows the full spectrum of the tight-binding Hamiltonian. Notice that of the six Dirac
points displayed in the figure only two are independent. The complementary four can be
reached from those two points by addition of a reciprocal lattice vector and, therefore, do
not represent independent states.

EXERCISE Derive an explicit representation of the Dirac-Hamiltonian describing the low-
energy physics of the system. To this end, choose two inequivalent (not connected by reciprocal
lattice vectors) zero-energy points $\mathbf{k}_{1,2}$ in the Brillouin zone. Expand the Hamiltonian (2.25)
around these two points in small momentum deviations $\mathbf{q} = \mathbf{k} - \mathbf{k}_{1,2}$ up to linear order. Show
that in this approximation, $\hat H$ reduces to the sum of two two-dimensional Dirac Hamiltonians.
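The linear dispersion near a zero-energy point can be probed numerically before doing the expansion by hand. The sketch below is only an illustration under stated assumptions: it writes $f$ in Cartesian momentum as $f(\mathbf{k}) = 1 + e^{-i\mathbf{k}\cdot\mathbf{a}_1} + e^{-i\mathbf{k}\cdot(\mathbf{a}_1 - \mathbf{a}_2)}$, picks the zero at $\mathbf{K} = (2\pi/\sqrt{3}a,\, 2\pi/3a)$, and compares the slope $|f(\mathbf{K} + \mathbf{q})|/|\mathbf{q}|$ with the value $\sqrt{3}a/2$ obtained from a first-order expansion (a result worked out for this example, not quoted from the text).

```python
import numpy as np

a = 1.0                                             # lattice constant (units of choice)
a1 = np.array([np.sqrt(3.0), 1.0]) * a / 2
a2 = np.array([np.sqrt(3.0), -1.0]) * a / 2


def f_cart(k):
    """Structure factor as a function of the Cartesian momentum k."""
    return 1 + np.exp(-1j * (k @ a1)) + np.exp(-1j * (k @ (a1 - a2)))


K = np.array([2 * np.pi / (np.sqrt(3.0) * a), 2 * np.pi / (3 * a)])   # a zero of f

q_len = 1e-4
angles = np.linspace(0.0, 2 * np.pi, 8, endpoint=False)
slopes = np.array([
    abs(f_cart(K + q_len * np.array([np.cos(p), np.sin(p)]))) / q_len
    for p in angles
])
print(slopes, np.sqrt(3.0) * a / 2)
```

The slope is the same in every direction, so the low-energy cone is isotropic, $\epsilon \simeq \pm (\sqrt{3}ta/2)|\mathbf{q}|$, as expected for a two-dimensional Dirac spectrum.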

Now, against this background, let us consider the carbon nanotube system. A single-wall 
nanotube describes a one-dimensional structure involving a graphene sheet rolled into a 
cylinder (see Fig. 2.4 for an STM image of a carbon nanotube). Tubes of comparatively
simple structure are obtained by rolling the hexagonal pattern of the sheet along one of its
two axes of symmetry: along the $\mathbf{a}_1$-direction one obtains a zig-zag tube and along the


9 P. R. Wallace, The band theory of graphite, Phys. Rev. 71 (1947), 622-34.





Figure 2.4 STM image of a carbon nanotube contacted with Pt electrodes. (Source: Courtesy of 
C. Dekker.) 

$(\mathbf{a}_1 + \mathbf{a}_2)$-direction an armchair tube (Exercise: Where do you see an armchair?). More
complex, “chiral” structures involve twists along the circumference of the tube.

By knowing the band structure of graphene, the dispersion of the nanotube system can be
straightforwardly inferred. The situation is most easily visualized for the achiral geometries.
Essentially a graphene sheet with periodic boundary conditions and length $L_\perp$ in the direction
transverse to the length of the tube, the quasi-one-dimensional system has a spectrum that is
obtained by projection of the two-dimensional dispersion onto lines indexed by the discrete
values $k_\perp$ along the compact axis. For example, the dispersion of the lowest transverse
harmonic, $k_\perp = 0$, of the armchair tube is given by (see the figure)

$$\epsilon_{k_\parallel} = \pm t\sqrt{3 + 4\cos(k_\parallel a/2) + 2\cos(k_\parallel a)}.$$

Notice the presence of two nodal points in the spectrum. Electrons with longitudinal momen-
tum close to one of these two “hot spots” propagate with approximately linear dispersion.
The physics of effectively one-dimensional electron systems of this type will be discussed in
Section 2.2 below.
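The two nodal points can be located explicitly. Writing $c = \cos(k_\parallel a/2)$, the argument of the square root becomes $3 + 4c + 2(2c^2 - 1) = (2c + 1)^2$, so the band reduces to $t|1 + 2\cos(k_\parallel a/2)|$ and vanishes at $k_\parallel a = \pm 4\pi/3$. The sketch below checks this numerically; the rewriting is an elementary identity worked out for this example rather than a statement from the text, and the function name is hypothetical.

```python
import numpy as np


def armchair_band(x, t=1.0):
    """Positive branch of the armchair dispersion, with x = k_parallel * a."""
    arg = 3 + 4 * np.cos(x / 2) + 2 * np.cos(x)
    return t * np.sqrt(np.maximum(arg, 0.0))   # clip tiny negative rounding errors


x = np.linspace(-2 * np.pi, 2 * np.pi, 2001)
factorized = np.abs(1 + 2 * np.cos(x / 2))     # t = 1

nodes = np.array([-4 * np.pi / 3, 4 * np.pi / 3])
print(np.allclose(armchair_band(x), factorized), armchair_band(nodes))
```

Near each node the factorized form is linear in the deviation of $k_\parallel$ from the node, which is the approximately linear dispersion referred to above.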

EXERCISE Verify the one-dimensional dispersion relation $\epsilon_{k_\parallel}$ above. To this end, notice that
the single-electron wave functions of the system must obey periodic boundary conditions
$\psi(\mathbf{r} + N(\mathbf{a}_1 + \mathbf{a}_2)) = \psi(\mathbf{r})$, where $N$ is the number of cells in the
transverse direction (i.e. $L_\perp = N|\mathbf{a}_1 + \mathbf{a}_2| = Na\sqrt{3}$ is the circumference of the
tube). Use this condition to obtain the quantization rule $k_1 + k_2 = 2\pi\sqrt{3}\, m/L_\perp$, where $m$
is integer. When evaluated for the lowest harmonic, $m = 0$, the dispersion relation (2.26)
collapses to $\epsilon_{k_\parallel}$ (where $k_\parallel = k_1 - k_2$ is the momentum in the
longitudinal direction). Show that the zig-zag tube supports zero-energy excitations only for
specific values of the number $N$ of transverse lattice spacings. In general, it is insulating with
a band gap of ca. 0.5 eV. In the case of the chiral tubes, it may be shown that a third of the
tubes are semi-metallic while all others are insulating.¹⁰



10 For further details on the electronic structure of carbon nanotubes, we refer to the text Physical Properties of
Carbon Nanotubes by R. Saito, G. Dresselhaus, and M. S. Dresselhaus (Imperial College Press, 1998).

Interaction effects in the tight-binding system

Although the pseudopotential of the nearly free electron system accommodates the effects 
of Coulomb interaction between the conduction and valence band electrons, the mutual 
Coulomb interaction between the conduction electrons themselves may lead to new physical 
phenomena. These effects can substantially alter the material parameters (e.g. effective
masses); however, they change neither the nature of the ground state nor that of the
elementary quasi-particle excitations in any fundamental way - this is the content of Fermi-
liquid theory, and a matter to which we will return. By contrast, even weak interaction
effects significantly influence the physics of the tight-binding system. Here, commensurability
effects combined with interaction may drive the system towards a correlated magnetic state 
or an insulating phase. 

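To understand why, let us re-express the interaction in field operators associated with
the Wannier states. Once again, to keep our discussion simple (yet generic in scope), let
us focus attention on a single sub-band and drop reference to the band index. Then,
applied to the Coulomb interaction, the transformation (2.23) leads to the expansion
$\hat V_{ee} = \sum_{ii'jj'} U_{ii'jj'}\, a^\dagger_{i\sigma} a^\dagger_{i'\sigma'} a_{j'\sigma'} a_{j\sigma}$, where

$$U_{ii'jj'} = \frac{1}{2} \int d^dr \int d^dr'\, \psi^*_{\mathbf{R}_i}(\mathbf{r})\, \psi_{\mathbf{R}_j}(\mathbf{r})\, V(\mathbf{r} - \mathbf{r}')\, \psi^*_{\mathbf{R}_{i'}}(\mathbf{r}')\, \psi_{\mathbf{R}_{j'}}(\mathbf{r}'). \qquad (2.27)$$

Taken together, the combination of the contributions $\hat H_0$ and $\hat V_{ee}$,

$$\hat H = \sum_{ii'} a^\dagger_{i\sigma} t_{ii'} a_{i'\sigma} + \sum_{ii'jj'} U_{ii'jj'}\, a^\dagger_{i\sigma} a^\dagger_{i'\sigma'} a_{j'\sigma'} a_{j\sigma}, \qquad (2.28)$$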

where the sum of repeated spin indices is implied, defines the tight-binding representa- 
tion of the interaction Hamiltonian. Apart from the neglect of the neighboring sub-bands, 
the Hamiltonian is exact. Yet, to assimilate the effects of the interaction, it is useful to 
assess the relative importance of the different matrix elements drawing on the nature of the 
atomic limit that justified the tight-binding description. We will thus focus on contributions 
to $U_{ii'jj'}$ where the indices are either equal or, at most, nearest neighbors. Focusing on the
most relevant of these matrix elements, a number of physically different contributions can 
be identified: 

> The direct terms $U_{ii'ii'} \equiv V_{ii'}$ involve integrals over square moduli of Wannier func-
tions and couple density fluctuations at neighboring sites, $\sum_{i \neq i'} V_{ii'} \hat n_i \hat n_{i'}$, where
$\hat n_i = \sum_\sigma a^\dagger_{i\sigma} a_{i\sigma}$. This contribution accounts for the - essentially classical -
interaction between charges localized at neighboring sites (see Fig. 2.5). In certain materials,
interactions of this type have the capacity to induce global instabilities in the charge
distribution known as charge density wave instabilities.

> A second important contribution derives from the exchange coupling, which induces
magnetic correlations among the electron spins. Setting $J^F_{ij} \equiv U_{ijji}$, and making use of
Pauli matrix identities (see below), one obtains

$$\sum_{i \neq j} U_{ijji}\, a^\dagger_{i\sigma} a^\dagger_{j\sigma'} a_{i\sigma'} a_{j\sigma} = -2 \sum_{i \neq j} J^F_{ij} \left( \hat{\mathbf{S}}_i \cdot \hat{\mathbf{S}}_j + \frac{1}{4} \hat n_i \hat n_j \right).$$

Such contributions tend to induce weak ferromagnetic coupling of neighboring spins
(i.e. $J^F > 0$). The fact that an effective magnetic coupling is born out of the electro-
static interaction between quantum particles is easily understood. Consider two electrons
inhabiting neighboring sites. The Coulomb repulsion between the particles is minimized
if the orbital two-particle wave function is anti-symmetric and, therefore, has low ampli-
tude in the interaction zone between the particles. Since the overall wavefunction must
be anti-symmetric, the energetically favored real-space configuration enforces a symmet-
ric alignment of the two spins. Such a mechanism is familiar from atomic physics where
it is manifested as Hund’s rule. In general, magnetic interactions in solids are usually
generated as an indirect manifestation of the much stronger Coulomb interaction.

Figure 2.5 Different types of interaction mechanism induced by the tight-binding interaction $\hat V_{ee}$.
The curves symbolically indicate wavefunction envelopes. (a) Direct Coulomb interaction between
neighboring sites. Taking account of the exchange interaction, parallel alignment of spins (b) is
preferred since it enforces anti-symmetry of the spatial wave function. By contrast, for anti-parallel
spin configurations (c) the wave function amplitude in the repulsion zone is enhanced. (d) Coulomb
interaction between electrons of opposite spin populating the same site.

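EXERCISE Making use of the Pauli matrix identity
$\boldsymbol{\sigma}_{\alpha\beta} \cdot \boldsymbol{\sigma}_{\gamma\delta} = 2\delta_{\alpha\delta}\delta_{\beta\gamma} - \delta_{\alpha\beta}\delta_{\gamma\delta}$, show that
$\hat{\mathbf{S}}_i \cdot \hat{\mathbf{S}}_j = -\frac{1}{2} a^\dagger_{i\alpha} a^\dagger_{j\beta} a_{i\beta} a_{j\alpha} - \frac{1}{4} \hat n_i \hat n_j$
where, as usual, $\hat{\mathbf{S}}_i = \frac{1}{2} a^\dagger_{i\alpha} \boldsymbol{\sigma}_{\alpha\beta} a_{i\beta}$ denotes the operator
for spin 1/2, and the lattice sites $i$ and $j$ are assumed distinct.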
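The Pauli matrix identity used in the exercise involves only sixteen index combinations and can be verified by brute force. The following sketch is an illustration (array names are chosen for this example); it checks the identity itself, not the operator relation that the exercise asks to be derived from it.

```python
import numpy as np

# Pauli matrices stacked along the first axis: sigma[a] is the a-th matrix
sigma = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)

# Left-hand side: sum_a sigma^a_{alpha beta} sigma^a_{gamma delta}
lhs = np.einsum('aij,akl->ijkl', sigma, sigma)

# Right-hand side: 2 delta_{alpha delta} delta_{beta gamma} - delta_{alpha beta} delta_{gamma delta}
delta = np.eye(2)
rhs = 2 * np.einsum('il,jk->ijkl', delta, delta) - np.einsum('ij,kl->ijkl', delta, delta)

print(np.allclose(lhs, rhs))
```

Contracting the identity with $a^\dagger_{i\alpha} a_{i\beta} a^\dagger_{j\gamma} a_{j\delta}$ and anticommuting the operators on distinct sites then produces the relation stated in the exercise.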

> Finally, far into the atomic limit, where the atoms are very well separated, and the overlap
between neighboring orbitals is weak, the matrix elements $t_{ij}$ and $J^F_{ij}$ are exponentially
small in the interatomic separation. In this limit, the “on-site” Coulomb or Hubbard
interaction, $U_{iiii} \equiv U/2$,
$\sum_{i\sigma\sigma'} U_{iiii}\, a^\dagger_{i\sigma} a^\dagger_{i\sigma'} a_{i\sigma'} a_{i\sigma} = \sum_i U \hat n_{i\uparrow} \hat n_{i\downarrow}$,
generates the dominant interaction mechanism. Taking only the nearest neighbor contribution to
the hopping matrix elements, and neglecting the energy offset due to the diagonal term, the
effective Hamiltonian takes a simplified form known as the Hubbard model,

$$\hat H = -t \sum_{\langle ij \rangle} a^\dagger_{i\sigma} a_{j\sigma} + U \sum_i \hat n_{i\uparrow} \hat n_{i\downarrow}, \qquad (2.29)$$

where $\langle ij \rangle$ is a shorthand used to denote neighboring lattice sites. In hindsight, a model
of this structure could have been proposed from the outset on purely phenomenological 
grounds: electrons tunnel between atomic orbitals localized on individual lattice sites 
while double occupancy of a lattice site incurs an energetic penalty associated with the 
mutual Coulomb interaction. 
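The competition between tunneling and the on-site penalty can be made concrete on the smallest possible lattice. The sketch below is an illustration under stated assumptions: it takes just two sites, represents the four fermion modes as matrices through a Jordan-Wigner construction (a choice made for this example), and diagonalizes the Hubbard Hamiltonian in the two-electron sector. The mode ordering, the function names and the parameter values are hypothetical.

```python
import numpy as np


def annihilators(n_modes):
    """Jordan-Wigner matrices a_0 .. a_{M-1} on the 2^M-dimensional Fock space."""
    lower = np.array([[0.0, 1.0], [0.0, 0.0]])  # maps occupied -> empty
    sign = np.diag([1.0, -1.0])                 # (-1)^n string enforcing anticommutation
    one = np.eye(2)
    ops = []
    for j in range(n_modes):
        op = np.eye(1)
        for factor in [sign] * j + [lower] + [one] * (n_modes - j - 1):
            op = np.kron(op, factor)
        ops.append(op)
    return ops


def half_filled_spectrum(t, U):
    """Eigenvalues of the two-site Hubbard model restricted to two electrons."""
    # Assumed mode order: 0 = (site 1, up), 1 = (site 1, down),
    #                     2 = (site 2, up), 3 = (site 2, down)
    a = annihilators(4)
    ad = [op.T for op in a]
    n = [ad[j] @ a[j] for j in range(4)]
    hop = sum(ad[i] @ a[j] + ad[j] @ a[i] for i, j in ((0, 2), (1, 3)))
    H = -t * hop + U * (n[0] @ n[1] + n[2] @ n[3])
    total = sum(n)                                # diagonal in the occupation basis
    sector = np.isclose(np.diag(total), 2.0)      # keep the six two-electron states
    return np.linalg.eigvalsh(H[np.ix_(sector, sector)])


t, U = 1.0, 8.0                                   # hypothetical parameters
spectrum = half_filled_spectrum(t, U)
print(spectrum, (U - np.sqrt(U ** 2 + 16 * t ** 2)) / 2)
```

The lowest level lies slightly below zero, at $(U - \sqrt{U^2 + 16t^2})/2 \approx -4t^2/U$ for $U \gg t$, while three states remain at exactly zero energy and the states with substantial double occupancy sit near $U$.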


Mott-Hubbard transition and the magnetic state 

Deceptive in its simplicity, the Hubbard model is acknowledged as a paradigm of strong 
electron correlation in condensed matter. Yet, after 40 years of intense investigation, the 
properties of this seemingly simple model system - the character of the ground state and 
nature of the quasi-particle excitations - are still the subject of controversy (at least in 
dimensions higher than one - see below). Nevertheless, given the importance attached to this 
system, we will close this section with a brief discussion of the remarkable phenomenology 
that is believed to characterize the Hubbard system. 

As well as dimensionality, the phase behavior of the Hubbard Hamiltonian is character- 
ized by three dimensionless parameters: the ratio of the Coulomb interaction scale to the 
bandwidth U/t, the particle density or filling fraction n (i.e. the average number of electrons 
per site), and the (dimensionless) temperature, T/t. The symmetry of the Hamiltonian 
under particle-hole interchange (exercise) allows one to limit consideration to densities in 
the range $0 \le n \le 1$ while densities $1 < n \le 2$ can be inferred by “reflection.”

Focusing first on the low temperature system, in the dilute limit $n \ll 1$, the typical electron
wavelength is greatly in excess of the site separation and the dynamics are free. Here the
local interaction presents only a weak perturbation and one can expect the properties of
the Hubbard system to mirror those of the weakly interacting nearly free electron system.
While the interaction remains weak one expects a metallic behavior to persist.

By contrast, let us consider the half-filled system where the average site occupancy is
unity. Here, if the interaction is weak, $U/t \ll 1$, one may again expect properties reminiscent
of the weakly interacting electron system.¹¹ If, on the other hand, the interaction is very
strong, $U/t \gg 1$, site double occupancy is inhibited and electrons in the half-filled system
become “jammed”: migration of an electron to a neighboring lattice site necessitates site
double occupancy incurring an energy cost $U$. Here, in this strongly correlated state, the
mutual Coulomb interaction between the electrons drives the system from a metallic to an
insulating phase with properties very different from those of a conventional band insulator.

Sir Nevill Francis Mott 1905-96

Nobel Laureate in Physics in 1977, with Philip W. Anderson and John H. van Vleck, for their
“fundamental theoretical investigations of the electronic structure of magnetic and disordered
systems.” Amongst his contributions to science, Mott provided a theoretical basis to understand
the transition of materials from metallic to nonmetallic states (the Mott transition).
(Image © The Nobel Foundation.)

INFO Despite the ubiquity of the experimental phenomenon (first predicted in a celebrated
work by Mott) the nature of the Mott-Hubbard transition from the metallic to the insulat-
ing phase in the half-filled system has been the subject of considerable discussion and debate.
In the original formulation, following a suggestion of Peierls, Mott conceived of an insulator
characterized by two “Hubbard bands” with a bandwidth $\sim t$ separated by a charge gap $U$.¹²
States of the upper band engage site double occupancy while those states that make up the lower
band do not. The transition between the metallic and the insulating phase was predicted to
occur when the interaction was sufficiently strong that a charge gap develops between the bands.
Later, starting from the weakly interacting Fermi-liquid, Brinkman and Rice¹³ proposed that
the transition was associated with the localization of quasi-particles created by an interaction
driven renormalization of the effective mass. Finally, a third school considers the transition to
the Mott insulating phase as inexorably linked to the development of magnetic correlations in
the weak coupling system - the Slater instability.

In fact, the characterizations of the transition above are not mutually exclusive. Indeed, in the 
experimental system, one finds that all three possibilities are, in a sense, realized. In particular, 
a transition between the Mott insulating phase and an itinerant electron phase can be realized 
in two ways. In the first case, one can reduce the interaction strength U/t while, in the second, 
one can introduce charge carriers (or vacancies) into the half-filled system. Experimentally, the 
characteristic strength of the interaction is usually tuned by changing the bandwidth t through 
external pressure (see Fig. 2.6), while a system may be doped away from half-filling by chemical 
substitution. Remarkably, by focusing on the scaling behavior close to the critical end-point, 
researchers have been able to show that the Mott transition in this system belongs to the uni- 
versality class of the three-dimensional Ising model (see the discussion of critical phenomena in 
Chapter 8). Lowering the temperature, both the Mott insulating phase and the strongly corre- 
lated metallic phase exhibit a transition to a magnetic phase where the local moments order 
antiferromagnetically (for the explanation of this phenomenon, see below). 

Experimentally, it is often found that the low-temperature phase of the Mott insulator is 
accompanied by the anti-ferromagnetic ordering of the local moments. The origin of 
these magnetic correlations can be traced to a mechanism known as superexchange and 
can be understood straightforwardly within the framework of the Hubbard model system. 


11 In fact, one has to exercise some caution since the commensurability of the Fermi wavelength with the lattice 
can initiate a transition to an insulating spin density wave state characterized by a small quasi-particle energy 
gap. In later chapters, we will discuss the nature of this Slater instability (J. C. Slater, Magnetic effects and 
the Hartree-Fock equation, Phys. Rev. 82 (1951), 538-41) within the framework of the quantum field integral. 

12 N. F. Mott, The basis of the electron theory of metals with special reference to the transition metals, Proc. 
Roy. Soc. A 62 (1949), 416-22 - for a review see, e.g., N. F. Mott, Metal-insulator transition, Rev. Mod. Phys. 
40 (1968), 677-83, or N. F. Mott, Metal-Insulator Transitions , 2nd ed. (Taylor and Francis, 1990). 

13 W. Brinkman and T. M. Rice, Application of Gutzwiller’s variational method to the metal-insulator transition, 
Phys. Rev. B 2 (1970), 4302-4. 

Figure 2.6 Conductivity of Cr-doped V$_2$O$_3$ as a function of decreasing pressure and temperature. 
At temperatures below the Mott-Hubbard transition point ($P_c = 3738$ bar, $T_c = 457.5$ K) the 
conductivity reveals hysteretic behavior characteristic of a first-order transition. (Reprinted from 
P. Limelette, A. Georges, D. Jerome, et al., Universality and critical behavior at the Mott transition, 
Science 302 (2003), 89-92. Copyright 2003 AAAS.) 


To this end, one may consider a simple “two-site” system from which the characteristics 
of the infinite lattice system can be inferred. For the two-site system, at half-filling (i.e. 
with just two electrons to share between the two sites), one can identify a total of six 
basis states: two spin-polarized states, $a^\dagger_{1\uparrow} a^\dagger_{2\uparrow}|\Omega\rangle$ and $a^\dagger_{1\downarrow} a^\dagger_{2\downarrow}|\Omega\rangle$, 
and four states with $S^z_{\rm total} = 0$: 

$$|s_1\rangle = a^\dagger_{1\uparrow} a^\dagger_{2\downarrow}|\Omega\rangle, \quad |s_2\rangle = a^\dagger_{2\uparrow} a^\dagger_{1\downarrow}|\Omega\rangle, \quad |d_1\rangle = a^\dagger_{1\uparrow} a^\dagger_{1\downarrow}|\Omega\rangle, \quad |d_2\rangle = a^\dagger_{2\uparrow} a^\dagger_{2\downarrow}|\Omega\rangle.$$

Recalling the constraints imposed by the Pauli principle, it is evident that the fully spin polarized 
states are eigenstates of the Hubbard Hamiltonian with zero energy, while the remaining 
eigenstates involve superpositions of the basis states $|s_i\rangle$ and $|d_i\rangle$. In the strong coupling 
limit $U/t \gg 1$, the ground state will be composed predominantly of states with no double 
occupancy, $|s_i\rangle$. To determine the precise structure of the ground state, we could simply 
diagonalize the $4 \times 4$ Hamiltonian exactly - a procedure evidently infeasible in the large 
lattice system. Instead, to gain some intuition for the extended system, we will effect a 
perturbation theory which projects the insulating system onto a low-energy effective spin 
Hamiltonian. Specifically, we will treat the hopping part of the Hamiltonian $H_t$ as a weak 
perturbation of the Hubbard interaction $H_U$. 

To implement the perturbation theory, it is helpful to invoke a canonical transformation 
of the Hamiltonian, namely 


$$H \mapsto H' \equiv e^{-tO} H e^{tO} = e^{-t[O,\;]} H \equiv H - t[O, H] + \frac{t^2}{2!}[O, [O, H]] + \cdots, \tag{2.30}$$

where the exponentiated commutator is defined by the series expansion on the right. 

EXERCISE Prove the second equality. (Hint: Consider the derivative of $H'$ with respect to $t$.) 

By choosing the operator $O$ such that $H_t + t[H_U, O] = 0$, all terms at first order in $t$ can 
be eliminated from the transformed Hamiltonian. As a result, the effective Hamiltonian is 
brought to the form 

$$H' = H_U + \frac{t}{2}[H_t, O] + \mathcal{O}(t^3). \tag{2.31}$$

Applying the ansatz $tO = [P_s H_t P_d - P_d H_t P_s]/U$, where $P_s$ and $P_d$ are operators that project 
onto the singly and doubly occupied subspaces respectively, the first-order cancellation is 
assured. 

EXERCISE To verify this statement, take the matrix elements of the first-order equation with 
the basis states. Alternatively, it can be confirmed by inspection, noting that $P_s P_d = 0$, $H_U P_s = 0$ 
and, in the present case, $P_s H_t P_s = P_d H_t P_d = 0$. 

Substituting this result into Eq. (2.31) and projecting onto the singly occupied subspace 
one obtains 

$$P_s H' P_s = -\frac{1}{U} P_s H_t P_d H_t P_s = -\frac{2t^2}{U} P_s \left( 1 + a^\dagger_{1\sigma} a^\dagger_{2\sigma'} a_{1\sigma'} a_{2\sigma} \right) P_s = J \left( \mathbf{S}_1 \cdot \mathbf{S}_2 - \frac{1}{4} \right),$$

where $J = 4t^2/U$ denotes the strength of the antiferromagnetic exchange interaction 
that couples the spins on neighboring sites. 
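As a numerical cross-check of the superexchange estimate, the following minimal sketch diagonalizes the two-site Hubbard Hamiltonian in the $S^z_{\rm total} = 0$ sector and compares the lowest level with $-J = -4t^2/U$. It assumes the basis ordering $(|s_1\rangle, |s_2\rangle, |d_1\rangle, |d_2\rangle)$ with $|s_1\rangle = a^\dagger_{1\uparrow} a^\dagger_{2\downarrow}|\Omega\rangle$, $|s_2\rangle = a^\dagger_{2\uparrow} a^\dagger_{1\downarrow}|\Omega\rangle$, $|d_i\rangle = a^\dagger_{i\uparrow} a^\dagger_{i\downarrow}|\Omega\rangle$, for which the hopping term gives $H_t|s_i\rangle = -t(|d_1\rangle + |d_2\rangle)$. The function names and the small Jacobi eigenvalue routine are illustrative choices, not part of the text.

```python
import math


def hubbard_two_site_matrix(t, U):
    """Two-site Hubbard Hamiltonian in the assumed basis (|s1>, |s2>, |d1>, |d2>)."""
    return [
        [0.0, 0.0, -t, -t],
        [0.0, 0.0, -t, -t],
        [-t, -t, U, 0.0],
        [-t, -t, 0.0, U],
    ]


def jacobi_eigenvalues(matrix, sweeps=100, tol=1e-24):
    """Eigenvalues of a real symmetric matrix via cyclic Jacobi rotations."""
    a = [row[:] for row in matrix]
    n = len(a)
    for _ in range(sweeps):
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off < tol:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if a[p][q] == 0.0:
                    continue
                # Rotation angle chosen so that the (p, q) element vanishes.
                if a[p][p] == a[q][q]:
                    theta = math.pi / 4.0
                else:
                    theta = 0.5 * math.atan(2.0 * a[p][q] / (a[q][q] - a[p][p]))
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):  # columns p and q: A -> A R
                    akp, akq = a[k][p], a[k][q]
                    a[k][p] = c * akp - s * akq
                    a[k][q] = s * akp + c * akq
                for k in range(n):  # rows p and q: A -> R^T A
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k] = c * apk - s * aqk
                    a[q][k] = s * apk + c * aqk
    return sorted(a[i][i] for i in range(n))


def two_site_spectrum(t, U):
    """Sorted energy levels of the S^z = 0 sector."""
    return jacobi_eigenvalues(hubbard_two_site_matrix(t, U))


if __name__ == "__main__":
    t, U = 1.0, 20.0
    print("exact levels:", two_site_spectrum(t, U))
    print("superexchange estimate -4 t^2 / U:", -4.0 * t * t / U)
```

In this sector the exact levels are $0$ (the $S^z = 0$ triplet component), $U$, and $(U \pm \sqrt{U^2 + 16t^2})/2$. For $U/t \gg 1$ the lowest of these approaches $-4t^2/U$, the singlet energy of $J(\mathbf{S}_1 \cdot \mathbf{S}_2 - 1/4)$, while the triplet stays at zero energy.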

EXERCISE Remembering the anti-commutation relations of the electron operators, find the 
matrix elements of the Hubbard Hamiltonian on the four basis states $|s_i\rangle$ and $|d_i\rangle$. Diagonalizing 
the $4 \times 4$ matrix Hamiltonian, obtain the eigenstates of the system. In the strong coupling system 
$U/t \gg 1$, determine the spin and energy dependence of the ground state. 

The perturbation theory above shows that electrons subject to a strong local repulsive Coulomb 
interaction have a tendency to adopt an antiparallel or antiferromagnetic spin configuration 
between neighboring sites. This observation has a simple physical interpretation. Antiparallel 
spins can take advantage of the hybridization (however small) and reduce their kinetic energy by 
hopping to a neighboring site (see Fig. 2.7). Parallel spins on the other hand are restricted from 
participating in this virtual process by the Pauli principle. This mechanism, which involves 
a two-step process, was first formulated by Anderson 14 and is known as superexchange. 


Philip W. Anderson 1923- 

Nobel Laureate in Physics in 1977, with Sir Nevill Mott and John H. van Vleck, for their 
“fundamental theoretical investigations of the electronic structure of magnetic and disordered 
systems.” Anderson made numerous contributions to theoretical physics from theories of 
localization and antiferromagnetism to superconductivity. 



14 P. W. Anderson, Antiferromagnetism. Theory of superexchange interaction, Phys. Rev. 79 (1950), 350-6. 

Figure 2.7 Top: hybridization of spin polarized states is forbidden by Pauli exclusion. Bottom: 
superexchange mechanism by which two antiparallel spins can lower their energy by a virtual 
process in which the upper Hubbard band is occupied. 


The calculation presented above is easily generalized to an extended lattice system. Once 
again, projecting onto a basis in which all sites are singly occupied, virtual exchange processes 
favor an antiferromagnetic arrangement of neighboring spins. Such a correlated magnetic 
insulator is described by the quantum spin-1/2 Heisenberg Hamiltonian 

$$H = J \sum_{\langle mn \rangle} \mathbf{S}_m \cdot \mathbf{S}_n, \tag{2.32}$$

where, as usual, $\langle mn \rangle$ denotes a sum over neighboring spins on the lattice and the positive 
exchange constant $J \sim t^2/U$. While, in the insulating magnetic phase, the charge degrees 
of freedom remain “quenched,” spin fluctuations can freely propagate. 

When doped away from half-filling, the behavior of the Hubbard model is notoriously 
difficult to resolve. The removal of electrons from the half-filled system introduces vacancies 
into the “lower Hubbard band” that may propagate through the lattice. For a low con- 
centration of holes, the strong coupling Hubbard system may be described by the effective 
t-J Hamiltonian, 


$$H_{t\text{-}J} = -t \sum_{\langle mn \rangle} P_s a^\dagger_{m\sigma} a_{n\sigma} P_s + J \sum_{\langle mn \rangle} \mathbf{S}_m \cdot \mathbf{S}_n.$$

However, the passage of vacancies is frustrated by the antiferromagnetic spin correlations 
of the background. Here transport depends sensitively on the competition between the 
exchange energy of the spins and the kinetic energy of the holes. Oddly, at $J = 0$ (i.e. 
$U = \infty$), the ground state spin configuration is known to be driven ferromagnetic by a 
single hole while, for J > 0, it is generally accepted that a critical concentration of holes is 
required to destabilize antiferromagnetic order. 

EXERCISE At $U \to \infty$, all $2^N$ states of the half-filled Hubbard model are degenerate - each 
site is occupied by a single electron of arbitrary spin. This degeneracy is lifted by the removal 
of a single electron from the lower Hubbard band. In such a case, there is a theorem due to 
Nagaoka 15 that, on a bipartite lattice (i.e. one in which the neighbors of one sublattice belong to 
the other sublattice), the ground state is ferromagnetic. For a four-site “plaquette” with three 
electrons determine the eigenspectrum of the Hubbard system with $U = \infty$ within the manifold 
(a) $S^z_{\rm total} = 3/2$, and (b) $S^z_{\rm total} = 1/2$. In each case, determine the total spin of the ground state. 
(Hint: In (b) there are a total of 12 basis states - here it is useful to arrange these states in the 
order in which they are generated by application of the Hamiltonian.) 


INFO The rich behavior of the Mott-Hubbard system is nowhere more exemplified than in the 
ceramic cuprate system - the basic material class of the high-temperature superconductors. 
Cuprates are built of layers of CuO$_2$ separated by heavy rare earth ions such as lanthanum. Here, 
the copper ions adopt a square lattice configuration separated by oxygen ions. At half-filling, 
electrons in the outermost occupied shell of the copper sites in the plane adopt a partially filled 
$3d^9$ configuration, while the oxygen sites are completely filled. Elevated in energy by a frozen 
lattice distortion, the Fermi energy lies in the $d_{x^2-y^2}$ orbital of copper. According to a simple 
band picture, the single band is exactly half-filled (one electron per Cu site) and should, 
therefore, be metallic. However, strong electron interaction 
drives the cuprate system into an insulating antiferromagnetic Mott-Hubbard phase. 

When doped away from half-filling (by, for example, replacing the rare earth atoms by others 
with a different stoichiometry; see the figure, where the phase diagram of La$_{2-x}$Sr$_x$CuO$_4$ is shown 
as a function of the concentration x of Sr atoms replacing La atoms and temperature), charge 
carriers are introduced into the “lower Hubbard band.” However, in this case, the collapse of the 
Hubbard gap and loss of antiferromagnetic (AF) order is accompanied by the development of a 
high-temperature unconventional superconducting (SC) phase whose mechanism is believed to be 
rooted in the exchange of antiferromagnetic spin fluctuations. Whether the rich phenomenology 
of the cuprate system is captured by the Hubbard model remains a subject of great interest and 
speculation. 





This concludes our preliminary survey of the rich phenomenology of the interacting electron 
system. Notice that, so far, we have merely discussed ways to distill a reduced model from the 
original microscopic many-body Hamiltonian (2.17). However, save for the two examples of free 
field theories analyzed in Chapter 1, we have not yet learned how methods of second quantization 
can be applied to actually solve problems. To this end, in the following section we will illustrate 
the application of the method on a prominent strongly interacting problem. 


15 Y. Nagaoka, Ferromagnetism in a narrow, almost half-filled s-band, Phys. Rev. 147 (1966), 392-405. 

Within the context of many-body physics, a theory is termed free if the Hamiltonian is 
bilinear in creation and annihilation operators, i.e. $H \sim \sum_{\mu\nu} a^\dagger_\mu H_{\mu\nu} a_\nu$, where $H$ may be 
a finite- or infinite-dimensional matrix. 16 Such models are “solvable” in the sense that the 
solution of the problem simply amounts to a diagonalization of the matrix $H_{\mu\nu}$ (subject to 
the preservation of the commutation relations of the operators $a$ and $a^\dagger$). However, only a 
few models of interest belong to this category. In general, interaction contributions typically 
quartic in the field operators are present and complete analytical solutions are out of reach. 

Yet there are a few precious examples of genuinely interacting systems that are amenable 
to (nearly) exact solution. In this section we will address an important representative of 
this class, namely the one-dimensional interacting electron gas. Not only is the analysis of 
this system physically interesting but, in addition, it provides an opportunity to practice 
working with the second quantized operator formalism on a deeper level. 

Qualitative discussion 

Consider the nearly free electron Hamiltonian (2.18) and (2.19) reduced to a one-dimensional 
environment. Absorbing the chemical potential $E_F$ into the definition of the Hamiltonian, 
and neglecting spin degrees of freedom (e.g. one might consider a fully spin polarized band), 

$$H = \sum_k a^\dagger_k \left( \frac{k^2}{2m} - E_F \right) a_k + \frac{1}{2L} \sum_{kk',\,q \neq 0} V_{ee}(q)\, a^\dagger_{k-q} a^\dagger_{k'+q} a_{k'} a_k. \tag{2.33}$$

INFO At first sight, the treatment of a one-dimensional electron system may seem an aca- 
demic exercise. However, effective one-dimensional interacting fermion systems are realized in 
a surprisingly rich spectrum of materials. We have already met with carbon nanotubes 
above. A nanotube is surrounded by clouds of mobile electrons (see earlier discussion in sec- 
tion 2.2). With the latter, confinement of the circumferential direction divides the system into 
a series of one-dimensional bands, each classified by a sub-band index and a wavenumber k. 
At low temperatures, the Fermi surface typically intersects a single sub-band, allowing atten- 
tion to be drawn to a strictly one-dimensional system. A similar mechanism renders certain 
organic molecules (such as the Bechgaard salt (TMTSF)$_2$PF$_6$, where TMTSF stands for 
tetramethyl-tetraselenafulvalene) one-dimensional conductors. 

A third, solid state, realization is presented by artificial low-dimensional structures fab- 
ricated from semiconducting devices. Redistribution of electron charge at the interface of a 
GaAs/AlGaAs heterostructure results in the formation of a two-dimensional electron gas. 
By applying external gates, it is possible to fabricate quasi-one-dimensional semiconductor 
quantum wires in which electron motion in the transverse direction is impeded by a large 
potential gradient (Fig. 2.8 (a)). At sufficiently low Fermi energies, only the lowest eigenstate 
of the transverse Schrödinger equation (the lowest “quantum mode”) is populated and one is 
left with a strictly one-dimensional electron system. There are other realizations, such as edge 
modes in quantum Hall systems, “stripe phases” in high-temperature superconductors, or 
certain inorganic crystals, but we shall not discuss these here explicitly. 


16 More generally, a free Hamiltonian may also contain contributions $\sim a_\mu a_\nu$ and $a^\dagger_\mu a^\dagger_\nu$. 

Figure 2.8 Different realizations of one-dimensional electron systems. (a) Steep potential well 
(realizable in, e.g., gated two-dimensional electron systems). (b) (Approximately) cylindrical quantum 
system (carbon nanotubes, quasi-one-dimensional molecules, etc.). In both cases, the single-particle 
spectrum is subject to mechanisms of size quantization. This leads to the formation of “minibands” 
(indicated by shaded areas in the figure), structureless in the transverse direction and extended in 
the longitudinal direction. 

The one-dimensional fermion system exhibits a number of features not shared by higher- 
dimensional systems. The origin of these peculiarities can be easily understood from a simple 
qualitative picture. Consider an array of interacting fermions confined to a line. To optimize 
their energy the electrons can merely “push” each other around, thereby creating density 
fluctuations. By contrast, in higher-dimensional systems, electrons are free to avoid contact 
by moving around each other. A slightly different formulation of the same picture can be 
given in momentum space. The Fermi “sphere” of the one-dimensional system is defined 
through the interval $[-k_F, k_F]$ of filled momentum states. The Fermi “surface” consists 
of two isolated points, $\{k_F, -k_F\}$ (see the figure below). By contrast, higher-dimensional 
systems typically exhibit continuous and simply connected Fermi surfaces. It takes little 
imagination to anticipate that an extended Fermi sphere provides more phase space to 
two-particle interaction processes than the two isolated Fermi energy sectors of the one- 
dimensional system. The one-dimensional electron system represents a rare exception of an 
interacting system that can be solved under no more than a few, physically weak, simplifying 
assumptions. This makes it a precious test system on which non-perturbative quantum 
manifestations of many-body interactions can be explored. 

Quantitative analysis 

We now proceed to develop a quantitative picture of the charge density excitations of 
the one-dimensional electron system. Anticipating that, at low temperatures, the relevant 
dynamics will take place in the vicinity of the two Fermi points $\{k_F, -k_F\}$, the Hamiltonian 
(2.33) can be reduced further to an effective model describing the propagation of left and 
right moving excitations. To this end, we first introduce the notation that the subscripts 
R/L indicate that an operator $a^\dagger_{(+/-)k_F+q}$ creates an electron that moves to the right/left 
with velocity $\simeq v_F \equiv k_F/m$. 

We next observe (see the figure below) that, in the immediate vicinity of the Fermi points, 
the dispersion relation is approximately linear, implying that the non-interacting part of 
the Hamiltonian can be represented as (exercise) 

$$H_0 \simeq \sum_{s=R,L} \sum_q a^\dagger_{sq}\, \sigma_s v_F q\, a_{sq}, \tag{2.34}$$

where $\sigma_s = (+/-)$ for $s = R/L$ and the summation over $q$ is restricted by some momentum 
cut-off $|q| < \Gamma$ beyond which the linearization of the dispersion is invalid. (Throughout this 
section, all momentum summations will be subject to this constraint.) Turning to the interacting 
part of the Hamiltonian, let us first define the operator 

$$\rho_{sq} = \sum_k a^\dagger_{sk+q} a_{sk}. \tag{2.35}$$

The definition of these operators is not just motivated by notational convenience. It is 
straightforward to verify (exercise) that $\rho_s(q)$ is obtained from the Fourier transform of the 
local density operator $\rho(x)$. In other words, $\rho_{sq}$ measures density fluctuations of characteristic 
wavelength $q^{-1}$ supported by electron excitations with characteristic momentum $\pm k_F$ 
(see Fig. 2.9(a)). From our heuristic argument above, suggesting charge density modulations 
to be the basic excitations of the system, we expect the operators $\rho_{sq}$ to represent the 
central degrees of freedom of the theory. 

Represented in terms of the density operators, the interaction contribution to the Hamiltonian 
may be recast as 

$$V_{ee} = \frac{1}{2L} \sum_{kk'q} V_{ee}(q)\, a^\dagger_{k-q} a^\dagger_{k'+q} a_{k'} a_k \equiv \frac{1}{2L} \sum_{qs} \left[ g_4\, \rho_{sq} \rho_{s-q} + g_2\, \rho_{sq} \rho_{\bar{s}-q} \right], \tag{2.36}$$

where $\bar{s} = L/R$ denotes the complement of $s = R/L$, and the constants $g_2$ and $g_4$ measure 
the strength of the interaction in the vicinity of the Fermi points, i.e. where $q \simeq 0$ and 
$q \simeq 2k_F$. (With the notation $g_{2,4}$ we follow a common convention in the literature.) 


EXERCISE Explore the relation between the coupling constants $g_2$, $g_4$ and the Fourier transform 
of $V_{ee}$. Show that to the summation $\sum_{kk'q}$, not only terms with $(k, k', q) \simeq (\pm k_F, \pm k_F, 0)$, but 
also terms with $(k, k', q) \simeq (\pm k_F, \mp k_F, 2k_F)$ contribute. When adequately ordered (do it!), these 
contributions can be arranged into the form of the right-hand side of Eq. (2.36). (For a detailed 
discussion see, e.g., T. Giamarchi, Quantum Physics in One Dimension (Oxford University Press, 
2004) or G. Mahan, Many Particle Physics (Plenum Press, 1981).) At any rate, the only point 
that matters for our present discussion is that the interaction can be represented through density 
operators with positive constants $g_{2,4}$ determined by the interaction strength. 


Figure 2.9 Two different interpretations of the excitations created by the density operators $\rho_{sq}$. 
(a) Real space; $\rho_{sq}$ creates density modulations of characteristic wavelength $q^{-1}$ and characteristic 
velocity $v_F$. (b) Momentum space; application of $\rho_{sq}$ to the ground state excites electrons from 
states $k$ to $k + q$. This creates particle-hole excitations of energy $\epsilon_{k+q} - \epsilon_k = v_F q$ independent 
of the particle/hole momentum $k$. Both particles and holes forming the excitation travel with the 
same velocity $v_F$, implying that the excitation does not disperse (i.e. decay). 

INFO Working with second quantized theories, one frequently needs to compute commutators of 
operators $A(a, a^\dagger)$ polynomial in the elementary boson/fermion operators of the theory (e.g. 
$A = a a^\dagger$, $A = a a a^\dagger a^\dagger$, etc., where we have omitted the quantum number subscripts generally 
carried by $a$ and $a^\dagger$). Such types of operation are made easier by a number of rules of elementary 
commutator algebra. The most basic identity, from which all sorts of other formulae can be 
generated recursively, is the following: 

$$[A, BC]_\pm = [A, B]_\pm C \mp B [A, C]_-. \tag{2.37}$$

Iteration of this equation for boson operators $a$, $a^\dagger$ shows that 

$$[a^\dagger, a^n] = -n a^{n-1}. \tag{2.38}$$

(Due to the fact that $a^2 = 0$ in the fermionic case, there is no fermion analog of this equation.) 
Taylor expansion then shows that, for any analytic function $F(a)$, $[a^\dagger, F(a)] = -F'(a)$. Similarly, 
another useful formula which follows from the above is the relation $a^\dagger F(a a^\dagger) = F(a^\dagger a) a^\dagger$, which 
is also verified by series expansion. 
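The boson identity $[a^\dagger, a^n] = -n a^{n-1}$ can be checked numerically with matrices in a truncated number basis. The sketch below is only an illustration under stated assumptions: the basis is cut off at `dim` states (a hypothetical parameter), and because the truncation spoils the action of $a^\dagger$ on the highest state, the comparison deliberately skips the last column. All function names are illustrative.

```python
import math


def annihilation_matrix(dim):
    """Matrix of the boson operator a in the number basis |0>, ..., |dim-1>."""
    a = [[0.0] * dim for _ in range(dim)]
    for m in range(dim - 1):
        a[m][m + 1] = math.sqrt(m + 1)  # <m| a |m+1> = sqrt(m+1)
    return a


def matmul(x, y):
    """Product of two square matrices stored as nested lists."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def matrix_power(x, n):
    """n-th power of a square matrix (n >= 0)."""
    dim = len(x)
    result = [[1.0 if i == j else 0.0 for j in range(dim)] for i in range(dim)]
    for _ in range(n):
        result = matmul(result, x)
    return result


def commutator_deviation(n, dim=12):
    """Largest deviation of [a^dagger, a^n] from -n a^(n-1), ignoring the
    last column, which is corrupted by the truncation of the basis."""
    a = annihilation_matrix(dim)
    a_dag = [list(row) for row in zip(*a)]  # real matrix: dagger = transpose
    a_n = matrix_power(a, n)
    a_n_minus_1 = matrix_power(a, n - 1)
    left = matmul(a_dag, a_n)
    right = matmul(a_n, a_dag)
    return max(
        abs(left[i][j] - right[i][j] + n * a_n_minus_1[i][j])
        for i in range(dim)
        for j in range(dim - 1)
    )


if __name__ == "__main__":
    for n in range(1, 5):
        print(n, commutator_deviation(n))
```

The deviations are at the level of floating-point rounding. Including the last column would instead show an error of order `dim`, a reminder that canonical commutation relations cannot be realized exactly in a finite-dimensional space.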


So far, we have merely rewritten parts of the Hamiltonian in terms of density operators. 
Ultimately, however, we wish to arrive at a representation whereby these operators, instead 
of the original electron operators, represent the fundamental degrees of freedom of the 
theory. Since the definition of the operators $\rho$ involves the product of two Fermi operators, we 
expect the density operators to resemble bosonic excitations. Thus, as a first and essential 
step towards the construction of the new picture, we explore the commutation relations 
between the operators $\rho_{sq}$. 

From the definition (2.35) and the auxiliary identity (2.37) it is straightforward to verify 
the commutation relation $[\rho_{sq}, \rho_{s'q'}] = \delta_{ss'} \sum_k (a^\dagger_{sk+q} a_{sk-q'} - a^\dagger_{sk+q+q'} a_{sk})$. As it stands, this 
relation is certainly not of much practical use. To make further progress, we must resort to a 
(not very restrictive) approximation. Ultimately we will want to compute some observables 
involving quantum averages taken on the ground state of the theory, $\langle\Omega| \ldots |\Omega\rangle$. To simplify 
the structure of the theory, we may thus replace the right-hand side of the relation by its 
ground state expectation value: 

$$[\rho_{sq}, \rho_{s'q'}] \approx \delta_{ss'} \sum_k \langle\Omega| a^\dagger_{sk+q} a_{sk-q'} - a^\dagger_{sk+q+q'} a_{sk} |\Omega\rangle = \delta_{ss'} \delta_{q,-q'} \sum_k \langle\Omega| (\hat{n}_{sk+q} - \hat{n}_{sk}) |\Omega\rangle,$$

where, as usual, $\hat{n}_{sk} = a^\dagger_{sk} a_{sk}$, and we have made use of the fact that $\langle\Omega| a^\dagger_{sk} a_{sk'} |\Omega\rangle \propto \delta_{kk'}$. 
Although this is an uncontrolled approximation, it is expected to become better the closer we 
stay to the zero-temperature ground state $|\Omega\rangle$ of the theory (i.e. at low excitation energies). 

EXERCISE Try to critically assess the validity of the approximation. (For a comprehensive 
discussion, see the text by Giamarchi. 17 ) 

At first glance, it would seem that the right-hand side of our simplified commutator relation 
actually vanishes. A simple shift of the summation index, $\sum_k \langle\Omega| \hat{n}_{sk+q} |\Omega\rangle = \sum_k \langle\Omega| \hat{n}_{sk} |\Omega\rangle$, 
indicates that the two terms contributing to the sum cancel. However, this argument is 
certainly too naive. It ignores the fact that our summation is limited by a cut-off momentum 
$\Gamma$. Since the shift $k \to k - q$ changes the cut-off, the interpretation above is invalid. 

To obtain a more accurate result, let us consider the case $s = R$ and $q > 0$. We know 
that, in the ground state, all states with momentum $k < 0$ are occupied while all states 
with $k > 0$ are empty. This implies that 

$$\sum_k \langle\Omega| (\hat{n}_{Rk+q} - \hat{n}_{Rk}) |\Omega\rangle = \Big( \sum_{-\Gamma<k<-q} + \sum_{-q<k<0} + \sum_{0<k<\Gamma} \Big) \langle\Omega| (\hat{n}_{Rk+q} - \hat{n}_{Rk}) |\Omega\rangle = \sum_{-q<k<0} \langle\Omega| (\hat{n}_{Rk+q} - \hat{n}_{Rk}) |\Omega\rangle = -\frac{qL}{2\pi},$$

where, with the last equality, we have used the fact that a momentum interval of size $q$ 
contains $q/(2\pi/L)$ quantized momentum states. Similar reasoning for $s = L$ shows that the 
effective form of the commutator relation reads 

$$[\rho_{sq}, \rho_{s'q'}] = -\delta_{ss'} \delta_{q,-q'} \sigma_s \frac{qL}{2\pi}. \tag{2.39}$$
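The role of the cut-off in this counting argument can be made concrete with a few lines of code. The sketch below is a minimal illustration for right movers, under the following assumptions: momenta are quantized as $k = 2\pi m/L$ with integer $m$, states with $m < 0$ are filled, the sum runs over $|m| <$ `m_cut` (a hypothetical integer cut-off), and occupations of states outside the summation window are still those of the filled Fermi sea. The function names are illustrative.

```python
import math


def occupation(m):
    """Ground-state occupation of the right-moving state with k = 2*pi*m/L:
    filled for k < 0, empty otherwise."""
    return 1 if m < 0 else 0


def cutoff_sum(j, m_cut):
    """Sum over |m| < m_cut of <n_{m+j} - n_m>, for a shift q = 2*pi*j/L."""
    return sum(occupation(m + j) - occupation(m)
               for m in range(-m_cut + 1, m_cut))


if __name__ == "__main__":
    L = 200.0
    j, m_cut = 3, 50
    q = 2.0 * math.pi * j / L
    print("cut-off sum:", cutoff_sum(j, m_cut))
    print("-q L / (2 pi):", -q * L / (2.0 * math.pi))
```

Only the $j$ states with $-q \le k < 0$ contribute, each with weight $-1$, so the sum equals $-j = -qL/2\pi$ independently of the cut-off, as long as $|j|$ is smaller than `m_cut`. Shifting the summation window together with the index, as in the naive argument, is what would incorrectly give zero.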

Now, if it were not for the $q$-dependence of the right-hand side of this relation, we would 
indeed have found (approximate) bosonic commutation relations. Therefore, to make the 
connection to bosons explicit, let us define 

$$b_q \equiv n_q \rho_{Lq}, \quad b^\dagger_q \equiv n_q \rho_{L(-q)}, \quad b_{-q} \equiv n_q \rho_{R(-q)}, \quad b^\dagger_{-q} \equiv n_q \rho_{Rq}, \tag{2.40}$$

where $q > 0$ and $n_q \equiv (2\pi/Lq)^{1/2}$. It is easily confirmed that the newly defined operators $b_q$ 
obey canonical commutation relations (exercise), i.e. we have indeed found that, apart from 
the scaling factors $n_q$, the density excitations of the system behave as bosonic “particles.” 
Expressed in terms of the operators $b$, the interaction part of the Hamiltonian takes the 
form (exercise) 

$$V_{ee} = \frac{1}{2\pi} \sum_{q>0} q \begin{pmatrix} b^\dagger_q & b_{-q} \end{pmatrix} \begin{pmatrix} g_4 & g_2 \\ g_2 & g_4 \end{pmatrix} \begin{pmatrix} b_q \\ b^\dagger_{-q} \end{pmatrix}.$$

Notice that we have succeeded in representing a genuine two-body interaction, a contribution 
that usually renders a model unsolvable, in terms of a quadratic representation. However, 
the free boson representation of the interaction term will be of little use until the kinetic 
part of the Hamiltonian $H_0$ is represented in terms of the $b$ operators. There are various 
ways of achieving this goal. The most straightforward route, a direct construction of a 
representation of $H_0$ in terms of the Bose operators, is cumbersome in practice. However, 
there exists a more efficient way that is based on indirect reasoning. As follows from the 
discussion of Section 2.1, the properties of second quantized operators are fixed by their 
commutation relations. 18 So what we are going to do is search for an operator $H'_0(b, b^\dagger)$ 
that has the same commutation relations with the boson operators $(b, b^\dagger)$ as the original 
kinetic energy operator $H_0(a, a^\dagger)$. Using Eq. (2.34), the definition (2.35), and the auxiliary 
identity (2.37), it is straightforward to verify that $[H_0, \rho_{sq}] = q v_F \sigma_s \rho_{sq}$. On the other hand, 
using Eq. (2.39) one finds that the same commutation relations hold with the operator 

$$H'_0 = \frac{\pi v_F}{L} \sum_{qs} \rho_{sq} \rho_{s-q},$$

i.e. $[H'_0, \rho_{sq}] = q v_F \sigma_s \rho_{sq}$. Following the logic of our argument we thus identify $H_0 = H'_0$ (up 
to inessential constants) and substitute $H'_0$ for the non-interacting Hamiltonian. 

17 T. Giamarchi, Quantum Physics in One Dimension (Oxford University Press, 2004). 

EXERCISE To gain some confidence in the identification $H_0 = H'_0 + {\rm const.}$, and to show that the 
undetermined constant actually equals zero, compute the energy expectation value of the state 
$|\Omega_{sq}\rangle \equiv \rho_{sq}|\Omega\rangle$ both as $\langle\Omega_{sq}| H_0 |\Omega_{sq}\rangle$ and as $\langle\Omega_{sq}| H'_0 |\Omega_{sq}\rangle$. Confirm that the two 
expressions coincide. 


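Finally, using Eq. (2.40) and adding the interaction contribution $V_{ee}$ we arrive at the effective 
Hamiltonian 

$$H = \sum_{q>0} q \begin{pmatrix} b^\dagger_q & b_{-q} \end{pmatrix} \begin{pmatrix} v_F + \frac{g_4}{2\pi} & \frac{g_2}{2\pi} \\ \frac{g_2}{2\pi} & v_F + \frac{g_4}{2\pi} \end{pmatrix} \begin{pmatrix} b_q \\ b^\dagger_{-q} \end{pmatrix}. \tag{2.41}$$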

We have thus succeeded in mapping the full interacting problem onto a free bosonic theory. 
The mapping $a \to \rho \to b$ is our first example of a technique known as bosonization. 
Such techniques play an important role in 2 (= 1 space + 1 time)-dimensional field theory in 
general. More sophisticated bosonization schemes will be discussed in Sections 4.3 and 9.4.4. 
Conversely, it is sometimes useful to represent a boson problem in terms of fermions via 
fermionization. One may wonder why it is indeed possible to effortlessly represent the 
low-lying excitations of a gas of fermions in terms of bosons. Fermi-Bose transmutability 
is indeed a peculiarity of one-dimensional quantum systems. Particles confined to a line 
cannot pass “around” each other. That means that the whole issue of sign factors arising 
from the interchange of particle coordinates does not arise, and much of the exclusion-type 
characteristics of the Fermi system are inactivated. A more systematic formulation of Fermi 
$\leftrightarrow$ Bose transformations will be discussed in Chapter 4. 

18 This argument can be made quantitative by group theoretical reasoning: Eq. (2.4) and (2.7) define the 
irreducible representation of an operator algebra - an algebra because [ , ] defines a product in the space of 
generators $\{a_\lambda, a^\dagger_\lambda\}$, a representation because the operators act in a vector space (namely Fock space $\mathcal{F}$), which 
is irreducible because all states $|\lambda_1, \ldots, \lambda_N\rangle \in \mathcal{F}$ can be reached by iterative application of operators onto a 
unique reference state (e.g. $|\Omega\rangle$). Under these conditions, Schur’s lemma - to be discussed in more detail in 
Chapter 4 - states that two operators $A_1$ and $A_2$ having identical commutation relations with all $\{a_\lambda, a^\dagger_\lambda\}$ are 
equal up to a constant. 

Now, there is one last problem that needs to be overcome to actually solve the interacting 
problem. In Chapter 1, we learned how to interpret Hamiltonians of the structure $\sim b^\dagger_q b_q$ as 
superpositions of harmonic oscillators. However, in our present problem, terms of the type 
$b_q b_{-q}$ and $b^\dagger_q b^\dagger_{-q}$ appear. To return to familiar terrain, we need to eliminate these terms. 
However, before doing so, it is instructive to discuss the physical meaning of the problem. 

Firstly, let us recall that the total number operator of a theory described by operators 
$b^\dagger_\mu$, $b_\mu$ is given by $\hat{N} = \sum_\mu b^\dagger_\mu b_\mu$. Now, if the Hamiltonian has the form $H = \sum_{\mu\nu} b^\dagger_\mu H_{\mu\nu} b_\nu$, 
the total number operator commutes with $H$, i.e. $[\hat{N}, H] = 0$ (exercise). This means that $H$ 
and $\hat{N}$ can be simultaneously diagonalized, or, in more physical terms, that the Hamiltonian 
enjoys the feature of particle number conservation. More generally, any Hamiltonian in 
which operators appear as polynomials containing equal numbers of creation and annihilation 
operators (e.g. $b^\dagger b^\dagger b b$, $b^\dagger b^\dagger b^\dagger b b b$, etc.) has this property. This is because any operator 
of this structure creates as many particles as it annihilates. In problems where the total 
number of particles is conserved (e.g. the theory of interacting electrons in an isolated piece 
of metal), the Hamiltonian is bound to have this structure. Conversely, in situations where 
the number of excitations is not fixed (e.g. a theory of photons or phonons) particle number 
violating terms like $bb$ or $b^\dagger b^\dagger$ can appear. Such a situation is realized in our present problem; 
the number of density excitations in an electron system is certainly not a conserved 
quantity, which explains why contributions like $b_q b_{-q}$ appear in $H$. 

To eliminate the non-particle-number-conserving contributions we should, somehow, 
transform the matrix 


K = 


^ + t § 

£ VF + t 


to a diagonal structure. Transformations of K can be generated by transforming the opera- 
tors b q and to a different representation. Specifically, with = (6j, b_ q ) T , we may define 
Af'q = T _1 \ \! q , where T is a 2 x 2 matrix acting on the two components of T. (Since K does 
not depend on q, T can be chosen to have the same property.) After the transformation, 
the Hamiltonian will have the form 


H = ^q^\ K A> q ^^q^T^KT%, (2.42) 

9>0 q> 0 K ' 

with a new matrix K' = T 1 KT. We will seek for a transformation T that makes K' diag- 
onal. However, an important point to be kept in mind is that not all 2 x 2 matrices T 
qualify as transformations. We must ensure that the transformed “vector” again has the 
structure \E' q = (l/ ( J , b’_ q ) , with a boson creation/annihilation operator in the first/second 
component - i.e. the commutation relations of the operators must be conserved by the 
transformation. Remembering that the algebraic properties of the operators b are speci- 
fied through commutation relations, this condition can be cast in mathematical form by 
requiring that the commutator be invariant under the 

transformation. Using the fact that T' = T -1 \E', this condition is seen to be equivalent to 
the pseudo-unitarity condition, T^a^T = <73. 
With this background, we are now in a position to find a transformation that brings the
matrix $K'$ to a $2 \times 2$ diagonal form. Multiplication of the definition $K' = T^\dagger K T$ by
$\sigma_3$ leads to

\[
T^\dagger K T = K' \;\Leftrightarrow\; \underbrace{\sigma_3 T^\dagger \sigma_3}_{T^{-1}}\, \sigma_3 K T = \sigma_3 K'.
\]

This means that the matrix $\sigma_3 K'$ is obtained by a similarity transformation $T^{-1}(\cdots)T$
from the matrix $\sigma_3 K$, or, in other words, that the matrix $\sigma_3 K'$ contains the eigenvalues
$\pm v_\rho$ of $\sigma_3 K$ on its diagonal. (That the eigenvalues sum to 0 follows from the fact that
the trace vanishes, $\mathrm{tr}(\sigma_3 K) = 0$.) However, the eigenvalues of $\sigma_3 K$ are readily
computed as

\[
v_\rho \equiv \frac{1}{2\pi}\left[ (2\pi v_F + g_4)^2 - g_2^2 \right]^{1/2}. \tag{2.43}
\]

Thus, with $\sigma_3 K' = \sigma_3 v_\rho$ we arrive at $K' = v_\rho \cdot \mathrm{id.}$, where “id.” stands for the
unit matrix.¹⁹ Substitution of this result into Eq. (2.42) finally leads to the diagonal
Hamiltonian $H = v_\rho \sum_{q>0} q\, \Psi_q'^\dagger \Psi'_q$ or equivalently, making use of the identity
$\Psi_q'^\dagger \Psi'_q = b_q'^\dagger b'_q + b_{-q}'^\dagger b'_{-q} + 1$,

\[
H = v_\rho \sum_q |q|\, b_q^\dagger b_q. \tag{2.44}
\]

Here we have ignored an overall constant and omitted the prime on our new Bose operators. 
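The diagonalization can be checked numerically. The following is a minimal sketch, not part of the original derivation: it assumes the hyperbolic parameterization $T = \begin{pmatrix} \cosh\theta & \sinh\theta \\ \sinh\theta & \cosh\theta \end{pmatrix}$ with $\tanh(2\theta) = -g_2/(2\pi v_F + g_4)$, and the numerical values of $v_F$, $g_2$ and $g_4$ are arbitrary illustrative choices. The helper names `matmul2` and `bogoliubov` are hypothetical.

```python
import math

def matmul2(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def bogoliubov(v_f, g2, g4):
    """Diagonalize K = [[A, B], [B, A]] with a real pseudo-unitary T.

    Returns (v_rho, K_prime, pseudo) with K_prime = T^T K T and
    pseudo = T^T sigma_3 T (T is real, so T^dagger = T^T).
    """
    A = v_f + g4 / (2 * math.pi)          # diagonal entry of K
    B = g2 / (2 * math.pi)                # off-diagonal entry of K
    v_rho = math.sqrt(A * A - B * B)      # positive eigenvalue of sigma_3 K
    theta = 0.5 * math.atanh(-B / A)      # tanh(2 theta) = -g2 / (2 pi v_F + g4)
    c, s = math.cosh(theta), math.sinh(theta)
    T = [[c, s], [s, c]]                  # symmetric, so T^T = T
    K = [[A, B], [B, A]]
    sigma3 = [[1.0, 0.0], [0.0, -1.0]]
    K_prime = matmul2(T, matmul2(K, T))
    pseudo = matmul2(T, matmul2(sigma3, T))
    return v_rho, K_prime, pseudo

# Illustrative (assumed) parameter values; any |g2| < 2 pi v_F + g4 will do.
v_rho, K_prime, pseudo = bogoliubov(v_f=1.0, g2=0.8, g4=0.5)
print("v_rho         =", v_rho)
print("T^T K T       =", K_prime)
print("T^T sigma_3 T =", pseudo)
```

Within rounding error, the second printed matrix should be $v_\rho$ times the unit matrix and the third should reproduce $\sigma_3$, which is the pseudo-unitarity condition.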

In the literature, the transformation procedure outlined above is known as a Bogoliubov
transformation. Transformations of this type are frequently applied in quantum magnetism
(see below), superconductivity, or, more generally, all problems where the particle number is
not conserved. Notice that the possibility to transform to a representation $\sim b^\dagger b$ does
not imply that miraculously the theory has become particle number conserving. The new
“quasi-particle” operators $b$ are related to the original Bose operators through a
transformation that mixes $b$ and $b^\dagger$. While the quasi-particle number is conserved, the
number of original density excitations is not.

Equations (2.43) and (2.44) represent our final solution of the problem of spinless
interacting fermions in one dimension. We have succeeded in mapping the problem onto a form
analogous to our previous results (1.34) and (1.39) for the phonon and the photon system,
respectively. Indeed, all that has been said about those Hamiltonians applies equally
to Eq. (2.44): the basic elementary excitations of the one-dimensional fermion system are
waves, i.e. excitations with linear dispersion $\omega = v_\rho |q|$. In the present context, they are


Nicolai Nikolaevich Bogoliubov 1909-92 

A theoretical physicist acclaimed for his works in 
nonlinear mechanics, statistical physics, theory of 
superfluidity and superconductivity, quantum field 
theory, renormalization group theory, proof of dis- 
persion relations, and elementary particle theory. 


¹⁹ Explicit knowledge of the transformation matrix $T$, i.e. knowledge of the relation between the operators $b$ and
$b'$, is not needed for our construction. However, for the sake of completeness, we mention that

\[
T = \begin{pmatrix} \cosh\theta_k & \sinh\theta_k \\ \sinh\theta_k & \cosh\theta_k \end{pmatrix}
\]

with $\tanh(2\theta_k) = -g_2/(2\pi v_F + g_4)$ represents a suitable parameterization.
termed charge density waves (CDW). The Bose creation operators describing these
excitations are, up to the Bogoliubov transformation and a momentum-dependent scaling factor
$(2\pi/Lq)^{1/2}$, equivalent to the density operators of the electron gas. For a non-interacting
system, $g_2 = g_4 = 0$, and the CDW propagates with the velocity of the free Fermi particles,
$v_F$. A fictitious interaction that does not couple particles of opposite Fermi momentum,
$g_2 = 0$, $g_4 \neq 0$, speeds up the CDW. Heuristically, this can be interpreted as an “acceleration
process” whereby a CDW pushes its own charge front. By contrast, interactions between
left and right movers, $g_2 \neq 0$, diminish the velocity, i.e. due to the Coulomb interaction it
is difficult for distortions of opposite velocities to penetrate each other. (Notice that, for
a theory with $g_2 = 0$, no Bogoliubov transformation would be needed to diagonalize the
Hamiltonian, i.e. in this case, undisturbed left- and right-moving waves would be the basic
excitations of the theory.)
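The dependence of the CDW velocity on the two coupling constants can be made concrete with a few numbers. The sketch below simply evaluates Eq. (2.43); the function name `cdw_velocity` and the coupling values are illustrative assumptions, chosen only to show the three cases discussed in the text (no interaction, $g_4$ only, $g_2$ only).

```python
import math

def cdw_velocity(v_f, g2, g4):
    """Charge density wave velocity v_rho of Eq. (2.43)."""
    return math.sqrt((2 * math.pi * v_f + g4) ** 2 - g2 ** 2) / (2 * math.pi)

v_f = 1.0
# (g2, g4) pairs: free system, forward scattering only, left-right coupling only, both.
for g2, g4 in [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0)]:
    ratio = cdw_velocity(v_f, g2, g4) / v_f
    print(f"g2 = {g2:.1f}, g4 = {g4:.1f}:  v_rho / v_F = {ratio:.4f}")
```

For $g_2 = 0$ the square root can be taken exactly, giving $v_\rho = v_F + g_4/2\pi$, so that a positive $g_4$ always increases the velocity, while any $g_2 \neq 0$ reduces it.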

Our discussion above neglected the spin carried by the conduction electrons. Had we 
included the electron spin, the following picture would have emerged (see Problem 2.4): the 
long-range dynamics of the electron gas is governed by two independently propagating wave 
modes, the charge density wave discussed above, and a spin density wave (SDW).²⁰ The
SDW carries a spin current, but is electrically neutral. As with the CDW, its dispersion
is linear with an interaction-renormalized velocity, $v_s$ (which, however, is generally larger
than the velocity $v_\rho$ of the CDW). To understand the consequences of this phenomenon,
imagine an electron had been thrown into the system (e.g. by attaching a tunnel contact
somewhere along the wire). As discussed above, a single electron does not represent a stable
excitation of the one-dimensional electron gas. What will happen is that the spectral weight
of the particle²¹ disintegrates into a collective charge excitation and a spin excitation. The
newly excited waves then propagate into the bulk of the system at different velocities $\pm v_\rho$
and $\pm v_s$. In other words, the charge and the spin of the electron effectively “disintegrate”
into two separate excitations, a phenomenon known as spin-charge separation. Spin-charge
separation in one-dimensional metals exemplifies a mechanism frequently observed
in condensed matter systems: the set of quantum numbers carried by elementary particles 
may get effectively absorbed by different excitation channels. One of the more spectacular 
manifestations of this effect is the appearance of fractionally charged excitations in quantum 
Hall systems, to be discussed in more detail in Chapter 9. 

The theory of spin and charge density waves in one-dimensional conductors has a long 
history spanning four decades. However, despite the rigor of the theory its experimental 
verification has proved excruciatingly difficult! While various experiments are consistent 
with theory (for a review, see Ref. 17), only recently have signatures of spin and charge
density wave excitations been experimentally observed. 


²⁰ One may think of the charge density of the electron gas $\rho = \rho_\uparrow + \rho_\downarrow$ as the sum of the densities of the spin up
and spin down populations, respectively. The local spin density is then given by $\rho_s = \rho_\uparrow - \rho_\downarrow$. After what has
been said above, it is perhaps not too surprising that fluctuations of these two quantities represent the dominant
excitations of the electron gas. What is surprising, though, is that these two excitations do not interact with
each other.

²¹ For a precise definition of this term, see Chapter 7.
Quantum spin chains


In the previous section, the emphasis was placed on charging effects generated by Coulomb 
interaction. However, as we have seen in Section 2.2, Coulomb interaction may also lead to 
the indirect generation of magnetic interactions. In one dimension, one can account for these 
mechanisms by adding to our previously structureless electrons a spin degree of freedom. 
This leads to the Tomonaga-Luttinger liquid, a system governed by the coexistence of
collective spin and charge excitations. However, to introduce the phenomena brought about 

by quantum magnetic correlations, 
it is best to first consider systems 
where the charge degrees of freedom 
are frozen and only spin excitations 
remain. Such systems are realized, 
for example, in Mott insulators 
where interaction between the spins 
of localized electrons is mediated by 
virtual exchange processes between 
neighboring electrons. One can 
describe these correlations through 
models of localized quantum spins - 
either in chains or, more generally, 
in higher-dimensional quantum spin 

lattices. We begin our discussion with the ferromagnetic spin chain. 


Werner Heisenberg 1901-76 

Nobel Laureate in Physics in 
1932 “for the creation of quan- 
tum mechanics, the application of 
which has, inter alia, led to the 
discovery of the allotropic forms 
of hydrogen." In 1927 he pub- 
lished his uncertainty principle, for 
which he is perhaps best known. He also made 
important contributions to the theories of hydrody- 
namics of turbulent flows, ferromagnetism, cosmic 
rays, and subatomic particles, and he was instrumen- 
tal in planning the first West German nuclear reac- 
tor at Karlsruhe, together with a research reactor in 
Munich, in 1957. (Image © The Nobel Foundation.) 



Quantum ferromagnet 

The quantum Heisenberg ferromagnet is specified by the Hamiltonian 

\[
H = -J \sum_{\langle mn \rangle} \mathbf{S}_m \cdot \mathbf{S}_n, \tag{2.45}
\]


where $J > 0$, $\mathbf{S}_m$ represents the quantum mechanical spin operator at lattice site $m$, and,
as before, $\langle mn \rangle$ denotes summation over neighboring sites. In Section 2.1 (see Eq. (2.13))
the quantum mechanical spin was represented through an electron basis. However, one can
conceive of situations where the spin sitting at site $m$ is carried by a different object (e.g. an
atom with non-vanishing magnetic moment). At any rate, for the purposes of our present
discussion, we need not specify the microscopic origin of the spin. All we need to know is
(i) that the lattice operators $S_m^i$ obey the SU(2) commutator algebra,

\[
[S_m^i, S_n^j] = i\delta_{mn}\epsilon^{ijk} S_n^k, \tag{2.46}
\]

characteristic of quantum spins, and (ii) the total spin at each lattice site is $S$.²²


²² Remember that the finite-dimensional representations of the spin operator are of dimension $2S + 1$ where $S$
may be integer or half integer. While a single electron has spin $S = 1/2$, the total magnetic moment of electrons
bound to an atom may be much larger.
Figure 2.10 Showing the spin configuration of an elementary spin-wave excitation from the spin
polarized ground state.


Now, due to the positivity of the coupling constant $J$, the Hamiltonian favors configurations
where the spins at neighboring sites are aligned in the same direction (cf. Fig. 2.10). A
ground state of the system is given by $|\Omega\rangle \equiv \otimes_m |S_m\rangle$, where $|S_m\rangle$ represents a state
with maximal spin-$z$ component: $S_m^z |S_m\rangle = S |S_m\rangle$. We have written “a” ground state
instead of “the” ground state because the system is highly degenerate: a simultaneous change
of the orientation of all spins does not change the ground state energy, i.e. the system
possesses a global rotation symmetry.

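EXERCISE Compute the energy expectation value of the state $|\Omega\rangle$. Defining global spin operators
through $S^i \equiv \sum_m S_m^i$, consider the state $|\boldsymbol{\alpha}\rangle \equiv \exp(i\boldsymbol{\alpha}\cdot\mathbf{S})|\Omega\rangle$. Verify that the state $|\boldsymbol{\alpha}\rangle$ is degenerate
with $|\Omega\rangle$. Explicitly compute the state $|(\pi/2, 0, 0)\rangle$. Convince yourself that, for general $\boldsymbol{\alpha}$, $|\boldsymbol{\alpha}\rangle$ can
be interpreted as a state with rotated quantization axis.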
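The degeneracy asserted in the exercise can be illustrated on a small system. The sketch below is restricted to assumptions that go beyond the text: spin $S = 1/2$, a ring of four sites, and $J = 1$. It uses the spin-1/2 identity $\mathbf{S}_m \cdot \mathbf{S}_n = P_{mn}/2 - 1/4$, where $P_{mn}$ exchanges the two spins, and compares the fully polarized state with the state rotated by $\boldsymbol{\alpha} = (\pi/2, 0, 0)$, for which each spin is in the state $(|\!\uparrow\rangle + i|\!\downarrow\rangle)/\sqrt{2}$. The function names are hypothetical.

```python
import math

def product_state(single, n_sites):
    """Product state with every site in `single` = (amp_up, amp_down)."""
    state = []
    for idx in range(2 ** n_sites):
        amp = 1.0 + 0.0j
        for m in range(n_sites):
            amp *= single[(idx >> m) & 1]      # bit 0 = up, bit 1 = down
        state.append(amp)
    return state

def heisenberg_energy(state, n_sites, J=1.0):
    """<state|H|state> for H = -J sum_<mn> S_m.S_n on a spin-1/2 ring."""
    energy = 0.0
    for m in range(n_sites):
        n = (m + 1) % n_sites
        for idx, amp in enumerate(state):
            bm, bn = (idx >> m) & 1, (idx >> n) & 1
            # P_mn exchanges the spins on sites m and n.
            swapped = idx if bm == bn else idx ^ ((1 << m) | (1 << n))
            # S_m.S_n = P_mn / 2 - 1/4 for spin 1/2.
            bond = 0.5 * state[swapped].conjugate() * amp - 0.25 * abs(amp) ** 2
            energy += -J * bond.real
    return energy

n_sites = 4
polarized = product_state((1.0, 0.0), n_sites)
rotated = product_state((1 / math.sqrt(2), 1j / math.sqrt(2)), n_sites)
print("E(polarized) =", heisenberg_energy(polarized, n_sites))
print("E(rotated)   =", heisenberg_energy(rotated, n_sites))
print("-J N S^2     =", -1.0 * n_sites * 0.25)
```

Both product states give the same energy, $-JNS^2$, as expected for states that differ only by a global rotation of the quantization axis.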

As with our previous examples, we expect that a global continuous symmetry will involve
the presence of energetically low-lying excitations. Indeed, it is obvious that, in the limit
of long wavelength $\lambda$, a weak distortion of a ground state configuration (see Fig. 2.10) will
cost vanishingly small energy. To quantitatively explore the physics of these spin waves,
we adopt a “semiclassical” picture, where the spin $S \gg 1$ is assumed to be large. In this
limit, the rotation of the spins around the ground state configuration becomes similar to
the rotation of a classical magnetic moment.

INFO To better understand the mechanism behind the semiclassical approximation, consider
the Heisenberg uncertainty relation, $\Delta S^i \Delta S^j \sim |\langle [S^i, S^j] \rangle| = \epsilon^{ijk} |\langle S^k \rangle|$, where $\Delta S^i$ is the
root mean square of the quantum uncertainty of spin component $i$. Using the fact that $|\langle S^k \rangle| \le S$,
we obtain for the relative uncertainty, $\Delta S^i / S$,

\[
\frac{\Delta S^i}{S} \frac{\Delta S^j}{S} \lesssim \frac{S}{S^2} \xrightarrow{\; S \gg 1 \;} 0.
\]

I.e., for $S \gg 1$, quantum fluctuations of the spin become less important.
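As a concrete instance of this estimate, one may evaluate the transverse fluctuations in the fully polarized state $|S, m = S\rangle$, where $\langle S^x \rangle = \langle S^y \rangle = 0$ and $\langle (S^x)^2 \rangle = \langle (S^y)^2 \rangle = \langle S^+ S^- \rangle / 4 = S/2$. The short sketch below (the choice of state and the function name are assumptions made for illustration) tabulates the resulting relative uncertainty $1/(2S)$.

```python
def relative_uncertainty(S):
    """(Delta S^x / S) * (Delta S^y / S) in the fully polarized state |S, m = S>."""
    # <S, S| S^+ S^- |S, S> = S(S + 1) - S(S - 1) = 2S, while <S^- S^+> = 0.
    ladder_norm = S * (S + 1) - S * (S - 1)
    variance = ladder_norm / 4          # <(S^x)^2> = <(S^y)^2>
    # Delta S^x * Delta S^y = sqrt(variance) * sqrt(variance) = variance.
    return variance / S ** 2

for S in (0.5, 1.0, 5.0, 50.0):
    print(f"S = {S:5.1f}:  relative uncertainty = {relative_uncertainty(S):.4f}")
```

The product of relative uncertainties falls off as $1/(2S)$, which is of order unity for $S = 1/2$ and becomes negligible only for large spin.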

In the limit of large spin S, and at low excitation energies, it is natural to describe the 
ordered phase in terms of small fluctuations of the spins around their expectation values 
(cf. the description of the ordered phase of a crystal in terms of small fluctuations of the 
atoms around the ordered lattice sites). These fluctuations are conveniently represented in 
Figure 2.11 Spin-wave spectrum of europium oxide as measured by inelastic neutron scattering at
a reference temperature of 5.5 K. Note that, at low values of momenta $q$, the dispersion is quadratic,
in agreement with the low-energy theory. (Exercise: A closer inspection of the data shows the
existence of a small gap in the spectrum at $q = 0$. To what may this gap be attributed?) Figure
reprinted with permission from L. Passell, O. W. Dietrich, and J. Als-Nielsen, Neutron scattering
from the Heisenberg ferromagnets EuO and EuS I: the exchange interaction, Phys. Rev. B 14
(1976), 4897-907. Copyright (1976) by the American Physical Society.


terms of spin raising and lowering operators: with $S_m^\pm \equiv S_m^x \pm iS_m^y$, it is straightforward to
verify that

\[
[S_m^z, S_n^\pm] = \pm\delta_{mn} S_m^\pm, \qquad [S_m^+, S_n^-] = 2\delta_{mn} S_m^z. \tag{2.47}
\]

Application of $S_m^{-(+)}$ lowers (raises) the $z$-component of the spin at site $m$ by one. To
actually make use of the fact that deviations around $|\Omega\rangle$ are small, a representation known
as the Holstein-Primakoff transformation²³ was introduced in which the spin operators
$S_m^-$, $S_m^+$ are specified in terms of bosonic creation and annihilation operators $a^\dagger$ and $a$:

\[
S_m^- = a_m^\dagger (2S - a_m^\dagger a_m)^{1/2}, \qquad S_m^+ = (2S - a_m^\dagger a_m)^{1/2} a_m, \qquad S_m^z = S - a_m^\dagger a_m.
\]


EXERCISE Confirm that the spin operators satisfy the commutation relations (2.47). 
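A numerical cross-check of the exercise, for a single site, can be sketched as follows. The boson Fock space is restricted to the occupations $n = 0, \ldots, 2S$, which is closed under the Holstein-Primakoff operators because the square-root factor annihilates the state $n = 2S$ before it could be raised further. The matrix conventions (entry `[row][col]` equals $\langle \mathrm{row} | \cdot | \mathrm{col} \rangle$) and all function names are assumptions of this sketch.

```python
import math

def hp_matrices(two_s):
    """Holstein-Primakoff matrices (S^+, S^-, S^z) on boson states n = 0..2S."""
    dim = two_s + 1
    s = two_s / 2
    s_plus = [[0.0] * dim for _ in range(dim)]
    s_z = [[0.0] * dim for _ in range(dim)]
    for n in range(dim):
        s_z[n][n] = s - n                                  # S^z = S - a^dagger a
        if n > 0:
            # S^+ |n> = (2S - a^dagger a)^{1/2} a |n> = sqrt(2S - (n - 1)) sqrt(n) |n - 1>
            s_plus[n - 1][n] = math.sqrt(two_s - (n - 1)) * math.sqrt(n)
    s_minus = [[s_plus[j][i] for j in range(dim)] for i in range(dim)]   # transpose
    return s_plus, s_minus, s_z

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]

def max_deviation(a, b, scale=1.0):
    """Largest entry of |a - scale * b|."""
    n = len(a)
    return max(abs(a[i][j] - scale * b[i][j]) for i in range(n) for j in range(n))

for two_s in (1, 2, 5):
    sp, sm, sz = hp_matrices(two_s)
    print(f"S = {two_s / 2}:",
          max_deviation(commutator(sz, sp), sp),           # [S^z, S^+] = +S^+
          max_deviation(commutator(sz, sm), sm, -1.0),     # [S^z, S^-] = -S^-
          max_deviation(commutator(sp, sm), sz, 2.0))      # [S^+, S^-] = 2 S^z
```

All three deviations should vanish up to rounding, for any value of $2S$.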

The utility of this representation is clear. When the spin is large, $S \gg 1$, an expansion
in powers of $1/S$ gives $S_m^z = S - a_m^\dagger a_m$, $S_m^- \simeq (2S)^{1/2} a_m^\dagger$, and $S_m^+ \simeq (2S)^{1/2} a_m$. In this


²³ T. Holstein and H. Primakoff, Field dependence of the intrinsic domain magnetisation of a ferromagnet, Phys.
Rev. 58 (1940), 1098-113.
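approximation, the one-dimensional Heisenberg Hamiltonian takes the form

\[
\begin{aligned}
H &= -J \sum_m \left\{ S_m^z S_{m+1}^z + \frac{1}{2}\left( S_m^+ S_{m+1}^- + S_m^- S_{m+1}^+ \right) \right\} \\
  &= -JNS^2 - JS \sum_m \left\{ -2 a_m^\dagger a_m + \left( a_m^\dagger a_{m+1} + \text{h.c.} \right) \right\} + O(S^0) \\
  &= -JNS^2 + JS \sum_m (a_{m+1}^\dagger - a_m^\dagger)(a_{m+1} - a_m) + O(S^0).
\end{aligned}
\]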


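Keeping fluctuations at leading order in $S$, the quadratic Hamiltonian can be diagonalized by
Fourier transformation. In this case, it is convenient to impose periodic boundary conditions:
$\mathbf{S}_{m+N} = \mathbf{S}_m$, and $a_{m+N} = a_m$, where $N$ denotes the total number of lattice sites. Defining

\[
a_k = \frac{1}{\sqrt{N}} \sum_{m=1}^{N} e^{ikm} a_m, \qquad a_m = \frac{1}{\sqrt{N}} \sum_k^{\mathrm{B.Z.}} e^{-ikm} a_k, \qquad [a_k, a_{k'}^\dagger] = \delta_{kk'},
\]

where the summation over $k$ runs over the Brillouin zone, the Hamiltonian for the
one-dimensional lattice system takes the form (exercise)

\[
H = -JNS^2 + \sum_k^{\mathrm{B.Z.}} \hbar\omega_k\, a_k^\dagger a_k + O(S^0). \tag{2.48}
\]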


Here $\hbar\omega_k = 2JS(1 - \cos k) = 4JS\sin^2(k/2)$ represents the dispersion relation of the spin
excitations. In particular, in the limit $k \to 0$, the energy of the elementary excitations
vanishes, $\hbar\omega_k \to JSk^2$. These massless low-energy excitations, known as magnons, describe the
elementary spin-wave excitations of the ferromagnet. Taking into account terms at higher
order in the parameter $1/S$, one finds interactions between the magnons. A comparison of
these theoretical predictions and experiment is shown in Fig. 2.11.
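The Fourier diagonalization can be verified directly on a finite ring. In the sketch below, the quadratic form $JS \sum_m (a_{m+1}^\dagger - a_m^\dagger)(a_{m+1} - a_m) = \sum_{mn} a_m^\dagger M_{mn} a_n$ is represented by the matrix $M_{mn} = JS(2\delta_{mn} - \delta_{m,n+1} - \delta_{m,n-1})$ with periodic boundary conditions, and plane waves with $k = 2\pi j/N$ are checked to be eigenvectors with eigenvalue $2JS(1 - \cos k)$. The ring size, the values of $J$ and $S$, and the function names are illustrative assumptions.

```python
import cmath
import math

def apply_quadratic_form(vec, J, S):
    """Apply M = J S (2 delta_mn - delta_{m,n+1} - delta_{m,n-1}) on a ring to vec."""
    n = len(vec)
    return [J * S * (2 * vec[m] - vec[(m + 1) % n] - vec[(m - 1) % n])
            for m in range(n)]

def magnon_energy(k, J, S):
    """Ferromagnetic magnon dispersion, hbar omega_k = 2 J S (1 - cos k)."""
    return 2 * J * S * (1 - math.cos(k))

def plane_wave_residual(n_sites, j, J=1.0, S=2.5):
    """Max |M phi - hbar omega_k phi| for the plane wave with k = 2 pi j / N."""
    k = 2 * math.pi * j / n_sites
    phi = [cmath.exp(-1j * k * m) / math.sqrt(n_sites) for m in range(n_sites)]
    m_phi = apply_quadratic_form(phi, J, S)
    e_k = magnon_energy(k, J, S)
    return max(abs(x - e_k * y) for x, y in zip(m_phi, phi))

n_sites = 12
print("largest residual:", max(plane_wave_residual(n_sites, j) for j in range(n_sites)))

# Long-wavelength limit: hbar omega_k / (J S k^2) approaches 1.
for k in (0.5, 0.1, 0.01):
    print(f"k = {k}:  ratio = {magnon_energy(k, 1.0, 2.5) / (2.5 * k * k):.6f}")
```

The residuals vanish up to rounding, and the printed ratios approach unity as $k \to 0$, in line with the quadratic low-energy dispersion.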


Quantum antiferromagnet 

Having explored the elementary excitation spectrum of the ferromagnet, we now turn to 
the discussion of the spin S Heisenberg antiferromagnetic Hamiltonian 


\[
H = J \sum_{\langle mn \rangle} \mathbf{S}_m \cdot \mathbf{S}_n,
\]


where, once again, J > 0. As we have seen above, such antiferromagnetic systems occur 
Figure 2.12 (a) Example of a two-dimensional bipartite lattice. (b) Example of a non-bipartite
lattice. Notice that, with the latter, no antiferromagnetic arrangement of the spins can be made
that recovers the maximum exchange energy from each and every bond.


in the arena of strongly correlated electron compounds. Although the Hamiltonian differs from
its ferromagnetic relative “only” by a change of sign, the differences in the physics are
drastic. Firstly, the phenomenology displayed by the antiferromagnetic Hamiltonian $H$
depends sensitively on the morphology of the underlying lattice. For a
bipartite lattice, i.e. one in which the neighbors of one sublattice A belong to the other
sublattice B (see Fig. 2.12(a)), the ground states of the Heisenberg antiferromagnet are
close²⁴ to a staggered spin configuration, known as a Néel state, where all neighboring
spins are antiparallel (see Fig. 2.12). Again the ground state is degenerate, i.e. a global
rotation of all spins by the same amount does not change the energy. By contrast, on
non-bipartite lattices such as the triangular lattice shown in Fig. 2.12(b), no spin arrangement
can be found wherein each bond recovers the full exchange energy $J$. Spin models of this
kind are said to be frustrated.

EXERCISE Engaging only symmetry considerations, try to identify a possible classical ground 
state of the triangular lattice Heisenberg antiferromagnet. (Hint: Construct the classical ground 
state of a three-site plaquette and then develop the periodic continuation.) Show that the classical 
antiferromagnetic ground state of the Kagome lattice - a periodic array of corner-sharing “stars 
of David” - has a continuous spin degeneracy generated by local spin rotations. How might the 
degeneracy affect the transition to an ordered phase? 


Louis Néel 1904-2000

Nobel Laureate in Physics in 1970, shared with Hannes Olof Gösta Alfvén, for his “fundamental
work and discoveries concerning antiferromagnetism and ferrimagnetism that have led to
important applications in solid state physics.” (Image © The Nobel Foundation.)



Returning to the one-dimensional system, we first note that a chain is trivially bipartite. 
As before, our strategy will be to expand the Hamiltonian in terms of bosonic operators. 
However, before doing so, it is convenient to apply a canonical transformation to the Hamil- 
tonian in which the spins on one sublattice, say B, are rotated through 180° about the 


²⁴ It is straightforward to verify that the classical ground state - the Néel state - is now not an exact eigenstate of
the quantum Hamiltonian. The true ground state exhibits zero-point fluctuations reminiscent of the quantum
harmonic oscillator or atomic chain. However, when $S \gg 1$, it serves as a useful reference state from which
fluctuations can be examined.
Figure 2.13 (a) Néel state configuration of the spin chain. (b) Cartoon of an antiferromagnetic
spin wave.


$x$-axis, i.e. $S_B^x \to \tilde{S}_B^x = S_B^x$, $S_B^y \to \tilde{S}_B^y = -S_B^y$, and $S_B^z \to \tilde{S}_B^z = -S_B^z$. That is, when
represented in terms of the new operators, the Néel ground state looks like a ferromagnetic state,
with all spins aligned. We expect that a gradual distortion of this state will produce the
antiferromagnetic analog of the spin waves discussed in the previous section (see Fig. 2.11).
Represented in terms of the transformed operators, the Hamiltonian takes the form

\[
H = -J \sum_m \left[ \tilde{S}_m^z \tilde{S}_{m+1}^z - \frac{1}{2}\left( \tilde{S}_m^+ \tilde{S}_{m+1}^+ + \tilde{S}_m^- \tilde{S}_{m+1}^- \right) \right].
\]

Once again, applying an expansion of the Holstein-Primakoff representation,
$\tilde{S}_m^- \simeq (2S)^{1/2} a_m^\dagger$, etc., one obtains the Hamiltonian

\[
H = -NJS^2 + JS \sum_m \left[ a_m^\dagger a_m + a_{m+1}^\dagger a_{m+1} + a_m a_{m+1} + a_m^\dagger a_{m+1}^\dagger \right] + O(S^0).
\]

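At first sight the structure of this Hamiltonian, albeit quadratic in the Bose operators, looks
awkward. However, after Fourier transformation, $a_m = N^{-1/2} \sum_k e^{-ikm} a_k$, it assumes the
more accessible form

\[
H = -NJS(S+1) + JS \sum_k \begin{pmatrix} a_k^\dagger & a_{-k} \end{pmatrix} \begin{pmatrix} 1 & \gamma_k \\ \gamma_k & 1 \end{pmatrix} \begin{pmatrix} a_k \\ a_{-k}^\dagger \end{pmatrix} + O(S^0),
\]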


where $\gamma_k = \cos k$. Apart from the definition of the matrix kernel between the Bose operators,
$H$ is equivalent to the Hamiltonian (2.41) discussed in connection with the charge
density wave. Performing the same steps as before, the non-particle-number-conserving
contributions $a^\dagger a^\dagger$ can be removed by Bogoliubov transformation. As a result, the transformed
Hamiltonian assumes the diagonal form

\[
H = -NJS(S+1) + 2JS \sum_k |\sin k| \left[ \alpha_k^\dagger \alpha_k + \frac{1}{2} \right]. \tag{2.49}
\]

Figure 2.14 Experimentally obtained spin-wave dispersion of the high-$T_c$ parent compound
La$_2$CuO$_4$ - a prominent spin 1/2 antiferromagnet. Reprinted with permission from R. Coldea,
S. M. Hayden, G. Aeppli, et al., Spin waves and electronic excitations in La$_2$CuO$_4$, Phys. Rev. Lett.
86 (2001), 5377-80. Copyright (2001) by the American Physical Society.


Thus, in contrast to the ferromagnet, the spin-wave excitations of the antiferromagnet
(Fig. 2.14) display a linear spectrum in the limit $k \to 0$. Surprisingly, although developed
in the limit of large spin, experiment shows that even for $S = 1/2$ spin chains, the integrity
of the linear dispersion is maintained (see Fig. 2.14).
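The contrast between the two spectra can be made explicit with a short calculation. For the kernel $K_k = \begin{pmatrix} 1 & \gamma_k \\ \gamma_k & 1 \end{pmatrix}$, the matrix $\sigma_3 K_k$ is traceless with determinant $\gamma_k^2 - 1$, so that its eigenvalues are $\pm(1 - \gamma_k^2)^{1/2} = \pm|\sin k|$; each Bogoliubov quantum then carries the energy $2JS|\sin k|$. The sketch below, with assumed values $J = S = 1$ and hypothetical function names, compares this with the ferromagnetic dispersion $2JS(1 - \cos k)$.

```python
import math

def afm_spin_wave_energy(k, J, S):
    """Antiferromagnetic spin-wave energy from the kernel [[1, g], [g, 1]], g = cos k."""
    gamma = math.cos(k)
    # sigma_3 K_k = [[1, gamma], [-gamma, -1]] has trace 0 and determinant
    # gamma^2 - 1, hence eigenvalues +/- sqrt(1 - gamma^2).
    eigenvalue = math.sqrt(max(0.0, 1.0 - gamma * gamma))
    return 2 * J * S * eigenvalue

def fm_magnon_energy(k, J, S):
    """Ferromagnetic magnon energy 2 J S (1 - cos k) for comparison."""
    return 2 * J * S * (1 - math.cos(k))

J, S = 1.0, 1.0
for k in (0.5, 0.1, 0.01):
    afm = afm_spin_wave_energy(k, J, S)
    fm = fm_magnon_energy(k, J, S)
    print(f"k = {k}:  AFM / (2JS k) = {afm / (2 * J * S * k):.5f},  "
          f"FM / (JS k^2) = {fm / (J * S * k * k):.5f}")
```

Both ratios tend to unity at small $k$: the antiferromagnetic branch vanishes linearly, with velocity $2JS$ in units of the lattice spacing, whereas the ferromagnetic branch vanishes quadratically.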

More generally, it turns out that, for chains of arbitrary half integer spin
$S = 1/2, 3/2, 5/2, \ldots$, the low-energy spectrum is linear, in agreement with the results of the
harmonic approximation. In contrast, for chains of integer spin $S = 1, 2, 3, \ldots$, the
low-energy spectrum contains a gap, i.e. these systems do not support long-range excitations.
As a rule, the sensitivity of a physical phenomenon to the characteristics of a sequence
of numbers - such as half integer vs. integer - signals the presence of a mechanism of
topological origin.²⁵ At the same time, the formation of a gap (observed for integer chains)
represents an interaction effect; at orders beyond the harmonic approximation, spin waves
begin to interact nonlinearly with each other, a mechanism that may ($S$ integer) but need
not ($S$ half integer) destroy the wave-like nature of low-energy excitations. In Section 9.3.3
- in a chapter devoted to a general discussion of the intriguing condensed matter phenomena
generated by the conspiracy of global (topological) structures with local interaction
mechanisms - we will discuss these phenomena on a deeper level.

²⁵ Specifically, the topological signature of a spin field configuration will turn out to be the number of times the
classical analog of a spin (a vector on the unit sphere) will wrap around the sphere in $(1 + 1)$-dimensional space
time.


2.3 Summary and outlook 

This concludes our preliminary discussion of applications of the second quantization. Addi- 
tional examples can be found in the problems. In this chapter, we have introduced second 
quantization as a tool whereby problems of many-body quantum mechanics can be addressed 
more efficiently than by the traditional language of symmetrized many-body wave functions. 
We have discussed how the two approaches are related to each other and how the standard 
operations of quantum mechanics can be performed by second quantized methods. 

One may note that, beyond qualitative discussions, the list of concrete applications 
encountered in this chapter involved problems that either were non-interacting from the 
outset, or could be reduced to a quadratic operator structure by a number of suitable 
manipulations. However, we carefully avoided dealing with interacting problems where no 
such reductions are possible - the majority by far of the problems encountered in condensed 
matter physics. What can be done in situations where interactions, i.e. operator contribu- 
tions of fourth or higher order, are present and no tricks like bosonization can be played? 
Generically, either interacting problems of many-body physics are fundamentally inaccessi- 
ble to perturbation theory, or they necessitate perturbative analyses of infinite order in the 
interaction contribution. Situations where a satisfactory result can be obtained by first- or 
second-order perturbation theory are exceptional. Within second quantization, large-order 
perturbative expansions in interaction operators lead to complex polynomials of creation 
and annihilation operators. Quantum expectation values taken over such structures can be 
computed by a reductive algorithm, known as Wick’s theorem. However, from a modern 
perspective, the formulation of perturbation theory in this way is not very efficient. More 
importantly, problems that are principally non-perturbative have emerged as the focus of 
interest. 

To understand the language of modern condensed matter physics, we thus need to develop 
another layer of theory, known as field integration. In essence, the latter is a concept 
generalizing the effective action approach of Chapter 1 to the quantum level. However, 
before discussing quantum field theory, we should understand how the concept works in 
principle, i.e. on the level of single particle quantum mechanics. This will be the subject of 
the next chapter. 


2.4 Problems 

Stone-von Neumann theorem 

In the text we introduced creation and annihilation operators in a constructive manner, i.e. by specifying 
their action on a fixed Fock space state. We saw that this definition implied remarkably simple algebraic 
relations between the newly introduced operators - the Heisenberg algebra (2.7). In this problem we 
explore the mathematical structure behind this observation. (The problem has been included for the 
benefit of the more mathematically inclined. Readers primarily interested in the practical aspects of
second quantization may safely skip it.) 

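Let us define an abstract algebra of objects $a_\lambda$ and $\tilde{a}_\lambda$ by

\[
[a_\lambda, \tilde{a}_\mu]_\zeta = \delta_{\lambda\mu}, \qquad [a_\lambda, a_\mu]_\zeta = [\tilde{a}_\lambda, \tilde{a}_\mu]_\zeta = 0.
\]

Further, let us assume that this algebra is unitarily represented in some vector space $\mathcal{F}$.
This means that (i) to every $a_\lambda$ and $\tilde{a}_\lambda$ we assign a linear map $T_{a_\lambda} : \mathcal{F} \to \mathcal{F}$ such that
(ii) $T_{[a_\lambda, \tilde{a}_\mu]_\zeta} = [T_{a_\lambda}, T_{\tilde{a}_\mu}]_\zeta$, and (iii) $T_{\tilde{a}_\lambda} = T_{a_\lambda}^\dagger$. To keep the notation simple, we will denote
$T_{a_\lambda}$ by $a_\lambda$ (now regarded as a linear map $\mathcal{F} \to \mathcal{F}$) and $T_{\tilde{a}_\lambda}$ by $a_\lambda^\dagger$.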

The Stone-von Neumann theorem states that the representation above is unique, i.e. that,
up to unitary transformations of basis, there is only one such representation. The statement 
is proven by explicit construction of a basis on which the operators act in a specific and well- 
defined way. We will see that this action is given by Eq. (2.6), i.e. the reference basis is but 
the Fock space basis used in the text. This proves that the Heisenberg algebra encapsulates 
the full mathematical structure of the formalism of second quantization. 

(a) We begin by noting that the operators $\hat{n}_\lambda \equiv a_\lambda^\dagger a_\lambda$ are Hermitian and commute with each
other, i.e. they can be simultaneously diagonalized. Let $|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle$ be an
orthonormalized eigenbasis of the operators $\{\hat{n}_\lambda\}$, i.e.
$\hat{n}_{\lambda_i} |n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle = n_{\lambda_i} |n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle$. Show
that, up to unit-modular factors, this basis is unique. (Hint: Use the irreducibility of
the transformation.)

(b) Show that $a_{\lambda_i} |n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle$ is an eigenstate of $\hat{n}_{\lambda_i}$ with eigenvalue $n_{\lambda_i} - 1$. Use this
information to show that all eigenvalues $n_{\lambda_i}$ are non-negative integers. (Hint: note positivity
of the scalar norm.) Show that the explicit representation of the basis is given by

\[
|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle = \prod_i \frac{(a_{\lambda_i}^\dagger)^{n_{\lambda_i}}}{\sqrt{n_{\lambda_i}!}}\, |0\rangle, \tag{2.50}
\]

where $|0\rangle$ is the unique state which has eigenvalue 0 for all $\hat{n}_i$. Comparison with Eq. (2.4)
shows that the basis constructed above indeed coincides with the Fock space basis
considered in the text.

Answer: 

(a) Suppose we had identified two bases $\{|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle\}$ and $\{|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle'\}$ on which
all operators $\hat{n}_i$ assumed equal eigenvalues. The irreducibility of the representation
implies the existence of some polynomial $P(\{a_{\mu_i}, a_{\mu_i}^\dagger\})$ such that
$|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle = P(\{a_{\mu_i}, a_{\mu_i}^\dagger\}) |n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle'$. Now, the action of $P$ must not change any of the
eigenvalues of $\hat{n}_i$, which means that $P$ contains the operators $a_\mu$ and $a_\mu^\dagger$ in equal numbers.
Reordering operators, we may thus bring $P$ into the form $P(\{a_{\mu_i}, a_{\mu_i}^\dagger\}) = \tilde{P}(\{\hat{n}_{\mu_i}\})$.
However, the action of this latter expression on $|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle'$ just produces a number,
i.e. the bases are equivalent.
(b) For a given state $|n\rangle$ (concentrating on a fixed element of the single-particle basis, we
suppress the subscript $\lambda_i$ throughout), let us choose an integer $q$ such that
$\hat{n} a^{q-1} |n\rangle = (n - q + 1) a^{q-1} |n\rangle$ with $n - q + 1 > 0$ while $n - q \le 0$. We then obtain

\[
0 \ge (n - q) \langle n | (a^\dagger)^q a^q | n \rangle = \langle n | (a^\dagger)^q\, \hat{n}\, a^q | n \rangle = \langle n | (a^\dagger)^{q+1} a^{q+1} | n \rangle \ge 0.
\]

The only way to satisfy this sequence of inequalities is to require that
$\langle n | (a^\dagger)^{q+1} a^{q+1} | n \rangle = 0$ and $n - q = 0$. The last equation implies the “integer-valuedness” of $n$. (In principle,
we ought to prove that a zero-eigenvalue state $|0\rangle$ exists. To show this, take any reference
state $|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle$ and apply operators $a_{\lambda_i}$ as long as it takes to lower all eigenvalues
$n_{\lambda_i}$ to zero.) Using the commutation relations, it is then straightforward to verify that
the r.h.s. of Eq. (2.50) (a) is unit normalized and (b) has eigenvalue $n_{\lambda_i}$ for each $\hat{n}_{\lambda_i}$.
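The two properties stated at the end of the answer, unit normalization and the eigenvalue of the number operator, are easily checked for a single bosonic mode. The sketch below is a consistency check rather than a proof: it presupposes the standard action $a^\dagger |n\rangle = \sqrt{n+1}\, |n+1\rangle$ and $a |n\rangle = \sqrt{n}\, |n-1\rangle$, and represents a state as a dictionary mapping occupation numbers to amplitudes. All function names are hypothetical.

```python
import math

def create(state):
    """a^dagger acting on a single-mode state {occupation: amplitude}."""
    return {n + 1: math.sqrt(n + 1) * amp for n, amp in state.items()}

def annihilate(state):
    """a acting on a single-mode state; the n = 0 component is annihilated."""
    return {n - 1: math.sqrt(n) * amp for n, amp in state.items() if n > 0}

def norm_sq(state):
    return sum(abs(amp) ** 2 for amp in state.values())

def fock_state(n):
    """(a^dagger)^n / sqrt(n!) applied to the vacuum, cf. Eq. (2.50)."""
    state = {0: 1.0}
    for _ in range(n):
        state = create(state)
    return {m: amp / math.sqrt(math.factorial(n)) for m, amp in state.items()}

def number_expectation(state):
    """<state| a^dagger a |state> / <state|state>."""
    return norm_sq(annihilate(state)) / norm_sq(state)

for n in range(5):
    psi = fock_state(n)
    print(n, norm_sq(psi), number_expectation(psi))

# Lowering a state with n = 3 yields a state with occupation n - 1 = 2.
print(number_expectation(annihilate(fock_state(3))))
```

For each $n$ the norm is one and the number expectation value equals $n$, while a single application of $a$ lowers the occupation by one, as used in part (b).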


Semiclassical spin waves 

In Chapter 1, the development of a theory of lattice vibrations in the harmonic atom chain was motivated 
by the quantization of the continuum classical theory. The latter provided insight into the nature of 
the elementary collective excitations. Here we will employ the semiclassical theory of spin dynamics to 
explore the nature of elementary spin-wave excitations. 

(a) Making use of the spin commutation relation, $[S_i^\alpha, S_j^\beta] = i\delta_{ij}\epsilon^{\alpha\beta\gamma} S_i^\gamma$, apply the operator
identity $i\dot{\mathbf{S}}_i = [\mathbf{S}_i, H]$ to express the equation of motion of a spin in a nearest neighbor
spin-$S$ one-dimensional Heisenberg ferromagnet as a difference equation (N.B. $\hbar = 1$).

(b) Interpreting the spins as classical vectors, and taking the continuum limit, show that
the equation of motion of the hydrodynamic modes takes the form $\dot{\mathbf{S}} = J\,\mathbf{S} \times \partial^2 \mathbf{S}$ where
we have assumed a unit lattice spacing. (Hint: In taking the continuum limit, apply a
Taylor expansion to the spins, i.e. $\mathbf{S}_{i+1} = \mathbf{S}_i + \partial \mathbf{S}_i + \cdots$.) Find and sketch a wave-like
solution describing small angle precession around a globally magnetized state $\mathbf{S}_i = S\mathbf{e}_z$
(i.e. a solution as shown in Fig. 2.10).

Answer: 


(a) Making use of the equation of motion, and the commutation relation, substitution of
the Heisenberg ferromagnetic Hamiltonian gives the difference equation

\[
\dot{\mathbf{S}}_i = J\, \mathbf{S}_i \times (\mathbf{S}_{i+1} + \mathbf{S}_{i-1}).
\]

(b) Interpreting the spins as classical vectors, and applying the Taylor expansion
$\mathbf{S}_{i+1} \mapsto \mathbf{S}(x + 1) = \mathbf{S} + \partial\mathbf{S} + \partial^2\mathbf{S}/2 + \cdots$, one obtains the classical equation of motion shown.
Making the ansatz $\mathbf{S} = (c\cos(kx - \omega t), c\sin(kx - \omega t), \sqrt{S^2 - c^2})$ one may confirm that
the equation of motion is satisfied if $\omega = Jk^2\sqrt{S^2 - c^2}$, which reduces to $\omega \simeq JSk^2$ for small
precession amplitude $c \ll S$.
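The precessing-wave ansatz can also be tested directly on the lattice, before the continuum limit is taken. Inserting $\mathbf{S}_i = (c\cos(ki - \omega t), c\sin(ki - \omega t), S_z)$ with $S_z = \sqrt{S^2 - c^2}$ into the difference equation of part (a) gives $\omega = 2JS_z(1 - \cos k)$, which tends to $JS_z k^2$ at long wavelengths. The sketch below evaluates the residual of the difference equation for this frequency; the parameter values and function names are illustrative assumptions.

```python
import math

def spin(i, t, k, omega, c, S):
    """Precessing-wave ansatz for the classical spin at site i and time t."""
    phase = k * i - omega * t
    return (c * math.cos(phase), c * math.sin(phase), math.sqrt(S * S - c * c))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def lattice_frequency(k, c, S, J):
    """Frequency solving dS_i/dt = J S_i x (S_{i+1} + S_{i-1}) for the ansatz."""
    return 2 * J * math.sqrt(S * S - c * c) * (1 - math.cos(k))

def residual(i, t, k, c, S, J):
    """Max component of dS_i/dt - J S_i x (S_{i+1} + S_{i-1})."""
    omega = lattice_frequency(k, c, S, J)
    phase = k * i - omega * t
    # Exact time derivative of the ansatz.
    lhs = (omega * c * math.sin(phase), -omega * c * math.cos(phase), 0.0)
    neighbors = tuple(x + y for x, y in zip(spin(i + 1, t, k, omega, c, S),
                                            spin(i - 1, t, k, omega, c, S)))
    rhs = tuple(J * x for x in cross(spin(i, t, k, omega, c, S), neighbors))
    return max(abs(l - r) for l, r in zip(lhs, rhs))

J, S, c, k = 1.0, 1.0, 0.2, 0.7
print("largest residual:",
      max(residual(i, t, k, c, S, J) for i in range(5) for t in (0.0, 0.3, 1.7)))
print("omega / (J S_z k^2) at k = 0.01:",
      lattice_frequency(0.01, c, S, J) / (J * math.sqrt(S * S - c * c) * 1e-4))
```

The residual vanishes up to rounding for any amplitude $c < S$, and the printed ratio approaches unity, in agreement with the continuum result and, for $c \ll S$, with the magnon dispersion $2JS(1 - \cos k)$.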
Figure 2.15 (a) An sp$^2$-hybridized polymer chain. (b) One of the configurations of the Peierls
distorted chain. The double bonds represent the short links of the lattice. (c) A topological defect
separating two domains of the ordered phase.

Su-Schrieffer-Heeger model of a conducting polymer chain

Polyacetylene consists of bonded CH groups forming an isomeric long-chain polymer. According to
molecular orbital theory, the carbon atoms are expected to be sp$^2$-hybridized suggesting a planar
configuration of the molecule. An unpaired electron is expected to occupy a single $\pi$-orbital which is oriented
perpendicular to the plane. The weak overlap of the $\pi$-orbitals delocalizes the electrons into a narrow
conduction band. According to the nearly free electron theory, one might expect the half-filled
conduction band of a polyacetylene chain to be metallic. However, the energy of a half-filled band of a
one-dimensional system can always be lowered by imposing a periodic lattice distortion known as a
Peierls instability (see Fig. 2.15). The aim of this problem is to explore the instability.

(a) At its simplest level, the conduction band of polyacetylene can be modeled by a simple
(arguably over-simplified) microscopic Hamiltonian, due to Su, Schrieffer, and Heeger,²⁶
in which the hopping matrix elements of the electrons are modulated by the lattice
distortion of the atoms. By taking the displacement of the atomic sites to be $u_n$, and
treating their dynamics as classical, the effective Hamiltonian can be cast in the form

\[
H = -t \sum_{n=1}^{N} (1 + u_n) \left[ c_{n\sigma}^\dagger c_{n+1\sigma} + \text{h.c.} \right] + \sum_{n=1}^{N} \frac{k_s}{2} (u_{n+1} - u_n)^2,
\]

where, for simplicity, the boundary conditions are taken to be periodic, and summation
over the spins $\sigma$ is assumed. The first term describes the hopping of electrons between
neighboring sites with a matrix element modulated by the periodic distortion of the
bond-length, while the last term represents the associated increase in the elastic energy.
Taking the lattice distortion to be periodic, $u_n = (-1)^n \alpha$, and the number of sites to
be even, bring the Hamiltonian to diagonal form. (Hint: Note that the lattice distortion
lowers the symmetry of the lattice. The Hamiltonian is most easily diagonalized by
distinguishing the two sites of the sublattice - i.e. doubling the size of the elementary
unit cell.) Show that the Peierls distortion of the lattice opens a gap in the spectrum at
the Fermi level of the half-filled system.

(b) By estimating the total electronic and elastic energy of the half-filled band (i.e. an
average of one electron per lattice site), show that the one-dimensional system is always
unstable towards the Peierls distortion. To complete this calculation, you will need
the approximate formula for the (elliptic) integral,
$\int_{-\pi/2}^{\pi/2} dk\,\left(1 - (1 - \alpha^2)\sin^2 k\right)^{1/2} \simeq 2 + (a_1 - b_1\ln\alpha^2)\alpha^2 + O(\alpha^2\ln\alpha^2)$,
where $a_1$ and $b_1$ are (unspecified) numerical constants.

(c) For an even number of sites, the Peierls instability has two degenerate configurations
(see Fig. 2.15(a)) - ABABAB... and BABABA... Comment on the qualitative form of
the ground state lattice configuration if the number of sites is odd (see Fig. 2.15(b)).
Explain why such configurations give rise to mid-gap states.

26 W. P. Su, J. R. Schrieffer, and A. J. Heeger, Solitons in polyacetylene, Phys. Rev. Lett. 42 (1979), 1698-701.

Answer: 


(a) Since each unit cell is of twice the dimension of the original lattice, we begin by recasting
the Hamiltonian in a sublattice form

$$H = -t\sum_{m=1,\sigma}^{N/2}\left[(1 + \alpha)\left(a^\dagger_{m\sigma}b_{m\sigma} + \mathrm{h.c.}\right) + (1 - \alpha)\left(b^\dagger_{m\sigma}a_{m+1\sigma} + \mathrm{h.c.}\right)\right] + 2Nk_s\alpha^2,$$

where the creation operators $a^\dagger_m$ and $b^\dagger_m$ act on the two sites of the elemental unit cell of
the distorted lattice. Switching to the Fourier basis, $a_m = \sqrt{2/N}\sum_k e^{2ikm}a_k$ (similarly
$b_m$), where $k$ takes $N/2$ values uniformly on the interval $[-\pi/2, \pi/2]$ and the lattice
spacing of the undistorted system is taken to be unity, the Hamiltonian takes the form

$$H = 2Nk_s\alpha^2 - t\sum_{k\sigma}\left(a^\dagger_{k\sigma},\, b^\dagger_{k\sigma}\right)\begin{pmatrix} 0 & (1 + \alpha) + (1 - \alpha)e^{-2ik} \\ (1 + \alpha) + (1 - \alpha)e^{2ik} & 0 \end{pmatrix}\begin{pmatrix} a_{k\sigma} \\ b_{k\sigma} \end{pmatrix}.$$

Diagonalizing the 2x2 matrix, one obtains $\epsilon(k) = \pm 2t\left[1 + (\alpha^2 - 1)\sin^2 k\right]^{1/2}$.
Reassuringly, in the limit $\alpha \to 0$, one recovers the cosine spectrum characteristic of the
undistorted tight-binding problem while, in the limit $\alpha \to 1$, pairs of monomers become
decoupled and we obtain a massively degenerate bonding and antibonding spectrum.

(b) According to the formula given in the text, the total shift in energy is given by
$\delta\epsilon = -4t(a_1 - b_1\ln\alpha^2)\alpha^2 + 2k_s\alpha^2$. Maximizing the energy gain with respect to $\alpha$, one finds
that the stable configuration is found when $\alpha^2 = \exp\left[\frac{a_1}{b_1} - 1 - \frac{k_s}{2tb_1}\right]$.

(c) If the number of sites is odd, the Peierls distortion is inevitably frustrated. The result is 
that the polymer chain must accommodate a topological excitation. The excitation 
is said to be topological because the defect cannot be removed by a smooth continu- 
ous deformation. Its effect on the spectrum of the model is to introduce a state that 
lies within the band gap of the material. The consideration of an odd number of sites 
forces a topological defect into the system. However, even if the number of sites is even, 
one can create low energy topological excitations of the system either by doping (see 
Fig. 2.15(b)), or by the creation of excitons, particle-hole excitations of the system. 
Indeed, such topological excitations can dominate the transport properties of the system. 
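
The dispersion quoted in answer (a) can be cross-checked numerically. The following is a minimal sketch, not part of the original solution: it builds the single-particle hopping matrix of the distorted chain with periodic boundaries and compares its eigenvalues with $\pm 2t[1 + (\alpha^2 - 1)\sin^2 k]^{1/2}$. The function names, the chain length, and the values of $t$ and $\alpha$ are illustrative assumptions, and sites are labeled from 0 rather than 1, which only exchanges the roles of short and long bonds.

```python
import numpy as np


def ssh_levels_real_space(n_sites, t=1.0, alpha=0.2):
    """Single-particle levels of the distorted chain with periodic boundaries."""
    if n_sites < 4 or n_sites % 2:
        raise ValueError("n_sites must be even and at least 4")
    h = np.zeros((n_sites, n_sites))
    for n in range(n_sites):
        # Bond n -> n+1 carries hopping -t (1 + u_n) with u_n = (-1)^n alpha.
        amplitude = -t * (1.0 + (-1) ** n * alpha)
        m = (n + 1) % n_sites
        h[n, m] = amplitude
        h[m, n] = amplitude
    return np.linalg.eigvalsh(h)  # sorted in ascending order


def ssh_levels_analytic(n_sites, t=1.0, alpha=0.2):
    """Levels from e(k) = +-2t [1 + (alpha^2 - 1) sin^2 k]^(1/2)."""
    # N/2 allowed momenta with spacing 2 pi / N; e(k) is periodic under k -> k + pi.
    k = 2.0 * np.pi * np.arange(n_sites // 2) / n_sites
    e = 2.0 * t * np.sqrt(1.0 + (alpha**2 - 1.0) * np.sin(k) ** 2)
    return np.sort(np.concatenate([-e, e]))


if __name__ == "__main__":
    n_sites, t, alpha = 16, 1.0, 0.2  # illustrative values only
    numeric = ssh_levels_real_space(n_sites, t, alpha)
    analytic = ssh_levels_analytic(n_sites, t, alpha)
    print("max deviation:", np.max(np.abs(numeric - analytic)))
    # Gap between the highest filled and lowest empty level at half filling.
    gap = numeric[n_sites // 2] - numeric[n_sites // 2 - 1]
    print("gap:", gap, "compared with 4 t alpha =", 4 * t * alpha)
```

For chain lengths that are a multiple of four, the momentum $k = \pi/2$ is among the allowed values, so the gap at half filling comes out as $4t\alpha$, in line with the statement that the distortion opens a gap at the Fermi level.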
Schwinger boson representation


As with the Holstein-Primakoff representation, the Schwinger boson provides another representation of 
quantum mechanical spin. The aim of this problem is to confirm the validity of the representation. For 
practical purposes, the value of the particular representation depends on its application. 

In the Schwinger boson representation, the quantum mechanical spin is expressed in terms of two
bosonic operators $a$ and $b$ in the form

$$S^+ = a^\dagger b, \qquad S^- = (S^+)^\dagger, \qquad S^z = \frac{1}{2}\left(a^\dagger a - b^\dagger b\right).$$

Julian Schwinger 1918-1994
Nobel Laureate in Physics with Sin-Itiro Tomonaga and Richard P. Feynman, for their fundamental
work in quantum electrodynamics, with far-reaching consequences for the physics of elementary
particles. (Image © The Nobel Foundation.)



(a) Show that this definition is consistent with the commutation relations for spin:
$[S^+, S^-] = 2S^z$.

(b) Using the bosonic commutation relations, show that

$$|S, m\rangle = \frac{(a^\dagger)^{S+m}}{\sqrt{(S + m)!}}\,\frac{(b^\dagger)^{S-m}}{\sqrt{(S - m)!}}\,|\Omega\rangle$$

is compatible with the definition of an eigenstate of the total spin operator $\mathbf{S}^2$ and $S^z$.
Here $|\Omega\rangle$ denotes the vacuum of the Schwinger bosons, and the total spin $S$ defines the
physical subspace $\{|n_a, n_b\rangle \,|\, n_a + n_b = 2S\}$.


Answer: 


(a) Using the commutation relation for bosons, one finds $[S^+, S^-] = a^\dagger b\,b^\dagger a - b^\dagger a\,a^\dagger b =
a^\dagger a - b^\dagger b = 2S^z$, as required.

(b) Using the identity $\mathbf{S}^2 = (S^z)^2 + \frac{1}{2}(S^+S^- + S^-S^+) = \frac{1}{4}(n_a - n_b)^2 + n_an_b + \frac{1}{2}(n_a + n_b)$ one
finds that $\mathbf{S}^2|S, m\rangle = [m^2 + (S + m)(S - m) + S]\,|S, m\rangle = S(S + 1)|S, m\rangle$, as required.
Similarly, one finds $S^z|S, m\rangle = \frac{1}{2}(n_a - n_b)|n_a = S + m, n_b = S - m\rangle = m|S, m\rangle$, showing
$|S, m\rangle$ to be an eigenstate of the operator $S^z$ with eigenvalue $m$.
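
As a numerical illustration of both parts of the answer, the sketch below represents the two boson modes on truncated Fock spaces and restricts the resulting operators to the physical subspace $n_a + n_b = 2S$. It is a hedged check rather than a proof; the function name `schwinger_spin_operators` and the cutoff of $2S$ bosons per mode are assumptions made here for illustration.

```python
import numpy as np


def schwinger_spin_operators(two_s):
    """Return (S+, S-, Sz) restricted to the subspace n_a + n_b = 2S."""
    dim = two_s + 1  # each mode holds 0..2S bosons, enough for the physical subspace
    lower = np.diag(np.sqrt(np.arange(1, dim)), k=1)  # single-mode annihilation operator
    eye = np.eye(dim)
    a = np.kron(lower, eye)
    b = np.kron(eye, lower)
    n_a = a.T @ a
    n_b = b.T @ b
    # Indices of the Fock states |n_a, n_b> with n_a + n_b = 2S.
    total = np.rint(np.diag(n_a + n_b)).astype(int)
    keep = np.flatnonzero(total == two_s)

    def restrict(op):
        return op[np.ix_(keep, keep)]

    # S+ = a^dagger b, S- = b^dagger a, Sz = (n_a - n_b) / 2.
    return restrict(a.T @ b), restrict(b.T @ a), restrict(0.5 * (n_a - n_b))


if __name__ == "__main__":
    two_s = 3  # spin S = 3/2, an arbitrary illustrative choice
    s = two_s / 2
    s_plus, s_minus, s_z = schwinger_spin_operators(two_s)
    commutator = s_plus @ s_minus - s_minus @ s_plus
    casimir = s_z @ s_z + 0.5 * (s_plus @ s_minus + s_minus @ s_plus)
    print("[S+, S-] = 2 Sz:", np.allclose(commutator, 2 * s_z))
    print("S^2 = S(S+1):", np.allclose(casimir, s * (s + 1) * np.eye(two_s + 1)))
```

Because $S^\pm$ conserve $n_a + n_b$, the restriction to the $(2S+1)$-dimensional subspace is not affected by the truncation of the individual Fock spaces.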


Jordan-Wigner transformation 

So far we have shown how the algebra of quantum mechanical spin can be expressed using boson
operators - cf. the Holstein-Primakoff transformation and the Schwinger boson representation. In this
problem we show that a representation for spin-(1/2) can be obtained in terms of fermion operators.

Let us represent an up spin as a particle and a down spin as the vacuum, i.e. $|\uparrow\rangle \equiv |1\rangle =
f^\dagger|0\rangle$, $|\downarrow\rangle \equiv |0\rangle = f|1\rangle$. In this representation the spin raising and lowering operators are
expressed in the forms $S^+ = f^\dagger$ and $S^- = f$, while $S^z = f^\dagger f - 1/2$.






(a) With this definition, confirm that the spins obey the algebra $[S^+, S^-] = 2S^z$.

However, there is a problem: spins on different sites commute while fermion operators
anticommute, e.g. $S_i^+S_j^+ = S_j^+S_i^+$ but $f_i^\dagger f_j^\dagger = -f_j^\dagger f_i^\dagger$. To obtain a faithful spin
representation, it is necessary to cancel this unwanted sign. Although a general procedure is
hard to formulate, in one dimension this can be achieved by a nonlinear transformation,
namely

$$S_l^+ = f_l^\dagger e^{i\pi\sum_{j<l}n_j}, \qquad S_l^- = e^{-i\pi\sum_{j<l}n_j}f_l, \qquad S_l^z = f_l^\dagger f_l - \frac{1}{2}.$$

Operationally, this seemingly complicated transformation is straightforward: in one
dimension, the particles can be ordered on the line. By counting the number of particles
“to the left” we can assign an overall sign of +1 or -1 to a given configuration
and thereby transmute the particles into fermions. (Put differently, the exchange of
two fermions induces a sign change that is compensated by the factor arising from the
phase - the “Jordan-Wigner string.”)

(b) Using the Jordan-Wigner representation, show that $S_m^+S_{m+1}^- = f_m^\dagger f_{m+1}$.

(c) For the spin-(1/2) anisotropic quantum Heisenberg spin chain, the spin Hamiltonian
assumes the form $H = -\sum_n\left[J_zS_n^zS_{n+1}^z + \frac{J_\perp}{2}\left(S_n^+S_{n+1}^- + S_n^-S_{n+1}^+\right)\right]$. Turning to the
Jordan-Wigner representation, show that the Hamiltonian can be cast in the form

$$H = -\sum_n\left[\frac{J_\perp}{2}\left(f_n^\dagger f_{n+1} + \mathrm{h.c.}\right) + J_z\left(\frac{1}{4} - f_n^\dagger f_n + f_n^\dagger f_nf_{n+1}^\dagger f_{n+1}\right)\right].$$

(d) The mapping above shows that the one-dimensional quantum spin-(1/2) XY-model (i.e.
$J_z = 0$) can be diagonalized as a non-interacting theory of spinless fermions. In this
case, show that the spectrum assumes the form $\epsilon(k) = -J_\perp\cos ka$.

Answer: 


(a) Using the fermionic anti-commutation relations, one finds $[S^+, S^-] = [f^\dagger, f] = f^\dagger f -
ff^\dagger = 2f^\dagger f - 1 = 2S^z$.

(b) Using the fact that the number operators on different sites commute, one finds
$S_m^+S_{m+1}^- = f_m^\dagger e^{i\pi\sum_{j<m}n_j}e^{-i\pi\sum_{j<m+1}n_j}f_{m+1} = f_m^\dagger e^{-i\pi n_m}f_{m+1} = f_m^\dagger f_{m+1}$, where we
have made use of the fact that, for fermionic particles, $f_m^\dagger e^{-i\pi n_m} = f_m^\dagger$.

(c) The fermion representation is simply obtained by substitution.

(d) With $J_z = 0$, the spin Hamiltonian assumes the form of a non-interacting tight-binding
Hamiltonian $H = -\frac{J_\perp}{2}\sum_n(f_n^\dagger f_{n+1} + \mathrm{h.c.})$. This Hamiltonian, which has been encountered
previously, is diagonalized in the Fourier space, after which one obtains the cosine band
dispersion.
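
The statement of part (d) can be illustrated on a small chain. The sketch below is a hedged numerical check with assumed function names: it diagonalizes the XY spin Hamiltonian ($J_z = 0$) on an open chain and compares the full many-body spectrum with the energies obtained by filling free-fermion levels of the corresponding hopping matrix. Open boundaries are an assumption made here so that no boundary term from the Jordan-Wigner string has to be considered.

```python
import numpy as np


def xy_chain_spin_spectrum(n_sites, j_perp=1.0):
    """Many-body spectrum of H = -(J/2) sum_n (S+_n S-_{n+1} + h.c.), open chain."""
    s_plus = np.array([[0.0, 1.0], [0.0, 0.0]])  # basis order (up, down)
    s_minus = s_plus.T
    identity = np.eye(2)

    def at_site(op, site):
        # Embed a single-site operator into the 2^N-dimensional spin space.
        full = np.eye(1)
        for s in range(n_sites):
            full = np.kron(full, op if s == site else identity)
        return full

    dim = 2**n_sites
    h = np.zeros((dim, dim))
    for n in range(n_sites - 1):
        flip = at_site(s_plus, n) @ at_site(s_minus, n + 1)
        h -= 0.5 * j_perp * (flip + flip.T)
    return np.linalg.eigvalsh(h)


def xy_chain_fermion_spectrum(n_sites, j_perp=1.0):
    """Same spectrum from free fermions: fill single-particle levels in all ways."""
    hop = np.zeros((n_sites, n_sites))
    for n in range(n_sites - 1):
        hop[n, n + 1] = hop[n + 1, n] = -0.5 * j_perp
    levels = np.linalg.eigvalsh(hop)
    energies = []
    for mask in range(2**n_sites):
        occupation = [(mask >> i) & 1 for i in range(n_sites)]
        energies.append(float(np.dot(occupation, levels)))
    return np.sort(np.array(energies))


if __name__ == "__main__":
    n_sites = 6  # small enough for exact diagonalization of the 64 x 64 spin problem
    spins = xy_chain_spin_spectrum(n_sites)
    fermions = xy_chain_fermion_spectrum(n_sites)
    print("spectra agree:", np.allclose(spins, fermions))
```

For the open chain the single-particle levels are $-J_\perp\cos(\pi j/(N+1))$ with $j = 1, \ldots, N$, the finite-size counterpart of the cosine band quoted in the answer.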
Spin-charge separation in one dimension


In Section 2.2 a free theory of interacting spinless fermions was developed in one dimension making 
use of the bosonization formalism. This analysis showed that the low-energy degrees of freedom were 
described by hydrodynamic charge (i.e. density) fluctuations that propagated with a linear dispersion. 
However, as well as charge, the electron degrees of freedom carry spin. The aim of this problem is to 
explore the fate of the spin degrees of freedom in a one-dimensional environment. 

As a first step, we introduce operators (cf. Eq. (2.35)) $\rho_{sq\alpha} \equiv \sum_k a^\dagger_{s(k+q)\alpha}a_{sk\alpha}$, $\alpha = \uparrow, \downarrow$,
generalizing the previously introduced density operators for the presence of spin. Similarly,
the bosonic degrees of freedom of the theory (cf. Eq. (2.40)) now carry a spin index, $b_q \to b_{q\alpha}$.
One aspect that makes the problem more difficult to tackle than the previously explored 
spinless case is that the 2fcF-momentum transfer interaction \kp + q + &f + q — ■ i 

) — > | — fcp + q + <? 2 ) ti — + Q ~ f? 2 ) 4-) i n which a right-moving spin up electron is scattered 

to a left-moving spin up electron cannot be expressed in terms of slowly fluctuating density 
operators. (If you don’t believe this, try!) However, using the renormalisation group methods 
introduced in Chapter 8, it can be shown that this type of interaction is physically largely 
irrelevant and can be neglected from the outset. 

Concentrating on the low-momentum-transfer interaction, the effective bosonic Hamiltonian
assumes the form (exercise)

$$H = \sum_{q>0,s,\alpha}v_Fq\,b^\dagger_{sq\alpha}b_{sq\alpha} + \sum_{q>0,s,\alpha,\alpha'}|q|\left[\frac{g_2}{4\pi}\left(b^\dagger_{sq\alpha}b^\dagger_{\bar{s}q\alpha'} + \mathrm{h.c.}\right) + \frac{g_4}{2\pi}b^\dagger_{sq\alpha}b_{sq\alpha'}\right].$$

Introducing operators that create charge ($\rho$) and spin ($\sigma$) fluctuations, $b_{sq\rho} = \frac{1}{\sqrt{2}}(b_{sq\uparrow} +
b_{sq\downarrow})$, $b_{sq\sigma} = \frac{1}{\sqrt{2}}(b_{sq\uparrow} - b_{sq\downarrow})$, rearrange the Hamiltonian, and thereby show that it assumes
a diagonal form with the spin and charge degrees of freedom exhibiting different velocities.
This is a manifestation of spin-charge separation: even without the introduction of spin-dependent
forces, the spin and charge degrees of freedom of the electron in the metallic
conductor separate and propagate at different velocities. In this sense, there is no way
to adiabatically continue from non-interacting electrons to the collective charge and spin
excitations of the system.

Despite the “fragility” of the electron, and the apparent ubiquity of this phenomenon,
the observation of spin-charge separation in one-dimensional conductors has presented
a significant challenge to experimentalists. The reason is subtle. The completion
of an electrical circuit necessarily requires contact of the quantum wire with bulk leads. 
The leads involve a reservoir of electrons with conventional Fermi-liquid character. Electri- 
cal transport requires the recombination of the collective charge (holon) and spin (spinon) 
degrees of freedom at the contact to reconstitute physical electrons. It is an exasperating 
fact that this reconstitution of the physical electron “masks” the character of spin-charge 
separation. Instead, the phenomenon of spin-charge separation has been inferred indirectly 
through spectroscopic techniques. 

Answer: 

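Motivated by the separation into spin and charge degrees of freedom, a rearrangement of
the Hamiltonian gives

$$H = \sum_{q>0,s}\left[v_Fq\left(b^\dagger_{sq\rho}b_{sq\rho} + b^\dagger_{sq\sigma}b_{sq\sigma}\right) + |q|\left[\frac{g_2}{2\pi}\left(b^\dagger_{sq\rho}b^\dagger_{\bar{s}q\rho} + \mathrm{h.c.}\right) + \frac{g_4}{\pi}b^\dagger_{sq\rho}b_{sq\rho}\right]\right].$$

Once again, applying a Bogoliubov transformation, the Hamiltonian is brought to the diagonal
form

$$H = \sum_{q>0,s}\left[|q|\left[(v_F + g_4/\pi)^2 - (g_2/\pi)^2\right]^{1/2}\alpha^\dagger_{sq\rho}\alpha_{sq\rho} + |q|v_F\,\alpha^\dagger_{sq\sigma}\alpha_{sq\sigma}\right] + \mathrm{const.}$$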
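
The Bogoliubov step can be mirrored numerically for a single momentum. In the sketch below, which is an illustration under stated assumptions rather than the book's derivation, the charge sector at fixed $q$ is written as $A(b^\dagger_+b_+ + b^\dagger_-b_-) + C(b^\dagger_+b^\dagger_- + \mathrm{h.c.})$ with $A = q(v_F + g_4/\pi)$ and $C = qg_2/\pi$, and the excitation energy is read off from the eigenvalues of the bosonic dynamical matrix $\eta M$. The function name and the numerical values of $v_F$, $g_2$, $g_4$ are hypothetical.

```python
import numpy as np


def mode_velocities(v_f, g2, g4):
    """Charge and spin velocities from the bosonic Bogoliubov problem at q = 1."""
    a = v_f + g4 / np.pi  # coefficient of b^dagger_{s rho} b_{s rho}
    c = g2 / np.pi        # coefficient of (b^dagger_{+ rho} b^dagger_{- rho} + h.c.)
    if abs(c) >= abs(a):
        raise ValueError("unstable: pairing term exceeds the diagonal term")
    m = np.array([[a, c], [c, a]])   # matrix in the basis (b_+, b_-^dagger)
    eta = np.diag([1.0, -1.0])       # metric preserving bosonic commutators
    freqs = np.linalg.eigvals(eta @ m)
    v_charge = float(np.max(freqs.real))  # positive branch, +sqrt(a^2 - c^2)
    v_spin = float(v_f)                   # spin sector is untouched by g2, g4
    return v_charge, v_spin


if __name__ == "__main__":
    v_f, g2, g4 = 1.0, 0.5, 0.3  # hypothetical parameters for illustration
    v_charge, v_spin = mode_velocities(v_f, g2, g4)
    closed_form = np.sqrt((v_f + g4 / np.pi) ** 2 - (g2 / np.pi) ** 2)
    print("charge velocity:", v_charge, "closed form:", closed_form)
    print("spin velocity:", v_spin)
```

The two velocities differ as soon as $g_2$ or $g_4$ is non-zero, which is the content of spin-charge separation in this model.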


The Kondo problem 

Historically, the Kondo problem has assumed a place of great significance in the development of the field 
of strongly correlated quantum systems. It represents perhaps the simplest example of a phenomenon 
driven by strong electron interaction and, unusually for this arena of physics, admits a detailed theoretical 
understanding. Further, in respect of the principles established in Chapter 1, it exemplifies a number of 
important ideas from the concept of reducibility - the collective properties of the system may be captured 
by a simplified effective Hamiltonian which includes only the relevant low-energy degrees of freedom - 
and the renormalization group. In the following problem, we will seek to develop the low-energy theory 
of the “Kondo impurity system” leaving the discussion of its phenomenology to Problems 5.5 and 8.8.5 
in subsequent chapters. 

The Kondo effect is rooted in the experimental observation that, when small amounts of 
magnetic ion impurities are embedded in a metallic host (such as manganese in copper, or 
iron in CuAu alloys), a pronounced minimum develops in the temperature dependence of 
the resistivity. Although the phenomenon was discovered experimentally in 1934, 27 it was 
not until 1964 that a firm understanding of the phenomenon was developed by Kondo. 28 
Historically, the first step towards the resolution of this phenomenon came with a suggestion 
by Anderson that the system could be modeled as an itinerant band of electron states inter- 
acting with local dilute magnetic moments associated with the ion impurities. 29 Anderson 
proposed that the integrity of the local moment was protected by a large local Coulomb 
repulsion which inhibited multiple occupancy of the orbital state - a relative of the Hubbard
$U$-interaction. Such behavior is encoded in the Anderson impurity Hamiltonian

$$H = \sum_{k\sigma}\left[\epsilon_kc^\dagger_{k\sigma}c_{k\sigma} + \left(V_kd^\dagger_\sigma c_{k\sigma} + \mathrm{h.c.}\right)\right] + \epsilon_d\sum_\sigma n_{d\sigma} + Un_{d\uparrow}n_{d\downarrow},$$

where the operators $c^\dagger_{k\sigma}$ create itinerant electrons of spin $\sigma$ and energy $\epsilon_k$ in the metallic host
while the operators $d^\dagger_\sigma$ create electrons of spin $\sigma$ on the local impurity at position $\mathbf{r} = 0$.
Here we have used $n_{d\sigma} = d^\dagger_\sigma d_\sigma$ to denote the number operator. While electrons in the band
are assumed to be characterized by a Fermi-liquid-like behavior, those associated with the
impurity state experience an on-site Coulomb interaction of a strength characterized by a
Hubbard energy $U$.

27 de Haas, de Boer, and van den Berg, The electrical resistance of gold, copper, and lead at low temperatures,
Physica 1 (1934), 1115.

28 J. Kondo, Resistance minimum in dilute magnetic alloys, Prog. Theor. Phys. 32 (1964), 37-69.

29 P. W. Anderson, Localized magnetic states in metals, Phys. Rev. 124 (1961), 41-53.

According to the experimental phenomenology, the Fermi level $\epsilon_F$ is assumed to lie somewhere
in between the single-particle impurity level $\epsilon_d$ and $\epsilon_d + U$ so that, on average, the
site occupancy of the impurity is unity. Nevertheless, the matrix element coupling the local
moment to the itinerant electron states $V_k = L^{-d/2}\int d\mathbf{r}\,V(\mathbf{r})e^{i\mathbf{k}\cdot\mathbf{r}}$ admits the existence of
virtual processes in which the site occupancy can fluctuate between zero and two. These
virtual fluctuations allow the spin on the impurity site to flip through exchange.

The discussion of the half-filled Hubbard system in Section 2.2 suggests that it will
be helpful to transform the Anderson impurity Hamiltonian to an effective theory which
exposes the low-energy content of the system. To this end, let us express the total wavefunction
of the many-body Hamiltonian $|\psi\rangle$ as the sum of terms $|\psi_0\rangle$, $|\psi_1\rangle$, and $|\psi_2\rangle$, where
the subscript denotes the occupancy of the impurity site. With this decomposition, the
Schrödinger equation for the Hamiltonian can be cast in matrix form, $\sum_nH_{mn}|\psi_n\rangle =
E|\psi_m\rangle$, where $H_{mn} = P_mHP_n$, and the operators $P_m$ project onto the subspace with $m$
electrons on the impurity (i.e. $P_0 = \prod_\sigma(1 - n_{d\sigma})$, etc.).


(a) Construct the operators $H_{mn}$ explicitly and explain why $H_{20} = H_{02} = 0$.

(b) Since we are interested in the effect of virtual excitations from the $|\psi_1\rangle$ subspace, we may
proceed by formally eliminating $|\psi_0\rangle$ and $|\psi_2\rangle$ from the Schrödinger equation. Doing so,
show that the equation for $|\psi_1\rangle$ can be written as

$$\left(H_{10}\frac{1}{E - H_{00}}H_{01} + H_{11} + H_{12}\frac{1}{E - H_{22}}H_{21}\right)|\psi_1\rangle = E|\psi_1\rangle.$$

(c) At this stage, the equation for $|\psi_1\rangle$ is exact. Show that, when substituted into this
expression, an expansion to leading order in $1/U$ and $1/\epsilon_d$ leads to the expression

$$H_{12}\frac{1}{E - H_{22}}H_{21} + H_{10}\frac{1}{E - H_{00}}H_{01} \simeq -\sum_{kk'\sigma\sigma'}V_k^*V_{k'}\left(\frac{c^\dagger_{k\sigma}c_{k'\sigma'}d_\sigma d^\dagger_{\sigma'}}{U + \epsilon_d - \epsilon_{k'}} + \frac{c_{k'\sigma'}c^\dagger_{k\sigma}d^\dagger_{\sigma'}d_\sigma}{\epsilon_k - \epsilon_d}\right).$$

To obtain the first term in the expression, consider the commutation of $(E - H_{22})^{-1}$
with $H_{21}$ and make use of the fact that the total operator acts upon the singly occupied
subspace. A similar line of reasoning will lead to the second term in the expression. Here
$U + \epsilon_d - \epsilon_{k'}$ and $\epsilon_d - \epsilon_k$ denote the respective excitation energies of the virtual states.

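Making use of the Pauli matrix identity, $\boldsymbol{\sigma}_{\alpha\beta}\cdot\boldsymbol{\sigma}_{\gamma\delta} = 2\delta_{\alpha\delta}\delta_{\beta\gamma} - \delta_{\alpha\beta}\delta_{\gamma\delta}$, it follows that
(exercise)

$$\sum_{\sigma\sigma'}c^\dagger_{k\sigma}c_{k'\sigma'}d^\dagger_{\sigma'}d_\sigma = 2\,\mathbf{s}_{kk'}\cdot\mathbf{S}_d + \frac{1}{2}\sum_{\sigma\sigma'}c^\dagger_{k\sigma}c_{k'\sigma}n_{d\sigma'},$$

where $\mathbf{S}_d = \sum_{\alpha\beta}d^\dagger_\alpha\boldsymbol{\sigma}_{\alpha\beta}d_\beta/2$ denotes the spin-1/2 degree of freedom associated
with the impurity and $\mathbf{s}_{kk'} = \sum_{\alpha\beta}c^\dagger_{k\alpha}\boldsymbol{\sigma}_{\alpha\beta}c_{k'\beta}/2$. Combining this result with that
obtained above, up to an irrelevant constant, the total effective Hamiltonian (including
$H_{11}$) acting in the projected subspace $|\psi_1\rangle$ is given by

$$H_{sd} = \sum_{k\sigma}\epsilon_kc^\dagger_{k\sigma}c_{k\sigma} + \sum_{kk'}\left[2J_{k,k'}\,\mathbf{s}_{kk'}\cdot\mathbf{S}_d + K_{k,k'}\sum_\sigma c^\dagger_{k\sigma}c_{k'\sigma}\right],$$

where

$$J_{k,k'} = V_{k'}V_k^*\left[\frac{1}{U + \epsilon_d - \epsilon_{k'}} + \frac{1}{\epsilon_k - \epsilon_d}\right], \qquad K_{k,k'} = \frac{V_{k'}V_k^*}{2}\left[\frac{1}{\epsilon_k - \epsilon_d} - \frac{1}{U + \epsilon_d - \epsilon_{k'}}\right].$$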


With both $U + \epsilon_d$ and $\epsilon_d$ greatly in excess of the typical excitation energy scales, one
may safely neglect the particular energy dependence of the parameters $J_{k,k'}$ and $K_{k,k'}$.
In this case, the exchange interaction $J_{k,k'}$ can be treated as local, the scattering term
$K_{k,k'}$ can be absorbed into a shift of the single-particle energy of the itinerant band, and
the positive (i.e. antiferromagnetic) exchange coupling can be accommodated through
the effective sd-Hamiltonian

$$H_{sd} = \sum_{k\sigma}\epsilon_kc^\dagger_{k\sigma}c_{k\sigma} + 2J\,\mathbf{S}_d\cdot\mathbf{s}(\mathbf{r} = 0), \tag{2.51}$$

where $\mathbf{s}(\mathbf{r} = 0) = \sum_{kk'\sigma\sigma'}c^\dagger_{k\sigma}\boldsymbol{\sigma}_{\sigma\sigma'}c_{k'\sigma'}/2$ denotes the spin density of the itinerant
electron band at the impurity site. To understand how the magnetic impurity affects
the low-temperature transport, we refer to Problem 5.5, where the sd-Hamiltonian is
explored in the framework of a diagrammatic perturbation theory in the spin interaction.
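
The Pauli matrix identity invoked in part (c), $\boldsymbol{\sigma}_{\alpha\beta}\cdot\boldsymbol{\sigma}_{\gamma\delta} = 2\delta_{\alpha\delta}\delta_{\beta\gamma} - \delta_{\alpha\beta}\delta_{\gamma\delta}$, is easily confirmed by brute force over all sixteen index combinations. The short sketch below does so; the function names are illustrative and not taken from the text.

```python
import numpy as np

# The three Pauli matrices, stacked along the first axis.
PAULI = np.array(
    [
        [[0, 1], [1, 0]],
        [[0, -1j], [1j, 0]],
        [[1, 0], [0, -1]],
    ],
    dtype=complex,
)


def pauli_contraction():
    """Left-hand side: sum_a sigma^a_{alpha beta} sigma^a_{gamma delta}."""
    return np.einsum("aij,akl->ijkl", PAULI, PAULI)


def delta_combination():
    """Right-hand side: 2 delta_{alpha delta} delta_{beta gamma} - delta_{alpha beta} delta_{gamma delta}."""
    d = np.eye(2)
    return 2.0 * np.einsum("il,jk->ijkl", d, d) - np.einsum("ij,kl->ijkl", d, d)


if __name__ == "__main__":
    print("identity holds:", np.allclose(pauli_contraction(), delta_combination()))
```

Contracting both sides with $c^\dagger_{k\alpha}c_{k'\beta}d^\dagger_\gamma d_\delta/4$ then reproduces the operator relation between the exchange term and $\mathbf{s}_{kk'}\cdot\mathbf{S}_d$ used in the text.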


Answer: 


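(a) Since the diagonal elements $H_{mm}$ leave the occupation number fixed, they may be
identified with the diagonal elements of the microscopic Hamiltonian, i.e.

$$H_{00} = \sum_{k\sigma}\epsilon_kc^\dagger_{k\sigma}c_{k\sigma}, \qquad H_{11} = \sum_{k\sigma}\epsilon_kc^\dagger_{k\sigma}c_{k\sigma} + \epsilon_d, \qquad H_{22} = \sum_{k\sigma}\epsilon_kc^\dagger_{k\sigma}c_{k\sigma} + 2\epsilon_d + U.$$

The off-diagonal terms arise from the hybridization between the free electron states and
the impurity. Since the coupling involves only the transfer of single electrons, $H_{02} =
H_{20} = 0$ and

$$H_{10} = \sum_{k\sigma}V_kd^\dagger_\sigma(1 - n_{d\bar{\sigma}})c_{k\sigma}, \qquad H_{21} = \sum_{k\sigma}V_kd^\dagger_\sigma n_{d\bar{\sigma}}c_{k\sigma},$$

where $\bar{\sigma} = \uparrow$ for $\sigma = \downarrow$ and vice versa, $H_{01} = H_{10}^\dagger$ and $H_{12} = H_{21}^\dagger$.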

(b) Using the fact that $H_{00}|\psi_0\rangle + H_{01}|\psi_1\rangle = E|\psi_0\rangle$, one may set $|\psi_0\rangle = (E - H_{00})^{-1}H_{01}|\psi_1\rangle$
and, similarly, $|\psi_2\rangle = (E - H_{22})^{-1}H_{21}|\psi_1\rangle$. Then, substituting into the equation for
$|\psi_1\rangle$, one obtains the required expression.

(c) Making use of the expressions from part (a), we have

$$H_{12}\frac{1}{E - H_{22}}H_{21} = \sum_{kk'\sigma\sigma'}V_k^*V_{k'}\,c^\dagger_{k\sigma}n_{d\bar{\sigma}}d_\sigma\frac{1}{E - H_{22}}d^\dagger_{\sigma'}n_{d\bar{\sigma}'}c_{k'\sigma'},$$

$$H_{10}\frac{1}{E - H_{00}}H_{01} = \sum_{kk'\sigma\sigma'}V_kV_{k'}^*\,d^\dagger_\sigma(1 - n_{d\bar{\sigma}})c_{k\sigma}\frac{1}{E - H_{00}}c^\dagger_{k'\sigma'}(1 - n_{d\bar{\sigma}'})d_{\sigma'}.$$

Finally, noting that this operator acts upon the singly-occupied subspace spanned by
$|\psi_1\rangle$, we see that the factors involving $n_{d\sigma}$ are redundant and can be dropped. As a
result, swapping the momentum and spin indices in the second part of the expression,
we obtain the required expression.
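
The elimination of $|\psi_0\rangle$ and $|\psi_2\rangle$ in part (b) is a purely algebraic statement about block matrices with $H_{02} = H_{20} = 0$, and it can be tested on random matrices. The sketch below is a generic illustration, not the Anderson model itself: block dimensions, the energy shift that pushes the 0 and 2 sectors to high energy, and all function names are assumptions made for the example.

```python
import numpy as np


def random_hermitian(rng, dim):
    m = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    return 0.5 * (m + m.conj().T)


def random_blocks(rng, dims=(3, 4, 3), shift=10.0):
    """Blocks H00, H11, H22, H10, H12; sectors 0 and 2 are shifted up in energy."""
    d0, d1, d2 = dims
    h00 = random_hermitian(rng, d0) + shift * np.eye(d0)
    h11 = random_hermitian(rng, d1)
    h22 = random_hermitian(rng, d2) + shift * np.eye(d2)
    h10 = rng.normal(size=(d1, d0)) + 1j * rng.normal(size=(d1, d0))
    h12 = rng.normal(size=(d1, d2)) + 1j * rng.normal(size=(d1, d2))
    return h00, h11, h22, h10, h12


def full_hamiltonian(h00, h11, h22, h10, h12):
    zero = np.zeros((h00.shape[0], h22.shape[0]))  # H02 = 0: no direct 0 <-> 2 coupling
    return np.block(
        [
            [h00, h10.conj().T, zero],
            [h10, h11, h12],
            [zero.T, h12.conj().T, h22],
        ]
    )


def effective_hamiltonian(energy, h00, h11, h22, h10, h12):
    """H10 (E - H00)^-1 H01 + H11 + H12 (E - H22)^-1 H21."""
    h01, h21 = h10.conj().T, h12.conj().T
    term0 = h10 @ np.linalg.solve(energy * np.eye(h00.shape[0]) - h00, h01)
    term2 = h12 @ np.linalg.solve(energy * np.eye(h22.shape[0]) - h22, h21)
    return term0 + h11 + term2


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    blocks = random_blocks(rng)
    energies, vectors = np.linalg.eigh(full_hamiltonian(*blocks))
    e0 = energies[0]
    d0, d1 = blocks[0].shape[0], blocks[1].shape[0]
    psi1 = vectors[d0:d0 + d1, 0]  # component of the eigenvector in sector 1
    residual = effective_hamiltonian(e0, *blocks) @ psi1 - e0 * psi1
    print("residual norm:", np.linalg.norm(residual))
```

The residual vanishes to numerical precision for any exact eigenvalue $E$ that does not coincide with an eigenvalue of $H_{00}$ or $H_{22}$, which reflects the remark that the reduced equation is exact before the expansion in $1/U$ and $1/\epsilon_d$.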
Feynman path integral


The aim of this chapter is to introduce the concept of the Feynman path integral. As well as develop- 
ing the general construction scheme, particular emphasis is placed on establishing the interconnections 
between the quantum mechanical path integral, classical Hamiltonian mechanics, and classical statistical 
mechanics. The practice of path integration is discussed in the context of several pedagogical applica- 
tions. As well as the canonical examples of a quantum particle in a single and a double potential well, 
we discuss the generalization of the path integral scheme to tunneling of extended objects (quantum 
fields), dissipative and thermally assisted quantum tunneling, and the quantum mechanical spin. 


In this chapter we temporarily leave the arena of many-body physics and second quantiza- 
tion and, at least superficially, return to single-particle quantum mechanics. By establishing 
the path integral approach for ordinary quantum mechanics, we will set the stage for the 
introduction of field integral methods for many-body theories explored in the next chap- 
ter. We will see that the path integral not only represents a gateway to higher-dimensional 
functional integral methods but, when viewed from an appropriate perspective, already 
represents a field theoretical approach in its own right. Exploiting this connection, various 
concepts of field theory, namely stationary phase analysis, the Euclidean formulation of field 
theory, instanton techniques, and the role of topology in field theory, are introduced in this 
chapter. 


3.1 The path integral: general formalism 

Broadly speaking, there are two approaches to the formulation of quantum mechanics: the
“operator approach” based on the canonical quantization of physical observables and the
associated operator algebra, and the Feynman path integral.1 Whereas canonical quantization
is usually taught first in elementary courses on quantum mechanics, path integrals seem
to have acquired the reputation of being a sophisticated concept that is better reserved for
advanced courses. Yet this treatment is hardly justified! In fact, the path integral formulation
has many advantages, most of which explicitly support an intuitive understanding of quantum
mechanics. Moreover, integrals - even the infinite-dimensional ones encountered below - are
hardly more abstract than infinite-dimensional linear operators. Further merits of the path
integral include the following:

> Whereas the classical limit is not always easy to retrieve within the canonical formulation
of quantum mechanics, it constantly remains visible in the path integral approach. The
latter makes explicit use of classical mechanics as a “platform” on which to construct a
theory of quantum fluctuations. The classical solutions of Hamilton’s equation of motion
always remain a central ingredient of the formalism.2

> Path integrals allow for an efficient formulation of non-perturbative approaches to the
solution of quantum mechanical problems. Examples include the “instanton” formulation
of quantum tunneling discussed below. The extension of such methods to continuum
theories has led to some of the most powerful concepts of quantum field theory.

> The Feynman path integral represents a prototype of the higher-dimensional field integrals
to be introduced in the next chapter. However, even the basic “zero-dimensional” path
integral is of relevance to applications in many-body physics. Very often, one encounters
environments such as the superconductor, superfluid, or strongly correlated few-electron
devices where a macroscopically large number of degrees of freedom “lock” to form a
single collective variable. (For example, to a first approximation, the phase information
carried by the order parameter field in moderately large superconducting grains can often
be described in terms of a single phase degree of freedom, i.e. a “quantum particle” living
on the unit circle.) Path integral techniques have proven ideally suited to the analysis of
such systems.

1 For a more extensive introduction to the Feynman path integral, one can refer to one of the many standard
texts including R. P. Feynman and A. R. Hibbs, Quantum Mechanics and Path Integrals (McGraw-Hill, 1965),
J. W. Negele and H. Orland, Quantum Many Particle Systems (Addison-Wesley, 1988), and L. S. Schulman,
Techniques and Applications of Path Integration (Wiley, 1981). Alternatively, one may turn to the original
paper, R. P. Feynman, Space-time approach to non-relativistic quantum mechanics, Rev. Mod. Phys. 20 (1948),
362-87. Historically, Feynman’s development of the path integral was motivated by earlier work by Dirac on
the connection between classical and quantum mechanics, P. A. M. Dirac, On the analogy between classical and
quantum mechanics, Rev. Mod. Phys. 17 (1945), 195-9.

What then is the basic idea of the path integral approach? More than any other formulation
of quantum mechanics, the path integral formalism is based on connections to classical
mechanics. The variational approach employed in Chapter 1 relied on the fact that classically
allowed trajectories in configuration space extremize an action functional. A principal
constraint to be imposed on any such trajectory is energy conservation. By contrast, quantum
particles have a little bit more freedom than their classical counterparts. In particular,
by the Uncertainty Principle, energy conservation can be violated by an amount $\Delta E$ over a
time $\sim \hbar/\Delta E$ (here, and throughout this chapter, we will use $\hbar$ for clarity). The connection
to action principles of classical mechanics becomes particularly apparent in problems of
quantum tunneling: a particle of energy $E$ may tunnel through a potential barrier of height
$V > E$. However, this process is penalized by a damping factor $\sim \exp(i\int_{\rm barrier}dx\,p/\hbar)$,
where $p = \sqrt{2m(E - V)}$, i.e. the exponent of the (imaginary) action associated with the
classically forbidden path.

Richard P. Feynman 1918-88
Nobel Laureate in Physics in 1965 (with Sin-Itiro Tomonaga, and Julian Schwinger) for
“fundamental work in quantum electrodynamics, with far-reaching consequences for the physics
of elementary particles.” He was also well known for his unusual life style and for his popular
books and lectures on mathematics and physics. (Image © The Nobel Foundation.)

2 For this reason, path integration has turned out to be an indispensable tool in fields such as quantum chaos
where the quantum manifestations of classically non-trivial behavior are investigated - for more details, see
Section 3.3.

These observations motivate the idea of a new formulation of quantum propagation: could
it be that, as in classical mechanics, the quantum amplitude $A$ for propagation between
any two points in coordinate space is again controlled by the action functional - controlled
in a relaxed sense where not just a single extremal path $x_{\rm cl}(t)$, but an entire manifold
of neighboring paths contribute? More specifically, one might speculate that the quantum
amplitude is obtained as $A \sim \sum_{x(t)}\exp(iS[x]/\hbar)$, where $\sum_{x(t)}$ symbolically stands for a
summation over all paths compatible with the initial conditions of the problem, and $S$
denotes the classical action. Although, at this stage, no formal justification for the path
integral has been presented, with this ansatz some features of quantum mechanics would
obviously be borne out correctly. Specifically, in the classical limit ($\hbar \to 0$), the quantum
mechanical amplitude would become increasingly dominated by the contribution to the
sum from the classical path $x_{\rm cl}(t)$. This is because non-extremal configurations would be
weighted by a rapidly oscillating amplitude associated with the large phase $S/\hbar$ and would,
therefore, average to zero.3 Secondly, quantum mechanical tunneling would be a natural
element of the theory; non-classical paths do contribute to the net amplitude, but at the cost
of a damping factor specified by the imaginary action (as in the traditional formulation).

Fortunately, no fundamentally novel “picture” of quantum mechanics needs to be declared
to promote the idea of the path “integral” $\sum_{x(t)}\exp(iS[x]/\hbar)$ to a working theory. As we
will see in the next section, the new formulation can be developed from the established
principles of canonical quantization.


3.2 Construction of the path integral 

All information about an autonomous4 quantum system is contained in its time evolution
operator. A formal integration of the time-dependent Schrödinger equation $i\hbar\partial_t|\Psi\rangle = H|\Psi\rangle$
gives the time evolution operator

$$|\Psi(t')\rangle = U(t', t)|\Psi(t)\rangle, \qquad U(t', t) = e^{-\frac{i}{\hbar}H(t' - t)}\Theta(t' - t). \tag{3.1}$$

The operator $U(t', t)$ describes dynamical evolution under the influence of the Hamiltonian
from a time $t$ to time $t'$. Causality implies that $t' > t$, as indicated by the step or Heaviside
$\Theta$-function. In the real space representation we can write

$$\Psi(q', t') = \langle q'|\Psi(t')\rangle = \langle q'|U(t', t)|\Psi(t)\rangle = \int dq\,U(q', t'; q, t)\Psi(q, t),$$

where $U(q', t'; q, t) = \langle q'|e^{-\frac{i}{\hbar}H(t' - t)}|q\rangle\Theta(t' - t)$ defines the $(q', q)$-component of the time
evolution operator. As the matrix element expresses the probability amplitude for a particle
to propagate between points $q$ and $q'$ in a time $t' - t$, it is sometimes known as the
propagator of the theory.

3 More precisely, in the limit of small $\hbar$, the path sum can be evaluated by saddle-point methods, as detailed
below.

4 A system is classified as autonomous if its Hamiltonian does not explicitly depend on time. Actually the
construction of the path integral can be straightforwardly extended so as to include time-dependent problems.
However, in order to keep the introductory discussion as simple as possible, here we assume time independence.

The basic idea behind Feynman’s path integral approach is easy to formulate. Rather
than attacking the Schrödinger equation governing the time evolution for general times $t$,
one may first attempt to solve the much simpler problem of describing the time evolution
for infinitesimally small times $\Delta t$. In order to formulate this idea one must first divide the
time evolution into $N \gg 1$ time steps,

$$e^{-iHt/\hbar} = \left[e^{-iH\Delta t/\hbar}\right]^N, \tag{3.2}$$

where $\Delta t = t/N$. Albeit nothing more than a formal rewriting of Eq. (3.1), the representation
(3.2) has the advantage that the factors $e^{-iH\Delta t/\hbar}$ (or, rather, their expectation values)
are small. (More precisely, if $\Delta t$ is much smaller than the [reciprocal of the] eigenvalues of
the Hamiltonian in the regime of physical interest, the exponents are small in comparison
with unity and, as such, can be treated perturbatively.) A first simplification arising from
this fact is that the exponentials can be factorized into two pieces, each of which can be
readily diagonalized. To achieve this factorization, we make use of the identity

$$e^{-iH\Delta t/\hbar} = e^{-iT\Delta t/\hbar}e^{-iV\Delta t/\hbar} + O(\Delta t^2),$$

where the Hamiltonian $H = T + V$ is the sum of a kinetic energy $T = p^2/2m$, and some
potential energy operator $V$.5 (The following analysis, restricted for simplicity to a
one-dimensional Hamiltonian, is easily generalized to arbitrary spatial dimension.) The
advantage of this factorization is that the eigenstates of each factor $e^{-iT\Delta t/\hbar}$ and $e^{-iV\Delta t/\hbar}$ are
known independently. To exploit this fact we consider the time evolution operator factorized
as a product,

$$\langle q_f|\left[e^{-iH\Delta t/\hbar}\right]^N|q_i\rangle \simeq \langle q_f|\wedge e^{-iT\Delta t/\hbar}e^{-iV\Delta t/\hbar}\wedge\cdots\wedge e^{-iT\Delta t/\hbar}e^{-iV\Delta t/\hbar}|q_i\rangle, \tag{3.3}$$

and insert at each of the positions indicated by the symbol “$\wedge$” the resolution of identity

$$\mathrm{id} = \int dq_n\int dp_n\,|q_n\rangle\langle q_n|p_n\rangle\langle p_n|. \tag{3.4}$$

Here | q n ) and \p n ) represent a complete set of position and momentum eigenstates respec- 
tively, and n = 1 , ,N serves as an index keeping track of the time steps at which the 
unit operator is inserted. The rationale behind the particular choice (3.4) is clear. The 
unit operator is arranged in such a way that both T and V act on the corresponding 


5 Although this ansatz covers a wide class of quantum problems, many applications (e.g. Hamiltonians involving 
spin or magnetic fields) do not fit into this framework. For a detailed exposition covering its realm of applicability, 
we refer to the specialist literature such as, e.g., Schulman 1 . 
Figure 3.1 (a) Visualization of a set of phase space points contributing to the discrete time
configuration integral (3.5). (b) In the continuum limit, the set of points becomes a smooth curve.


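eigenstates. Inserting Eq. (3.4) into (3.3), and making use of the identity $\langle q|p\rangle = \langle p|q\rangle^* =
e^{iqp/\hbar}/(2\pi\hbar)^{1/2}$, one obtains

$$\langle q_f|e^{-i\hat H t/\hbar}|q_i\rangle \simeq \int \prod_{\substack{n=1\\ q_N=q_f,\ q_0=q_i}}^{N-1} dq_n \prod_{n=1}^{N}\frac{dp_n}{2\pi\hbar}\, \exp\left[-\frac{i\Delta t}{\hbar}\sum_{n=0}^{N-1}\left(V(q_n)+T(p_{n+1}) - p_{n+1}\frac{q_{n+1}-q_n}{\Delta t}\right)\right]. \tag{3.5}$$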


Thus, the matrix element of the time evolution operator has been expressed as a $(2N-1)$-dimensional
integral over eigenvalues. Up to corrections of higher order in $V\Delta t/\hbar$ and
$T\Delta t/\hbar$, the expression (3.5) is exact. At each “time step” $t_n = n\Delta t$, $n = 1,\dots,N$, we are
integrating over a pair of coordinates $x_n \equiv (q_n,p_n)$ parameterizing the classical phase
space. Taken together, the points $\{x_n\}$ form an $N$-point discretization of a path in this
space (see Fig. 3.1).
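
As a numerical illustration of the factorization underlying Eq. (3.5), the following sketch compares $e^{-i\hat H\Delta t}$ with $e^{-i\hat T\Delta t}e^{-i\hat V\Delta t}$ for a small matrix model. Everything specific here is an assumption made for illustration and is not taken from the text: units $\hbar = m = 1$, a harmonic potential $V = q^2/2$, a finite-difference kinetic energy on a 64-point grid, and the helper names `expm_hermitian` and `splitting_error`. If the splitting error is $O(\Delta t^2)$, halving $\Delta t$ should reduce it by a factor close to four.

```python
import numpy as np

def expm_hermitian(h, s):
    """Return exp(-1j * s * h) for a Hermitian matrix h via its eigendecomposition."""
    w, u = np.linalg.eigh(h)
    return (u * np.exp(-1j * s * w)) @ u.conj().T

def splitting_error(dt, n=64, box=10.0):
    """Spectral-norm distance between exp(-i H dt) and exp(-i T dt) exp(-i V dt)."""
    x = np.linspace(-box / 2, box / 2, n)
    dx = x[1] - x[0]
    off = np.ones(n - 1)
    # Second-difference approximation of d^2/dx^2 (hypothetical discretization choice).
    laplacian = (np.diag(off, 1) + np.diag(off, -1) - 2.0 * np.eye(n)) / dx**2
    T = -0.5 * laplacian          # kinetic energy, hbar = m = 1
    V = np.diag(0.5 * x**2)       # harmonic potential, assumed for illustration
    exact = expm_hermitian(T + V, dt)
    split = expm_hermitian(T, dt) @ expm_hermitian(V, dt)
    return np.linalg.norm(exact - split, 2)

# Halving the time step: an O(dt^2) error should shrink by roughly a factor of 4.
ratio = splitting_error(2e-3) / splitting_error(1e-3)
print(ratio)
```

The per-step error is quadratic in $\Delta t$, so the accumulated error over $N = t/\Delta t$ steps scales as $\Delta t$ and vanishes in the limit $N \to \infty$.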

To make further progress, we need to develop some intuition for the behavior of the
integral (3.5). We first notice that rapid fluctuations of the integration arguments $x_n$ as
a function of the index $n$ are strongly inhibited by the structure of the integrand. When
taken together, contributions for which $(q_{n+1}-q_n)\,p_{n+1} > O(\hbar)$ (i.e. when the phase of the
exponential exceeds $2\pi$) tend to lead to a “random phase cancellation.” In the language of
wave mechanics, the superposition of partial waves of erratically different phases
destructively interferes. The smooth variation of the paths that contribute significantly motivates
the application of a continuum limit analogous to that employed in Chapter 1.

To be specific, sending $N \to \infty$ whilst keeping $t = N\Delta t$ fixed, the formerly discrete set
$t_n = n\Delta t$, $n = 1,\dots,N$, becomes dense on the time interval $[0,t]$, and the set of phase space
points $\{x_n\}$ becomes a continuous curve $x(t)$. In the same limit,

$$\Delta t\sum_{n=0}^{N-1} \to \int_0^t dt', \qquad \frac{q_{n+1}-q_n}{\Delta t} \to \partial_{t'}q\big|_{t'=t_n} \equiv \dot q\big|_{t'=t_n},$$

while $[V(q_n) + T(p_{n+1})] \to [T(p|_{t'=t_n}) + V(q|_{t'=t_n})] \equiv H(x|_{t'=t_n})$ denotes the classical
Hamiltonian. In the limit $N \to \infty$, the fact that kinetic and potential energies are evaluated
at neighboring time slices, $n$ and $n+1$, becomes irrelevant.⁶ Finally,

$$\lim_{N\to\infty} \int \prod_{\substack{n=1\\ q_N=q_f,\ q_0=q_i}}^{N-1} dq_n \prod_{n=1}^{N}\frac{dp_n}{2\pi\hbar} \equiv \int_{\substack{q(t)=q_f\\ q(0)=q_i}} Dx,$$

defines the integration measure of the integral.


INFO Integrals extending over infinite-dimensional integration measures like D(q,p) are gener- 
ally called functional integrals (recall our discussion of functionals in Chapter 1). The question 
of the way functional integration can be rigorously defined is far from innocent and represents 
a subject of current, and partly controversial, mathematical research. In this book - as in most 
applications in physics - we take a pragmatic point of view and deal with the infinite-dimensional 
integration naively unless mathematical problems arise (which actually will not be the case!). 


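Then, applying these conventions to Eq. (3.5), one finally obtains

$$\langle q_f|e^{-i\hat H t/\hbar}|q_i\rangle = \int_{\substack{q(t)=q_f\\ q(0)=q_i}} Dx\, \exp\left[\frac{i}{\hbar}\int_0^t dt'\,\bigl(p\dot q - H(p,q)\bigr)\right]. \tag{3.6}$$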


Equation (3.6) represents the Hamiltonian formulation of the path integral. The
integration extends over all possible paths through the classical phase space of the system
which begin and end at the configuration points $q_i$ and $q_f$ respectively (cf. Fig. 3.1).
The contribution of each path is weighted by its Hamiltonian action.


INFO Remembering the connection of the Hamiltonian to the Lagrangian through the Legendre
transform, $H(p,q) = p\dot q - L(q,\dot q)$, the classical action of a trajectory $t \mapsto q(t)$ is given by

$$S[p,q] = \int_0^t dt'\, L(q,\dot q) = \int_0^t dt'\,[p\dot q - H(p,q)].$$

Before we turn to the discussion of the path integral (3.6), it is useful to recast the integral in 
an alternative form which will be both convenient in applications and physically instructive. 
The search for an alternative formulation is motivated by the resemblance of Eq. (3.6) to 
the Hamiltonian formulation of classical mechanics. Given that, classically, Hamiltonian 
and Lagrangian mechanics can be equally employed to describe dynamical evolution, it is 
natural to seek a Lagrangian analog of Eq. (3.6). Focusing on Hamiltonians for which the 
kinetic energy T(p ) is quadratic in p, the Lagrangian form of the path integral can indeed 
be inferred from Eq. (3.6) by straightforward Gaussian integration. 


6 To see this formally, one may Taylor expand $T(p_{n+1}) = T(p(t'+\Delta t))|_{t'=n\Delta t}$ around $p(t')$. For smooth $p(t')$, all
but the zeroth-order contribution $T(p(t'))$ scale with powers of $\Delta t$, thereby becoming irrelevant. Note, however,
that all of these arguments are based on the assertion that the dominant contributions to the path integral are
smooth in the sense $q_{n+1}-q_n \sim O(\Delta t)$. A closer inspection, however, shows that in fact $q_{n+1}-q_n \sim O(\sqrt{\Delta t})$ (see
Schulman 1 .) In some cases, the most prominent one being the quantum mechanics of a particle in a magnetic
field, the lowered power of $\Delta t$ spoils the naive form of the continuity argument above, and more care must be
applied in taking the continuum limit. In cases where a “new” path integral description of a quantum mechanical
problem is developed, it is imperative to delay taking the continuum limit until the fluctuation behavior of the
discrete integral across individual time slices has been thoroughly examined.
To make this point clear, let us rewrite the integral in a way that emphasizes its dependence
on the momentum variable $p$:

$$\langle q_f|e^{-i\hat H t/\hbar}|q_i\rangle = \int_{\substack{q(t)=q_f\\ q(0)=q_i}} Dq\; e^{-\frac{i}{\hbar}\int_0^t dt'\,V(q)} \int Dp\; e^{-\frac{i}{\hbar}\int_0^t dt'\left(\frac{p^2}{2m} - p\dot q\right)}. \tag{3.7}$$

The exponent is quadratic in $p$ which means that we are dealing with the continuum
generalization of a Gaussian integral. Carrying out the integration by means of Eq. (3.13) below,
one obtains

$$\langle q_f|e^{-i\hat H t/\hbar}|q_i\rangle = \int_{\substack{q(t)=q_f\\ q(0)=q_i}} Dq\, \exp\left[\frac{i}{\hbar}\int_0^t dt'\, L(q,\dot q)\right], \tag{3.8}$$

where $Dq = \lim_{N\to\infty}\left(\frac{Nm}{it\,2\pi\hbar}\right)^{N/2}\prod_{n=1}^{N-1} dq_n$ denotes the functional measure of the remaining
$q$-integration, and $L(q,\dot q) = m\dot q^2/2 - V(q)$ represents the classical Lagrangian. Strictly
speaking, the (finite-dimensional) integral formula (3.13) is not directly applicable to the
infinite-dimensional Gaussian integral (3.7). This, however, does not represent a substantial
problem as we can always rediscretize the integral (3.7), apply Eq. (3.13), and reinstate the
continuum limit after integration (exercise).
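
The rediscretization step can be sketched for a single time slice. The following is a hedged illustration rather than the text's own derivation: it applies the one-dimensional formula (3.10) formally with the purely imaginary coefficient $a = i\Delta t/(m\hbar)$ and $b = i(q_{n+1}-q_n)/\hbar$, which lies on the boundary of the stated range $\mathrm{Re}\,a > 0$ and is therefore an analytic-continuation assumption.

```latex
\int \frac{dp_{n+1}}{2\pi\hbar}\,
  \exp\!\left[-\frac{i\Delta t}{\hbar}\left(\frac{p_{n+1}^2}{2m}
      - p_{n+1}\,\frac{q_{n+1}-q_n}{\Delta t}\right)\right]
  = \left(\frac{m}{2\pi i\hbar\,\Delta t}\right)^{1/2}
    \exp\!\left[\frac{i}{\hbar}\,\Delta t\,\frac{m}{2}
      \left(\frac{q_{n+1}-q_n}{\Delta t}\right)^{2}\right].
% N such factors give (m / 2 pi i hbar Delta t)^{N/2} = (N m / (i t 2 pi hbar))^{N/2},
% and the exponents add up to (i/hbar) * Delta t * sum_n (m/2) ((q_{n+1}-q_n)/Delta t)^2.
```

Collecting the $N$ prefactors with $\Delta t = t/N$ gives the factor $\left(Nm/(it\,2\pi\hbar)\right)^{N/2}$ of the measure $Dq$, and combining the exponent with the potential term $-\frac{i}{\hbar}\Delta t\,V(q_n)$ yields the discretized Lagrangian $m\dot q^2/2 - V(q)$.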

Together Eq. (3.6) and (3.8) represent the central results of this section. A quantum
mechanical transition amplitude has been expressed in terms of an infinite-dimensional
integral extending over paths through phase space, Eq. (3.6), or coordinate space, Eq. (3.8).
All paths begin (end) at the initial (final) coordinate of the matrix element. Each path is
weighted by its classical action. Notice in particular that the quantum transition amplitude
has been cast in a form that does not contain quantum mechanical operators. Nonetheless,
quantum mechanics is still fully present! The point is that the integration extends over all
paths and not just the subset of solutions of the classical equations of motion. (The
distinguished role classical paths play in the path integral will be discussed below in
Section 3.2.) The two forms of the path integral, Eq. (3.6) and Eq. (3.8), represent the formal
implementation of the “alternative picture” of quantum mechanics proposed heuristically at
the beginning of the chapter.


Johann Carl Friedrich Gauss 
1777-1855 

Worked in a wide variety of 
fields in both mathematics and 
physics including number the- 
ory, analysis, differential geome- 
try, geodesy, magnetism, astron- 
omy, and optics. As well as sev- 
eral books, Gauss published a number of memoirs 
(reports of his experiences), mainly in the journal 
of the Royal Society of Göttingen. However, in general,
he was unwilling to publish anything that could
be regarded as controversial and, as a result, some 
of his most brilliant work was found only after his 
death. 



INFO Gaussian integration: Apart from a few rare exceptions, all integrals encountered in 
this book will be of Gaussian form. In most cases the dimension of the integrals will be large if 
not infinite. Yet, after a bit of practice, it will become clear that high-dimensional representatives 
of Gaussian integrals are no more difficult to handle than their one-dimensional counterparts. 
Therefore, considering the important role played by Gaussian integration in field theory, we will
here derive the principal formulae once and for all. Our starting point is the one-dimensional 
integral (both real and complex). The ideas underlying the proofs of the one-dimensional formulae 
will provide the key to the derivation of more complex functional identities that will be used 
liberally throughout the remainder of the text. 


One-dimensional Gaussian integral: The basic ancestor of all Gaussian integrals is the
identity

$$\int_{-\infty}^{\infty} dx\; e^{-\frac{a}{2}x^2} = \sqrt{\frac{2\pi}{a}}, \qquad \mathrm{Re}\,a > 0. \tag{3.9}$$


In the following we will need various generalizations of Eq. (3.9). Firstly, we have
$\int_{-\infty}^{\infty} dx\, e^{-ax^2/2}\,x^2 = \sqrt{2\pi/a^3}$, a result established either by substituting $a \to a + \epsilon$ in Eq. (3.9) and expanding both
the left and the right side of the equation to leading order in $\epsilon$, or by differentiating Eq. (3.9).
Often one encounters integrals where the exponent is not purely quadratic from the outset but
rather contains both quadratic and linear pieces. The generalization of Eq. (3.9) to this case
reads

$$\int_{-\infty}^{\infty} dx\; e^{-\frac{a}{2}x^2 + bx} = \sqrt{\frac{2\pi}{a}}\; e^{\frac{b^2}{2a}}, \qquad \mathrm{Re}\,a > 0. \tag{3.10}$$

To prove this identity, one simply eliminates the linear term by means of the change of variables
$x \to x + b/a$ which transforms the exponent $-ax^2/2 + bx \to -ax^2/2 + b^2/2a$. The constant
factor scales out and we are left with Eq. (3.9). Note that Eq. (3.10) holds even for complex
$b$. The reason is that as a result of shifting the integration contour into the complex plane no
singularities are encountered, i.e. the integral remains invariant.
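
The one-dimensional formulae can be checked numerically. The sketch below is an illustration under stated assumptions, not part of the text: the helper name `gauss_1d`, the integration window, the grid size, and the test values of $a$ and $b$ are all hypothetical choices. It evaluates the integrals by a plain Riemann sum, which is very accurate for rapidly decaying Gaussian integrands, and includes a complex $b$ to illustrate the remark on contour shifts.

```python
import numpy as np

def gauss_1d(a, b=0.0, power=0, half_width=40.0, n=400001):
    """Riemann-sum estimate of the integral of x**power * exp(-a x^2 / 2 + b x) over the real line."""
    x = np.linspace(-half_width, half_width, n)
    dx = x[1] - x[0]
    return np.sum(x**power * np.exp(-0.5 * a * x**2 + b * x)) * dx

a = 1.3                 # real, positive (Re a > 0)
b = 0.7 + 0.3j          # complex linear coefficient

print(gauss_1d(a), np.sqrt(2 * np.pi / a))                                  # Eq. (3.9)
print(gauss_1d(a, power=2), np.sqrt(2 * np.pi / a**3))                      # second moment
print(gauss_1d(a, b), np.sqrt(2 * np.pi / a) * np.exp(b**2 / (2 * a)))      # Eq. (3.10)
```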

Later, we will be concerned with the generalization of the Gaussian integral to complex
arguments. The extension of Eq. (3.9) to this case reads

$$\int d(\bar z,z)\; e^{-\bar z w z} = \frac{\pi}{w}, \qquad \mathrm{Re}\,w > 0,$$

where $\bar z$ represents the complex conjugate of $z$. Here, $\int d(\bar z,z) \equiv \int_{-\infty}^{\infty} dx\,dy$ represents the
independent integration over the real and imaginary parts of $z = x + iy$. The identity is easy to
prove: owing to the fact that $\bar z z = x^2 + y^2$, the integral factorizes into two pieces, each of which is
equivalent to Eq. (3.9) with $a = 2w$. Similarly, it may be checked that the complex generalization
of Eq. (3.10) is given by

$$\int d(\bar z,z)\; e^{-\bar z w z + \bar u z + \bar z v} = \frac{\pi}{w}\, e^{\frac{\bar u v}{w}}, \qquad \mathrm{Re}\,w > 0. \tag{3.11}$$

More importantly, $\bar u$ and $v$ may be independent complex numbers; they need not be related to
each other by complex conjugation (exercise).

Gaussian integration in more than one dimension: All of the integrals above have
higher-dimensional counterparts. Although the real and complex versions of the $N$-dimensional integral
formulae can be derived in a perfectly analogous manner, it is better to discuss them separately
in order not to confuse the notation.


(a) Real case: The multi-dimensional generalization of the prototype integral (3.9) reads

$$\int d\mathbf v\; e^{-\frac12 \mathbf v^T A \mathbf v} = (2\pi)^{N/2}\det A^{-1/2}, \tag{3.12}$$

where $A$ is a positive definite real symmetric $N$-dimensional matrix and $\mathbf v$ is an $N$-component
real vector. The proof makes use of the fact that $A$ (by virtue of being symmetric) can be
diagonalized by orthogonal transformation, $A = O^T D O$, where the matrix $O$ is orthogonal,
and all elements of the diagonal matrix $D$ are positive. The matrix $O$ can be absorbed
into the integration vector by means of the variable transformation, $\mathbf v \mapsto O\mathbf v$, which has
unit Jacobian, $|\det O| = 1$. As a result, we are left with a Gaussian integral with exponent
$-\mathbf v^T D \mathbf v/2$. Due to the diagonality of $D$, the integral factorizes into $N$ independent Gaussian
integrals, each of which contributes a factor $\sqrt{2\pi/d_i}$, where $d_i$, $i = 1,\dots,N$, is the $i$th entry
of the matrix $D$. Noting that $\prod_{i=1}^N d_i = \det D = \det A$, Eq. (3.12) is derived.

The multi-dimensional generalization of Eq. (3.10) reads

$$\int d\mathbf v\; e^{-\frac12 \mathbf v^T A \mathbf v + \mathbf j^T\cdot\mathbf v} = (2\pi)^{N/2}\det A^{-1/2}\; e^{\frac12 \mathbf j^T A^{-1}\mathbf j}, \tag{3.13}$$

where $\mathbf j$ is an arbitrary $N$-component vector. Equation (3.13) is proven by analogy with
Eq. (3.10), i.e. by shifting the integration vector according to $\mathbf v \to \mathbf v + A^{-1}\mathbf j$, which does not
change the value of the integral but removes the linear term from the exponent,
$-\frac12\mathbf v^T A\mathbf v + \mathbf j^T\cdot\mathbf v \to -\frac12\mathbf v^T A\mathbf v + \frac12\mathbf j^T A^{-1}\mathbf j$. The resulting integral is of the type (3.12), and we arrive at
Eq. (3.13).
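
A quick numerical check of Eq. (3.13) for $N = 2$ can be sketched as follows. The matrix $A$, the source vector $\mathbf j$, the grid parameters, and the function names `gauss_2d_grid` and `gauss_formula` are assumptions chosen purely for illustration. The left-hand side is approximated by a two-dimensional Riemann sum and compared with the closed-form right-hand side.

```python
import numpy as np

def gauss_2d_grid(A, j, half_width=12.0, n=601):
    """Riemann-sum estimate of the integral of exp(-v.A.v/2 + j.v) over the plane."""
    x = np.linspace(-half_width, half_width, n)
    dx = x[1] - x[0]
    X, Y = np.meshgrid(x, x, indexing="ij")
    v = np.stack([X, Y], axis=-1)                      # shape (n, n, 2)
    quad = np.einsum("...i,ij,...j->...", v, A, v)     # v^T A v at every grid point
    lin = v @ j                                        # j^T v at every grid point
    return np.sum(np.exp(-0.5 * quad + lin)) * dx**2

def gauss_formula(A, j):
    """Right-hand side of Eq. (3.13)."""
    n_dim = len(j)
    return ((2 * np.pi) ** (n_dim / 2) / np.sqrt(np.linalg.det(A))
            * np.exp(0.5 * j @ np.linalg.solve(A, j)))

A = np.array([[2.0, 0.5], [0.5, 1.0]])   # positive definite, real symmetric
j = np.array([0.3, -0.2])
print(gauss_2d_grid(A, j), gauss_formula(A, j))
```

Setting `j` to zero reduces the comparison to Eq. (3.12).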

The integral (3.13) not only is of importance in its own right, but also serves as a “generator”
of other useful integral identities. Applying the differentiation operation $\partial^2_{j_m j_n}|_{\mathbf j=0}$ to the
left- and the right-hand side of Eq. (3.13), one obtains the identity⁷ $\int d\mathbf v\; e^{-\frac12\mathbf v^T A\mathbf v}\, v_m v_n =
(2\pi)^{N/2}\det A^{-1/2} A^{-1}_{mn}$. This result can be more compactly formulated as

$$\langle v_m v_n\rangle = A^{-1}_{mn}, \tag{3.14}$$

where we have introduced the shorthand notation

$$\langle\cdots\rangle \equiv (2\pi)^{-N/2}\det A^{1/2}\int d\mathbf v\; e^{-\frac12\mathbf v^T A\mathbf v}\,(\cdots), \tag{3.15}$$

suggesting an interpretation of the Gaussian weight as a probability distribution.

Indeed, the differentiation operation leading to Eq. (3.14) can be iterated. Differentiating four
times, one obtains $\langle v_m v_n v_q v_p\rangle = A^{-1}_{mn}A^{-1}_{qp} + A^{-1}_{mq}A^{-1}_{np} + A^{-1}_{mp}A^{-1}_{nq}$. One way of memorizing the
structure of this - important - identity is that the Gaussian “expectation” value $\langle v_m v_n v_p v_q\rangle$
is given by all “pairings” of type (3.14) that can be formed from the four components $v_m$. This
rule generalizes to expectation values of arbitrary order: $2n$-fold differentiation of Eq. (3.13)
yields

$$\langle v_{i_1} v_{i_2}\cdots v_{i_{2n}}\rangle = \sum_{\text{pairings of }\{i_1,\dots,i_{2n}\}} A^{-1}_{i_{k_1}i_{k_2}}\cdots A^{-1}_{i_{k_{2n-1}}i_{k_{2n}}}. \tag{3.16}$$

This result is the mathematical identity underlying Wick’s theorem (for real bosonic fields),
to be discussed in more physical terms below.
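
The pairing rule of Eq. (3.16) lends itself to a short combinatorial sketch. The code below is an illustration only; the function names `pairings` and `wick_moment` and the example matrix are hypothetical. It enumerates all complete pairings of the positions in an index list and sums the corresponding products of $A^{-1}$ elements, so that repeated indices are handled correctly.

```python
import numpy as np

def pairings(items):
    """Yield every way of grouping the given items into unordered pairs."""
    items = list(items)
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for k, partner in enumerate(rest):
        remaining = rest[:k] + rest[k + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail

def wick_moment(a_inv, indices):
    """Gaussian average <v_{i1} ... v_{i2n}> as a sum over pairings, cf. Eq. (3.16)."""
    total = 0.0
    # Pair up positions (not index values) so that repeated indices are counted properly.
    for pairing in pairings(range(len(indices))):
        term = 1.0
        for r, s in pairing:
            term *= a_inv[indices[r], indices[s]]
        total += term
    return total

a_inv = np.linalg.inv(np.array([[2.0, 0.5], [0.5, 1.0]]))
print(wick_moment(a_inv, [0, 1]))          # reproduces the two-point function (3.14)
print(wick_moment(a_inv, [0, 0, 1, 1]))    # three pairings contribute
```

The number of pairings of $2n$ objects is $(2n-1)!! = 1\cdot 3\cdots(2n-1)$, so the enumeration is practical only for low orders; an odd number of factors admits no complete pairing and the function returns zero.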


7 Note that the notation $A^{-1}_{mn}$ refers to the $mn$-element of the matrix $A^{-1}$.
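(b) Complex case: The results above are straightforwardly extended to multi-dimensional
complex Gaussian integrals. The complex version of Eq. (3.12) is given by

$$\int d(\mathbf v^\dagger,\mathbf v)\; e^{-\mathbf v^\dagger A\mathbf v} = \pi^N\det A^{-1}, \tag{3.17}$$

where $\mathbf v$ is a complex $N$-component vector, $d(\mathbf v^\dagger,\mathbf v) \equiv \prod_{i=1}^N d\,\mathrm{Re}\,v_i\; d\,\mathrm{Im}\,v_i$, and $A$ is a
complex matrix with positive definite Hermitian part. (Remember that every matrix can be
decomposed into a Hermitian and an anti-Hermitian component, $A = \frac12(A + A^\dagger) + \frac12(A - A^\dagger)$.)
For Hermitian $A$, the proof of Eq. (3.17) is analogous to that of Eq. (3.12), i.e. $A$ is
unitarily diagonalizable, $A = U^\dagger D U$ with diagonal $D$, the matrices $U$ can be absorbed into $\mathbf v$, the resulting
integral factorizes, etc. For non-Hermitian $A$ the proof is more elaborate, if unedifying, and
we refer to the literature for details. The generalization of Eq. (3.17) to exponents with linear
contributions reads

$$\int d(\mathbf v^\dagger,\mathbf v)\; e^{-\mathbf v^\dagger A\mathbf v + \mathbf w^\dagger\cdot\mathbf v + \mathbf v^\dagger\cdot\mathbf w'} = \pi^N\det A^{-1}\; e^{\mathbf w^\dagger A^{-1}\mathbf w'}. \tag{3.18}$$

Note that $\mathbf w$ and $\mathbf w'$ may be independent complex vectors. The proof of this identity mirrors
that of Eq. (3.13), i.e. by effecting the shift $\mathbf v^\dagger \to \mathbf v^\dagger + \mathbf w^\dagger A^{-1}$, $\mathbf v \to \mathbf v + A^{-1}\mathbf w'$.⁸ As with Eq. (3.13),
Eq. (3.18) may also serve as a generator of integral identities. Differentiating the integral
twice according to $\partial^2_{w'_m,\bar w_n}|_{\mathbf w=\mathbf w'=0}$ gives

$$\langle \bar v_m v_n\rangle = A^{-1}_{nm},$$

where $\langle\cdots\rangle \equiv \pi^{-N}\det A\int d(\mathbf v^\dagger,\mathbf v)\; e^{-\mathbf v^\dagger A\mathbf v}(\cdots)$. The iteration to more than two derivatives
gives $\langle \bar v_n\bar v_m v_p v_q\rangle = A^{-1}_{pm}A^{-1}_{qn} + A^{-1}_{pn}A^{-1}_{qm}$ and, eventually,

$$\langle \bar v_{i_1}\bar v_{i_2}\cdots\bar v_{i_n}\, v_{j_1}v_{j_2}\cdots v_{j_n}\rangle = \sum_P A^{-1}_{j_1 i_{P1}}\cdots A^{-1}_{j_n i_{Pn}},$$

where $\sum_P$ represents the sum over all permutations of $n$ integers.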

Gaussian functional integration: With this preparation, we are in a position to investigate
the main practice of quantum and statistical field theory - the method of Gaussian functional
integration. Turning to Eq. (3.13), let us suppose that the components of the vector $\mathbf v$
parameterize the weight of a real scalar field on the sites of a one-dimensional lattice. In the continuum
limit, the set $\{v_i\}$ translates to a function $v(x)$, and the matrix $A_{ij}$ is replaced by an operator
kernel or propagator $A(x,x')$. In this limit, the natural generalization of Eq. (3.13) is

$$\int Dv(x)\, \exp\left[-\frac12\int dx\,dx'\; v(x)A(x,x')v(x') + \int dx\; j(x)v(x)\right]
\propto (\det A)^{-1/2}\exp\left[\frac12\int dx\,dx'\; j(x)A^{-1}(x,x')j(x')\right], \tag{3.19}$$


8 For an explanation of why $\mathbf v$ and $\mathbf v^\dagger$ may be shifted independently of each other, cf. the analyticity remarks
made in connection with Eq. (3.11).







where the inverse kernel $A^{-1}(x,x')$ satisfies the equation

$$\int dx'\; A(x,x')A^{-1}(x',x'') = \delta(x - x''), \tag{3.20}$$

i.e. $A^{-1}(x,x')$ can be interpreted as the Green function of the operator $A(x,x')$. The notation
$Dv(x)$ is used to denote the measure of the functional integral. Although the constant of
proportionality, $(2\pi)^{N/2}$, left out of Eq. (3.19) is formally divergent in the thermodynamic limit $N \to \infty$,
it does not affect averages that are obtained from derivatives of such integrals. For example, for
Gaussian distributed functions, Eq. (3.14) has the generalization

$$\langle v(x)v(x')\rangle = A^{-1}(x,x').$$

Accordingly, Eq. (3.16) assumes the form

$$\langle v(x_1)v(x_2)\cdots v(x_{2n})\rangle = \sum_{\text{pairings of }\{x_1,\dots,x_{2n}\}} A^{-1}(x_{k_1},x_{k_2})\cdots A^{-1}(x_{k_{2n-1}},x_{k_{2n}}). \tag{3.21}$$


The generalization of the other Gaussian averaging formulae discussed above should be obvious. 

To make sense of Eq. (3.19) one must interpret the meaning of the determinant, det A. When 
the variables entering the Gaussian integral were discrete, the integral simply represented the 
determinant of the (real symmetric) matrix. In the present case, one must interpret A as a 
Hermitian operator having an infinite set of eigenvalues. The determinant simply represents the 
product over this infinite set (see, e.g., Section 3.3). 
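
To make the statement that $A^{-1}(x,x')$ is the Green function of $A$ concrete, the following sketch discretizes one particular operator on a lattice. The choice $A = -\partial_x^2 + \mu^2$, the box size, the grid, and the name `lattice_green` are assumptions made for illustration and do not come from the text. For this operator the Green function on the infinite line is $e^{-\mu|x-x'|}/(2\mu)$, and the rescaled matrix inverse should approach it away from the box boundaries. With lattice spacing $h$, the continuum kernel corresponds to the matrix divided by $h$, so that $\int dx'$ becomes $h\sum_j$ and Eq. (3.20) becomes $\delta_{ik}/h$.

```python
import numpy as np

def lattice_green(mu=1.0, length=20.0, n=401):
    """Grid x and the rescaled inverse of the lattice version of -d^2/dx^2 + mu^2."""
    x = np.linspace(-length / 2, length / 2, n)
    h = x[1] - x[0]
    off = np.ones(n - 1)
    laplacian = (np.diag(off, 1) + np.diag(off, -1) - 2.0 * np.eye(n)) / h**2
    a_matrix = -laplacian + mu**2 * np.eye(n)
    # Dividing by h converts the matrix inverse into a kernel A^{-1}(x_i, x_j).
    return x, np.linalg.inv(a_matrix) / h

x, green = lattice_green()
center = len(x) // 2                       # grid point x = 0
for offset in (0, 20, 40):                 # separations 0, 1 and 2 for h = 0.05
    separation = x[center + offset] - x[center]
    print(separation, green[center, center + offset], np.exp(-separation) / 2.0)
```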


Before turning to specific applications of the Feynman path integral, let us stay with the 
general structure of the formalism and identify two fundamental connections of the path 
integral to classical point mechanics and classical and quantum statistical mechanics. 


Path integral and statistical mechanics 

The path integral reveals a connection between quantum mechanics and classical (and 
quantum) statistical mechanics whose importance to all areas of field theory and statis- 
tical physics can hardly be exaggerated. To reveal this link, let us for a moment forget 
about quantum mechanics and consider, by way of an example, a perfectly classical, one- 
dimensional continuum model describing a “flexible string.” We assume that our string is 
held under constant tension, and confined to a “gutter-like potential” (as shown in Fig. 3.2). 
For simplicity, we also assume that the mass density of the string is pretty high, so that 
its fluctuations are “asymptotically slow” (the kinetic contribution to its energy is negligi- 
ble). Transverse fluctuations of the string are then penalized by its line tension, and by the 
external potential. 

Assuming that the transverse displacement of the string u(x) is small, the potential energy 
stored in the string separates into two parts. The first arises from the line tension stored in 
the string, and the second comes from the external potential. Starting with the former, a 
transverse fluctuation of a line segment of length dx by an amount du leads to a potential 
Figure 3.2 A string held under tension and confined to a potential well V.


energy of magnitude $\delta V_{\rm tension} = \sigma[(dx^2 + du^2)^{1/2} - dx] \simeq \sigma\,dx\,(\partial_x u)^2/2$, where $\sigma$ denotes the
tension. Integrating over the length of the string, one obtains $V_{\rm tension}[\partial_x u] \equiv \int\delta V_{\rm tension} =
\frac12\int_0^L dx\;\sigma(\partial_x u(x))^2$. The second contribution, arising from the external potential, is given
by $V_{\rm external}[u] \equiv \int_0^L dx\; V(u(x))$. Adding the two contributions, we find that the total energy
of the string is given by $V = V_{\rm tension} + V_{\rm external} = \int_0^L dx\,\left[\frac{\sigma}{2}(\partial_x u)^2 + V(u)\right]$.

EXERCISE Find an expression for the kinetic energy contribution assuming that the string has
a mass per unit length of $m$. How does this model compare to the continuum model of lattice
vibrations discussed in Chapter 1? Convince yourself that in the limit $m \to \infty$, the kinetic
contribution to the partition function $Z = \mathrm{tr}\,[e^{-\beta H}]$ is inessential.


According to the general principles of statistical mechanics, the equilibrium properties of a
system are encoded in the partition function $Z = \mathrm{tr}\,[e^{-\beta V}]$, where “tr” denotes a summation
over all possible configurations of the system and $V$ is the total potential energy functional.
Applied to the present case, $\mathrm{tr} \to \int Du$, where $\int Du$ stands for the functional integration
over all configurations of the string $u(x)$, $x \in [0,L]$. Thus, the partition function of the
string is given by

$$Z = \int Du\, \exp\left[-\beta\int_0^L dx\left(\frac{\sigma}{2}(\partial_x u)^2 + V(u)\right)\right]. \tag{3.22}$$


A comparison of this result with Eq. (3.8) shows that the partition function of the classical
system coincides with the quantum mechanical amplitude

$$Z = \int dq\; \langle q|e^{-it\hat H/\hbar}|q\rangle$$

evaluated at an imaginary “time” $t \to -i\tau \equiv -iL$, where $\hat H = \hat p^2/2\sigma + V(q)$, and Planck’s
constant is identified with the “temperature,” $\hbar = 1/\beta$. (Here we have assumed that our
string is subject to periodic boundary conditions.)

To see this explicitly, let us assume that we had reason to consider quantum propagation
in imaginary time, i.e. $e^{-it\hat H/\hbar} \to e^{-\tau\hat H/\hbar}$, or $t \to -i\tau$. Assuming convergence (i.e. positivity
of the eigenvalues of $\hat H$), a construction scheme perfectly analogous to the one outlined in
Section 3.1 would have led to a path integral formula of the structure (3.8). Formally, the
only differences would be (a) that the Lagrangian would be integrated along the imaginary
time axis $t' \to -i\tau' \in [0,-i\tau]$ and (b) that there would be a change of the sign of the kinetic
energy term, i.e. $(\partial_{t'}q)^2 \to -(\partial_{\tau'}q)^2$. After a suitable exchange of variables, $\tau \to L$, $\hbar \to 1/\beta$,
the coincidence of the resulting expression with the partition function (3.22) is clear.
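
Spelled out for the Lagrangian $L = m\dot q^2/2 - V(q)$, the two changes (a) and (b) combine as follows. This is a worked restatement of the argument above, with the identifications $m \to \sigma$ and $q \to u$ taken from the comparison with the string model.

```latex
\frac{i}{\hbar}\int_0^{t} dt'\left[\frac{m}{2}(\partial_{t'}q)^2 - V(q)\right]
\;\xrightarrow{\;t' = -i\tau'\;}\;
\frac{i}{\hbar}\,(-i)\int_0^{\tau} d\tau'\left[-\frac{m}{2}(\partial_{\tau'}q)^2 - V(q)\right]
= -\frac{1}{\hbar}\int_0^{\tau} d\tau'\left[\frac{m}{2}(\partial_{\tau'}q)^2 + V(q)\right].
% With tau -> L, hbar -> 1/beta, m -> sigma, q -> u, tau' -> x this is the exponent of Eq. (3.22):
% -beta * int_0^L dx [ (sigma/2) (d_x u)^2 + V(u) ].
```

The oscillatory weight $e^{iS/\hbar}$ thus turns into a real, exponentially damped weight, which is why the imaginary-time integral can be read as a classical Boltzmann sum.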

The connection between quantum mechanics and classical statistical mechanics outlined 
above generalizes to higher dimensions. There are close analogies between quantum field 
theories in d dimensions and classical statistical mechanics in d + 1. (The equality of the 
path integral above with the one-dimensional statistical model is merely the d = 0 version 
of this connection.) In fact, this connection turned out to be one of the major driving forces 
behind the success of path integral techniques in modern field theory /statistical mechanics. 
It offered, for the first time, a possibility to draw connections between systems that had 
seemed unrelated. 

However, the concept of imaginary times not only provides a bridge between quantum
and classical statistical mechanics, but also plays a role within a purely quantum mechanical
context. Consider the partition function of a single-particle quantum mechanical system,

$$Z = \mathrm{tr}\,[e^{-\beta\hat H}] = \int dq\; \langle q|e^{-\beta\hat H}|q\rangle.$$

The partition function can be interpreted as a trace over the transition amplitude
$\langle q|e^{-i\hat H t/\hbar}|q\rangle$ evaluated at an imaginary time $t = -i\hbar\beta$. Thus, real time dynamics and
quantum statistical mechanics can be treated on the same footing, provided that we allow
for the appearance of imaginary times.
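
The identification of the partition function with an imaginary-time path integral can be tested on a case with a known answer. The sketch below is an illustration under explicit assumptions that are not part of the text: a harmonic oscillator with $\hbar = m = \omega = 1$, a symmetric placement of the potential on neighboring time slices, a truncated coordinate grid, and the function name `oscillator_partition_function`. Each time slice contributes the kernel $\sqrt{m/2\pi\hbar\Delta\tau}\,\exp\{-[m(q-q')^2/2\Delta\tau + \Delta\tau(V(q)+V(q'))/2]/\hbar\}$, and the closed (periodic) path integral is the trace of the $N$-th power of this kernel. The exact result for comparison is $Z = 1/(2\sinh(\beta/2))$.

```python
import numpy as np

def oscillator_partition_function(beta, n_slices=64, q_max=6.0, n_grid=241):
    """Time-sliced imaginary-time path integral for H = p^2/2 + q^2/2 (hbar = m = omega = 1)."""
    q = np.linspace(-q_max, q_max, n_grid)
    dq = q[1] - q[0]
    dtau = beta / n_slices
    v = 0.5 * q**2
    kinetic = np.exp(-(q[:, None] - q[None, :]) ** 2 / (2.0 * dtau))
    potential = np.exp(-0.5 * dtau * (v[:, None] + v[None, :]))
    # One time slice, including the measure factor and the integration weight dq.
    slice_kernel = np.sqrt(1.0 / (2.0 * np.pi * dtau)) * kinetic * potential * dq
    # Trace of the n_slices-th power via the eigenvalues of the symmetric kernel.
    eigenvalues = np.linalg.eigvalsh(slice_kernel)
    return np.sum(eigenvalues**n_slices)

beta = 2.0
print(oscillator_partition_function(beta), 1.0 / (2.0 * np.sinh(beta / 2.0)))
```

The residual discrepancy comes from the finite time step and from the grid truncation; both can be reduced by increasing `n_slices`, `q_max`, and `n_grid`.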

Later we will see that the concept of imaginary or even generalized complex times plays an
important role in all of field theory. There is even some nomenclature regarding imaginary
times. The transformation $t \to -i\tau$ is described as a Wick rotation (alluding to the
fact that a multiplication by the imaginary unit can be interpreted as a $(\pi/2)$-rotation
in the complex plane). Imaginary time representations of Lagrangian actions are termed
Euclidean, whereas the real time forms are called Minkowski actions.

INFO The origin of this terminology can be understood by considering the structure of the
action of, say, the phonon model (1.4). Forgetting for a moment about the magnitude of the
coupling constants, we see that the action has the bilinear structure $\sim x_\mu g^{\mu\nu}x_\nu$, where $\mu = 0,1$,
the vector $x_\mu = \partial_\mu\phi$ and the diagonal matrix $g = \mathrm{diag}(-1,1)$ is the two-dimensional version
of a Minkowski metric. (In three spatial dimensions, $g$ would take the form of the standard
Minkowski metric of special relativity.) On Wick rotation of the time variable, the factor $-1$ in
the metric changes sign to $+1$, and $g$ becomes a positive definite Euclidean metric. The nature
of this transformation motivates the notation above.

Once one has grown accustomed to the idea that the interpretation of time as an imaginary 
quantity can be useful, yet more general concepts can be conceived. For example, one 
may contemplate propagation along temporal contours that are neither purely real nor 
purely imaginary but rather are generally complex. Indeed, it has turned out that path 
integrals with curvilinear integration contours in the complex “time plane” find numerous 
applications in statistical and quantum field theory. 
Semiclassics from the path integral

In deriving the two path integral representations (3.6) and (3.8) no approximations were 
made. Yet the vast majority of quantum mechanical problems are unsolvable in closed form, 
and it would be hoping for too much to expect that within the path integral approach this 
situation would be any different. In fact no more than the path integrals of problems with a 
quadratic Hamiltonian - corresponding to the quantum mechanical harmonic oscillator and 
generalizations thereof - can be carried out in closed form. Yet what counts more than the 
(rare) availability of exact solutions is the flexibility with which approximation schemes can 
be developed. As to the path integral formulation, it is particularly strong in cases where 
semiclassical limits of quantum theories are explored. Here, by “semiclassical,” we
mean the limit $\hbar \to 0$, i.e. the case where the theory is expected to be largely governed by
classical structures with quantum fluctuations superimposed.

To see more formally how classical structures enter the path integral approach, let us
explore Eq. (3.6) and (3.8) in the limit of small $\hbar$. In this case the path integrals are
dominated by path configurations with stationary action. (Non-stationary contributions to
the integral imply massive phase fluctuations that largely average to zero.) Now, since the
exponents of the two path integrals (3.6) and (3.8) involve the classical action functionals
in their Hamiltonian and Lagrangian forms respectively, the extremal path configurations
are simply the solutions of the classical equations of motion, namely,

$$\begin{aligned}
\text{Hamiltonian:}\quad & \delta S[x] = 0 \;\Rightarrow\; d_t x = \{H(x),x\} \equiv \partial_p H\,\partial_q x - \partial_q H\,\partial_p x,\\
\text{Lagrangian:}\quad & \delta S[q] = 0 \;\Rightarrow\; \left(d_t\partial_{\dot q} - \partial_q\right)L(q,\dot q) = 0.
\end{aligned} \tag{3.23}$$

These equations are to be solved subject to the boundary conditions $q(0) = q_i$ and $q(t) = q_f$.
(Note that these boundary conditions do not uniquely specify a solution, i.e. in general there
are many solutions to the equations (3.23). As an exercise, one may try to invent examples!)

Now the very fact that the stationary phase configurations are classical does not imply 
that quantum mechanics has disappeared completely. As with saddle-point approximations 
in general, it is not just the saddle-point itself that matters but also the fluctuations around 
it. At least it is necessary to integrate out Gaussian (quadratic) fluctuations around the point 
of stationary phase. In the case of the path integral, fluctuations of the action around the 
stationary phase configurations involve non-classical (in that they do not solve the classical 
equations of motion) trajectories through phase or coordinate space. Before exploring how 
this mechanism works in detail, let us consider the stationary phase analysis of functional 
integrals in general. 

INFO Stationary phase approximation: Consider a general functional integral $\int Dx\; e^{-F[x]}$
where $Dx = \lim_{N\to\infty}\prod_{n=1}^N dx_n$ represents a functional measure resulting from taking the
continuum limit of some finite-dimensional integration space, and the “action” $F[x]$ may be an
arbitrary complex functional of $x$ (leading to convergence of the integral). The function resulting
from taking the limit of infinitely many discretization points, $\{x_n\}$, is denoted by $x : t \mapsto x(t)$
(where $t$ plays the role of the formerly discrete index $n$). Evaluating the integral above within a
stationary phase approximation amounts to performing the following steps:
Figure 3.3 Quantum fluctuations around a classical path in coordinate space (here we assume a
set of two-dimensional coordinates). Non-classical paths $q$ fluctuating around a classical solution
$q_{\rm cl}$ typically extend a distance $O(\hbar^{1/2})$. All paths begin and end at $q_i$ and $q_f$, respectively.


1. Firstly, find the “points” of stationary phase, i.e. configurations $\bar x$ qualified by the condition
of vanishing functional derivative (cf. Section 1.2),

$$DF_{\bar x} = 0 \;\Leftrightarrow\; \forall t:\; \frac{\delta F[x]}{\delta x(t)}\bigg|_{x=\bar x} = 0.$$

Although there may, in principle, be one or many solutions, for clarity we first discuss the
case in which the stationary phase configuration $\bar x$ is unique.

2. Secondly, Taylor expand the functional to second order around $\bar x$, i.e.

$$F[x] = F[\bar x + y] = F[\bar x] + \frac12\int dt\int dt'\; y(t')A(t,t')y(t) + \cdots, \tag{3.24}$$

where $A(t,t') = \frac{\delta^2 F[x]}{\delta x(t)\,\delta x(t')}\Big|_{x=\bar x}$ denotes the second functional derivative. Due to the
stationarity of $\bar x$, no first-order contribution can appear.

3. Thirdly, check that the operator $\hat A \equiv \{A(t,t')\}$ is positive definite. If it is not, there is a
problem - the integration over the Gaussian fluctuations $y$ diverges. (In practice, where the
analysis is rooted in a physical context, such eventualities arise only rarely. In situations
where problems do occur, the resolution can usually be found in a judicious rotation of the
integration contour.) For positive definite $\hat A$, however, the functional integral over $y$ can be
performed, after which one obtains $\int Dx\; e^{-F[x]} \simeq e^{-F[\bar x]}\det\left(\frac{\hat A}{2\pi}\right)^{-1/2}$ (cf. the discussion of
Gaussian integrals above and, in particular, Eq. (3.19)).

4. Finally, if there are many stationary phase configurations, $\bar x_i$, the individual contributions
have to be added:

$$\int Dx\; e^{-F[x]} \simeq \sum_i e^{-F[\bar x_i]}\det\left(\frac{\hat A_i}{2\pi}\right)^{-1/2}. \tag{3.25}$$

Equation (3.25) represents the most general form of the stationary phase evaluation of a (real)
functional integral.

EXERCISE Applied to the Gamma function, $\Gamma(z+1) = \int_0^\infty dx\; x^z e^{-x}$, with $z$ complex, show
that the stationary phase approximation is consistent with Stirling’s approximation, i.e.
$\Gamma(z+1) = \sqrt{2\pi z}\; e^{z(\ln z - 1)}$.
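
A numerical sanity check of this one-dimensional example is sketched below, restricted to real positive $z$ as a simplifying assumption; the function name `gamma_stationary_phase` is hypothetical. Writing the integrand as $e^{-F(x)}$ with $F(x) = x - z\ln x$, the stationary point is $\bar x = z$ and the second derivative there is $F''(\bar x) = 1/z$, so the steps above give $e^{-F(\bar x)}\sqrt{2\pi/F''(\bar x)}$.

```python
import math

def gamma_stationary_phase(z):
    """Stationary phase estimate of Gamma(z + 1) for real z > 0."""
    x_bar = z                               # solves F'(x) = 1 - z / x = 0
    f_bar = x_bar - z * math.log(x_bar)     # F evaluated at the stationary point
    f_second = z / x_bar**2                 # F''(x_bar) = 1 / z
    return math.exp(-f_bar) * math.sqrt(2.0 * math.pi / f_second)

for z in (5.0, 10.0, 50.0):
    # The ratio approaches 1 as z grows; the leading correction is of relative order 1/(12 z).
    print(z, gamma_stationary_phase(z) / math.gamma(z + 1.0))
```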
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Applied to the Lagrangian form of the Feynman path integral, this program can be imple- 
mented directly. In this case, the extremal field configuration q(t ) is identified as the classical 
solution associated with the Lagrangian, i.e. q(t) = q c \(t). Defining r{t) = q{t) — q c \{t) as 
the deviation of a general path, q(t), from a nearby classical path, q c \(t) (see Fig. 3.3), and 
assuming for simplicity that there exists only one classical solution connecting qi with qf in 
a time t, a stationary phase analysis obtains 


<s e iS ^ n 


Dr exp 

(0)=r(t)=0 



dt"r{t!) 


S 2 S[q] 

6q(t') 6q{t") 


r(t") 

Q=qa 


(3.26) 


as the Gaussian approximation to the path integral (cf. Eq. (3.24)). For free Lagrangians
of the form $L(q,\dot{q}) = m\dot{q}^2/2 - V(q)$, the second functional derivative of the action can be
straightforwardly computed by means of the rules of functional differentiation formulated
in Chapter 1. Alternatively, one can obtain this result by simply expanding the action as a
Taylor series in the deviation r(t). As a result, one obtains (exercise)

$$ \frac{1}{2} \int_0^t dt' \int_0^t dt''\, r(t') \left. \frac{\delta^2 S[q]}{\delta q(t')\, \delta q(t'')} \right|_{q=q_{\rm cl}} r(t'') = -\frac{1}{2} \int_0^t dt'\, r(t') \left[ m\partial_{t'}^2 + V''(q_{\rm cl}(t')) \right] r(t'), \tag{3.27} $$

where $V''(q_{\rm cl}(t)) \equiv \partial_q^2 V(q)|_{q=q_{\rm cl}}$ represents an ordinary derivative of the potential function.
Thus, the Gaussian integration over r yields the square root of the determinant of the
operator $m\partial_t^2 + V''(q_{\rm cl}(t))$ - interpreted as an operator acting in the space of functions r(t)
with boundary conditions r(0) = r(t) = 0. (Note that, as we are dealing with a differential
operator, the issue of boundary conditions is crucial.)
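The Taylor-expansion route to the quadratic action can be sketched in a few lines. This is only an outline, assuming a Lagrangian of the form $L = m\dot{q}^2/2 - V(q)$ and fluctuations that vanish at the endpoints, r(0) = r(t) = 0, so that all boundary terms from integration by parts drop out.

```latex
\begin{align*}
S[q_{\rm cl} + r]
  &= \int_0^t dt' \left[ \frac{m}{2}(\dot{q}_{\rm cl} + \dot{r})^2 - V(q_{\rm cl} + r) \right] \\
  &= S[q_{\rm cl}]
   + \int_0^t dt' \left[ m\dot{q}_{\rm cl}\dot{r} - V'(q_{\rm cl})\, r \right]
   + \int_0^t dt' \left[ \frac{m}{2}\dot{r}^2 - \frac{1}{2} V''(q_{\rm cl})\, r^2 \right]
   + O(r^3). \\
\intertext{Integrating the first-order term by parts gives
$-\int_0^t dt'\, r \left[ m\ddot{q}_{\rm cl} + V'(q_{\rm cl}) \right] = 0$
by the classical equation of motion. Integrating the kinetic part of the second-order term by parts,}
\int_0^t dt' \left[ \frac{m}{2}\dot{r}^2 - \frac{1}{2} V''(q_{\rm cl})\, r^2 \right]
  &= -\frac{1}{2} \int_0^t dt'\, r(t') \left[ m\partial_{t'}^2 + V''(q_{\rm cl}(t')) \right] r(t').
\end{align*}
```

The last line is the right-hand side of the quadratic form quoted in the text.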


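INFO More generally, Gaussian integration over fluctuations around the stationary phase
configuration obtains the formal expression

$$ \langle q_f | e^{-i\hat{H}t/\hbar} | q_i \rangle \simeq \det\left( \frac{i}{2\pi\hbar} \frac{\partial^2 S[q_{\rm cl}]}{\partial q_i\, \partial q_f} \right)^{1/2} e^{\frac{i}{\hbar} S[q_{\rm cl}]} \tag{3.28} $$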


as the final result for the transition amplitude evaluated in the semiclassical approxima- 
tion. (In cases where there is more than one classical solution, the individual contributions have 
to be added.) To derive this expression, one shows that the operator controlling the quadratic 
action (3.27) fulfils some differential relations which can again be related back to the classical 
action. While a detailed formulation of this calculation 9 is beyond the scope of the present text, 
the heuristic interpretation of the result is straightforward, as detailed below. 

According to the rules of quantum mechanics, $P(q_f, q_i, t) = |\langle q_f | e^{-i\hat{H}t/\hbar} | q_i \rangle|^2$ defines the
probability density function for a particle injected at coordinate $q_i$ to arrive at coordinate $q_f$ after
a time t. In the semiclassical approximation, the probability density function assumes the form
$P(q_f, q_i, t) = \left| \det\left( \frac{1}{2\pi\hbar} \frac{\partial^2 S[q_{\rm cl}]}{\partial q_f\, \partial q_i} \right) \right|$. We can gain some physical insight into this
expression from the following consideration: for a fixed initial coordinate $q_i$, the final coordinate
$q_f(q_i, p_i)$ becomes a function of the initial momentum $p_i$. The classical probability density function
$P(q_i, q_f)$ can then be related to the probability density function $P(q_i, p_i)$ for a particle to leave
from the initial phase space coordinate $(q_i, p_i)$ according to

$$ P(q_i, q_f)\, dq_i\, dq_f = P(q_i, q_f(q_i, p_i)) \left| \det\left( \frac{\partial q_f}{\partial p_i} \right) \right| dq_i\, dp_i = P(q_i, p_i)\, dq_i\, dp_i. $$

Now, if we say that our particle actually left at the phase space coordinate $(q_i, p_i)$, P must be
singular at $(q_i, p_i)$ while being zero everywhere else. In quantum mechanics, however, all we
can say is that our particle was initially confined to a Planck cell centered around $(q_i, p_i)$:
$P(q_i, p_i) = 1/(2\pi\hbar)^d$. We thus conclude that $P(q_i, q_f) = |\det(\partial p_i/\partial q_f)| (2\pi\hbar)^{-d}$. Finally, noticing
that $p_i = -\partial_{q_i} S$, we arrive at the result of the semiclassical analysis above.

9 See, e.g., Schulman 1 .

In deriving Eq. (3.28) we have restricted ourselves to the consideration of quadratic fluctuations
around the classical paths. Under what conditions is this semiclassical approximation justified?
Unfortunately there is no rigorous and generally applicable answer to this question. For finite
$\hbar$, the quality of the approximation depends largely on the sensitivity of the action to path
variations. Whether or not the approximation is legitimate is a question that has to be judged
from case to case. However, the asymptotic stability of the semiclassical approximation in the
limit $\hbar \to 0$ can be deduced simply from power counting. From the structure of Eq. (3.28) it
is clear that the typical magnitude of fluctuations r(t) scales as $r \sim (\hbar/\delta_q^2 S)^{1/2}$, where $\delta_q^2 S$ is
a symbolic shorthand for the functional variation of the action. (Variations larger than that
lead to phase fluctuations $> 2\pi$, thereby being negligible.) Non-Gaussian contributions to the
action would have the structure $\sim \hbar^{-1} r^n \delta_q^n S$, $n > 2$. For a typical r, this is of the order
$\sim \delta_q^n S/(\delta_q^2 S)^{n/2} \times \hbar^{n/2-1}$. Since the S-dependent factors are classical ($\hbar$-independent), these
contributions scale to zero as $\hbar \to 0$.



This concludes the conceptual part of the chapter. Before turning to the discussion of 
applications of the path integral, let us first briefly summarize the main steps taken in its 
construction. 


Construction recipe of the path integral 

Consider a general quantum transition amplitude $\langle \psi | e^{-i\hat{H}t/\hbar} | \psi' \rangle$, where t may be real,
purely imaginary, or generally complex. To construct a functional integral representation of
the amplitude:

1. Partition the time interval into $N \gg 1$ steps,

$$ e^{-i\hat{H}t/\hbar} = \left[ e^{-i\hat{H}\Delta t/\hbar} \right]^N, \qquad \Delta t = t/N. $$

2. Regroup the operator content appearing in the expansion of each factor $e^{-i\hat{H}\Delta t/\hbar}$ according
to the relation

$$ e^{-i\hat{H}\Delta t/\hbar} = 1 + \Delta t \sum_{mn} c_{mn} \hat{A}^m \hat{B}^n + O(\Delta t^2), $$

where the eigenstates $|a\rangle$, $|b\rangle$ of $\hat{A}$, $\hat{B}$ are known and the coefficients $c_{mn}$ are c-numbers.
(In the quantum mechanical application above $\hat{A} = \hat{p}$, $\hat{B} = \hat{q}$.) This “normal ordering”
procedure emphasizes that distinct quantum mechanical systems may be associated with
the same classical action.

3. Insert resolutions of identity according to

$$ e^{-i\hat{H}\Delta t/\hbar} = \sum_{a,b} |a\rangle\langle a| \left( 1 + \Delta t \sum_{mn} c_{mn} \hat{A}^m \hat{B}^n + O(\Delta t^2) \right) |b\rangle\langle b| = \sum_{a,b} |a\rangle\langle a| e^{-iH(a,b)\Delta t/\hbar} |b\rangle\langle b| + O(\Delta t^2), $$

where H(a, b) is the Hamiltonian evaluated at the eigenvalues of $\hat{A}$ and $\hat{B}$.

4. Regroup terms in the exponent: due to the “mismatch” of the eigenstates at neighboring 
time slices n and n+ 1, not only the Hamiltonians H(a , b), but also sums over differences 
of eigenvalues, appear (cf. the last term in the action (3.5)). 

5. Take the continuum limit. 
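The recipe can be tested numerically by stopping short of the continuum limit. The sketch below is one possible illustration, not the construction used in the text: it assumes imaginary time (t = -i tau), a harmonic potential with m = omega = hbar = 1, a symmetric splitting of the potential term between neighboring slices, and a finite coordinate grid. All function names and parameter values are hypothetical. The sliced amplitude is compared with the known closed form $\sqrt{m\omega/(2\pi\hbar\sinh\omega\tau)}$.

```python
import math
import numpy as np

def sliced_propagator_at_origin(tau, omega=1.0, m=1.0, hbar=1.0,
                                n_slices=64, q_max=6.0, n_grid=241):
    """Imaginary-time amplitude G_E(0, 0; tau) from N time slices on a grid.

    Assumptions of this sketch: V(q) = m*omega^2*q^2/2, symmetric splitting
    of V between neighboring slices, and integration by a Riemann sum on an
    odd-sized grid so that q = 0 is a grid point.
    """
    q = np.linspace(-q_max, q_max, n_grid)
    dq = q[1] - q[0]
    d_tau = tau / n_slices
    v = 0.5 * m * omega**2 * q**2
    # Short-time kernel between neighboring slices (steps 2 and 3 of the recipe)
    kinetic = m * (q[:, None] - q[None, :])**2 / (2.0 * d_tau)
    potential = 0.5 * d_tau * (v[:, None] + v[None, :])
    kernel = math.sqrt(m / (2.0 * math.pi * hbar * d_tau)) * np.exp(-(kinetic + potential) / hbar)
    # N kernels and N - 1 intermediate integrations: (K dq)^N / dq
    full = np.linalg.matrix_power(kernel * dq, n_slices) / dq
    mid = n_grid // 2
    return float(full[mid, mid])

if __name__ == "__main__":
    tau = 2.0
    exact = math.sqrt(1.0 / (2.0 * math.pi * math.sinh(tau)))
    print("sliced:", sliced_propagator_at_origin(tau), " closed form:", exact)
```

Increasing `n_slices` at fixed `tau` corresponds to step 5; the grid spacing has to stay small compared with the width of the short-time kernel for the Riemann sum to remain accurate.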


3.3 Applications of the Feynman path integral 


Having introduced the general machinery of path integration we now turn to the discussion 
of specific applications. Our starting point will be an investigation of a low-energy quantum 
particle confined to a single potential well, and the phenomenon of tunneling in a double 
well. With the latter, we become acquainted with instanton techniques and the role of 
topology in field theory. The ideas developed in this section are generalized further in the 
investigation of quantum mechanical decay and quantum dissipation. Finally, we turn our 
attention to the development of the path integral for quantum mechanical spin and, as a 
case study, explore the semiclassical trace formulae for quantum chaos. 

The simplest example of a quantum mechanical problem is that of a free particle
($\hat{H} = \hat{p}^2/2m$). Yet, within the framework of the path integral, this example, which can be
dealt with straightforwardly by elementary means, is far from trivial: the Gaussian functional
integral engaged in its construction involves divergences which must be regularized
by rediscretizing the path integral. Nevertheless, its knowledge will be useful as a means to
normalize the path integral in the applications below. Therefore, we leave it as an exercise
to show 10

$$ G_{\rm free}(q_f, q_i; t) \equiv \langle q_f | e^{-\frac{i}{\hbar} \frac{\hat{p}^2}{2m} t} | q_i \rangle\, \Theta(t) = \left( \frac{m}{2\pi i\hbar t} \right)^{1/2} e^{\frac{i}{\hbar} \frac{m}{2t} (q_f - q_i)^2}\, \Theta(t), \tag{3.29} $$

where the Heaviside $\Theta$-function reflects causality. 11

10 Compare this result with the solution of a classical diffusion equation.

EXERCISE Derive Eq. (3.29) by the standard methodology of quantum mechanics. (Hint: Insert
a resolution of identity and perform a Gaussian integral.)

EXERCISE Using the path integral, obtain a perturbative expansion for the scattering amplitude
$\langle p' | U(t \to \infty, t' \to -\infty) | p \rangle$ of a free particle from a short-range central potential V(r). In
particular, show that the first-order term in the expansion recovers the Born scattering amplitude
$-i\hbar\, e^{-i(t-t')E(p)/\hbar}\, \delta(E(p) - E(p'))\, \langle p' | V | p \rangle$.


Quantum particle in a well 

As a first application of the path integral, let us consider the problem of a quantum particle 
in a one-dimensional potential well (see figure). The discussion of this example illustrates 
how the semiclassical evaluation scheme discussed above works in practice. For simplicity 
we assume the potential to be symmetric, V(q) = V(-q), with V(0) = 0. The quantity we
wish to compute is the probability amplitude that a particle injected at q = 0 returns after
a time t, i.e. with $\hat{H} = \hat{p}^2/2m + V(\hat{q})$, $G(0,0;t) \equiv \langle q_f = 0 | e^{-i\hat{H}t/\hbar} | q_i = 0 \rangle\, \Theta(t)$. Drawing
on our previous discussion, the path integral representation of the transition amplitude is
given by

$$ G(0,0;t) = \int_{q(t)=q(0)=0} Dq\, \exp\left( \frac{i}{\hbar} \int_0^t dt'\, L(q,\dot{q}) \right), $$

where $L = m\dot{q}^2/2 - V(q)$ represents the corresponding Lagrangian.

Now, for a generic potential V(q), the path integral cannot be evaluated exactly. Instead, we wish
to invoke the semiclassical analysis outlined above. Accordingly, we must first find solutions to
the classical equation of motion. Extremizing the action with respect to variations of q(t), one
obtains the Euler-Lagrange equation of motion $m\ddot{q} = -V'(q)$. According to the Feynman path
integral, this equation must be solved subject to the boundary conditions q(t) = q(0) = 0. One
solution is obvious, namely $q_{\rm cl}(t) = 0$. Assuming that this is in fact the only solution, 12 we
obtain (cf. Eqs. (3.26) and (3.27))

$$ G(0,0;t) \simeq \int_{r(0)=r(t)=0} Dr\, \exp\left[ -\frac{i}{\hbar} \int_0^t dt'\, r(t') \frac{m}{2} \left( \partial_{t'}^2 + \omega^2 \right) r(t') \right], $$

where, by definition, $m\omega^2 \equiv V''(0)$ is the second derivative of the potential at the origin. 13
Note that, in this case, the contribution to the action from the stationary phase field
configuration vanishes: $S[q_{\rm cl}] = 0$. Following the discussion of Section 3.2, Gaussian functional
integration over r then leads to the semiclassical expansion

$$ G(0,0;t) \simeq J \det\left( -m(\partial_t^2 + \omega^2)/2 \right)^{-1/2}, \tag{3.30} $$

where the prefactor J absorbs various constant prefactors.

11 Motivated by its interpretation as a Green function, in the following we refer to the quantum transition
probability amplitude by the symbol G (as opposed to U used above).

12 In general, this assumption is wrong. For smooth potentials V(q), a Taylor expansion of V at small q obtains
the harmonic oscillator potential, $V(q) = V_0 + m\omega^2 q^2/2 + \cdots$. For times t that are commensurate with $\pi/\omega$,
one has periodic solutions, $q_{\rm cl}(t) \propto \sin(\omega t)$, that start out from the origin at time t = 0 and revisit it at just
the right time t. In the next section we will see why the restriction to just the trivial solution was nonetheless
legitimate (for arbitrary times t).

Operator determinants are usually most conveniently obtained by presenting them as a
product over eigenvalues. In the present case, the eigenvalues $\epsilon_n$ are determined by the
equation

$$ -\frac{m}{2} \left( \partial_t^2 + \omega^2 \right) r_n = \epsilon_n r_n, $$

which is to be solved subject to the boundary condition $r_n(t) = r_n(0) = 0$. A complete set of
solutions to this equation is given by, 14 $r_n(t') = \sin(n\pi t'/t)$, n = 1, 2, ..., with eigenvalues
$\epsilon_n = m[(n\pi/t)^2 - \omega^2]/2$. Applying this to the determinant, one finds

$$ \det\left( -\frac{m}{2} (\partial_t^2 + \omega^2) \right)^{-1/2} = \prod_{n=1}^{\infty} \left[ \frac{m}{2} \left( \left( \frac{n\pi}{t} \right)^2 - \omega^2 \right) \right]^{-1/2}. $$


To interpret this result, one must make sense of the infinite product (which even seems
divergent for times commensurate with $\pi/\omega$). Moreover the value of the constant J has
yet to be determined. To resolve these difficulties, one may exploit the facts that (a) we
do know the transition amplitude Eq. (3.29) of the free particle system, and (b) the latter
coincides with the transition amplitude G in the special case where the potential V = 0.
In other words, had we computed $G_{\rm free}$ via the path integral, we would have obtained the
same constant J and, more importantly, an infinite product like the one above, but with
$\omega = 0$. This allows the transition amplitude to be regularized as


$$ G(0,0;t) \equiv \frac{G(0,0;t)}{G_{\rm free}(0,0;t)}\, G_{\rm free}(0,0;t) = \prod_{n=1}^{\infty} \left[ 1 - \left( \frac{\omega t}{n\pi} \right)^2 \right]^{-1/2} \left( \frac{m}{2\pi i\hbar t} \right)^{1/2} \Theta(t). $$

Then, with the help of the identity $\prod_{n=1}^{\infty} [1 - (x/n\pi)^2]^{-1} = x/\sin x$, one finally arrives at
the result

$$ G(0,0;t) \simeq \sqrt{ \frac{m\omega}{2\pi i\hbar \sin(\omega t)} }\; \Theta(t). \tag{3.31} $$

In the case of the harmonic oscillator, the expansion of the potential necessarily truncates
at quadratic order and, in this case, the expression above is exact. (For a more wide-ranging
discussion of the path integral for the quantum harmonic oscillator system, see
Problem 3.5.) For a general potential, the semiclassical approximation effectively involves
the replacement of V(q) by a quadratic potential with the same curvature. The calculation
above also illustrates how coordinate space fluctuations around a completely static solution
may reinstate the zero-point fluctuations characteristic of quantum mechanical bound states.

13 Those who are uncomfortable with functional differentiation can arrive at the same expression simply by
substituting $q(t) = q_{\rm cl}(t) + r(t)$ into the action and expanding in r.

14 To find the solutions of this equation, recall the structure of the Schrödinger equation of a particle in a
one-dimensional box of width L = t.
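The product identity used in the regularization, $\prod_{n} [1 - (x/n\pi)^2]^{-1} = x/\sin x$, is easy to check numerically. The following is a small sketch with an illustrative function name; it truncates the product at a finite `n_max`, so agreement is only up to a correction of relative order $x^2/(\pi^2 n_{\max})$.

```python
import math

def partial_product(x: float, n_max: int) -> float:
    """Truncated product prod_{n=1}^{n_max} [1 - (x/(n*pi))^2]^(-1)."""
    prod = 1.0
    for n in range(1, n_max + 1):
        prod /= 1.0 - (x / (n * math.pi)) ** 2
    return prod

if __name__ == "__main__":
    for x in (0.5, 1.0, 4.0):
        print(f"x = {x}: product = {partial_product(x, 100_000):.6f}, "
              f"x/sin(x) = {x / math.sin(x):.6f}")
```

With x standing for omega*t, individual factors diverge when x approaches a multiple of pi, which reproduces the apparent divergence at times commensurate with pi/omega mentioned in the text; the sign change for x = 4 comes from the n = 1 factor turning negative.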


Double well potential: tunneling and instantons 

As a second application of the path integral let us now consider the motion of a particle in a
double well potential (see figure). Our aim will be to estimate the quantum probability amplitude
for a particle either to stay at the bottom of one of the local minima or to go from one minimum
to the other. In doing so, it is understood that the energy range accessible to the particle
(i.e. $\Delta E \sim \hbar/t$) is well below the potential barrier height, i.e. quantum mechanical transfer
between minima is by tunneling. Here, in contrast to the single well system, it is far from clear
what kind of classical stationary phase solutions may serve as a basis for a description of the
quantum dynamics; there appear to be no classical paths connecting the two minima. Of course one
may think of particles “rolling” over the potential hill. Yet, these are singular and, by
assumption, energetically inaccessible.

The key to resolving these difficulties is an observation, already made above, that the 
time argument appearing in the path integral should be considered as a general complex 
quantity that can (according to convenience) be set to any value in the complex plane. In 
the present case, a Wick rotation to imaginary times will reveal a stationary point of the 
action. At the end of the calculation, the real time amplitudes we seek can be obtained by 
analytic continuation. 

INFO The mechanism of quantum double (or multiple) well tunneling plays a role in a number
of problems of condensed matter physics. A prominent example is in the physics of amorphous
solids such as glasses.

A caricature of a glass is shown in the figure. The absence of long-range order in the system
implies that individual chemical bonds cannot assume their optimal binding lengths. For
understretched bonds this leads to the formation of two approximately equal metastable minima
around the ideal binding axis (see the inset). The energetically lowest excitations of the system
are transitions of individual atoms between nearly degenerate minima of this type, i.e. flips of
atoms around the binding axis. A prominent phenomenological model 15 describes the system by an
ensemble of quantum double wells of random center height and width. This model effortlessly
explains the existence of a vast system of metastable points in the landscape of low-energy
configurations of glassy systems.


To be specific, let us consider the imaginary time transition amplitudes

$$ G_E(a, \pm a; \tau) \equiv \langle a | \exp\left( -\frac{\tau}{\hbar} \hat{H} \right) | \pm a \rangle = G_E(-a, \mp a; \tau), \tag{3.32} $$

where the coordinates $\pm a$ coincide with the two minima of the potential. From Eq. (3.32)
the real time amplitudes $G(a, \pm a; t) = G_E(a, \pm a; \tau \to it)$ can be recovered by the analytic
continuation $\tau \to it$. According to Section 3.2, the Euclidean path integral formulation
of the transition amplitudes is given by

$$ G_E(a, \pm a; \tau) = \int_{q(0)=\pm a,\; q(\tau)=a} Dq\, \exp\left[ -\frac{1}{\hbar} \int_0^\tau d\tau' \left( \frac{m}{2} \dot{q}^2 + V(q) \right) \right], \tag{3.33} $$

where the function q now depends on imaginary time. From Eq. (3.33) we obtain the
stationary phase (or saddle-point) equations

$$ -m\ddot{q} + V'(q) = 0. \tag{3.34} $$


From this result, one can infer that, as a consequence of the Wick rotation, there is an
effective inversion of the potential, $V \to -V$ (shown dashed in the figure on page 115). The
crucial point is that, within the inverted potential landscape, the barrier has become a sink,
i.e. within the new formulation, there are classical solutions connecting the two points, $\pm a$.
More precisely, there are three different types of classical solution that fulfill the condition
to be at coordinates $\pm a$ at times 0 and/or $\tau$: (a) the solution wherein the particle rests
permanently at a; 16 (b) the corresponding solution staying at -a; and, most importantly,
(c) the solution in which the particle leaves its initial position at $\pm a$, accelerates through
the minimum at 0 and eventually reaches the final position $\mp a$ at time $\tau$. In computing
the transition amplitudes, all three types of path have to be taken into account. As to
(a) and (b), by computing quantum fluctuations around these solutions, one can recover
the physics of the zero-point motion described in Section 3.3 for each well individually.
(Exercise: Convince yourself that this is true!) Now let us see what happens if the paths
connecting the two coordinates are added to this picture.


15 P. W. Anderson, B. I. Halperin, and C. M. Varma, Anomalous low-temperature thermal properties of glasses
and spin glasses, Phil. Mag. 25 (1972), 1-9.

16 Note that the potential inversion answers a question that arose above, i.e. whether or not the classical solution
staying at the bottom of the single well was actually the only one to be considered. As with the double well,
we could have treated the single well within an imaginary time representation, whereupon the well would have
become a hill. Clearly, the boundary condition requires the particle to start and finish at the top of the hill, i.e.
the solution that stays there forever. By formulating the semiclassical expansion around that path, we would
have obtained Eq. (3.31) with $t \to -i\tau$, which, upon analytic continuation, would have led back to the real
time result.

The instanton gas

The classical solution of the Euclidean equation of motion that connects the two potential
maxima is called an instanton solution, while a solution traversing the same path
but in the opposite direction (“$-a \to a$” replaced by “$a \to -a$”) is called an anti-instanton. The
name “instanton” was invented by ’t Hooft with the idea that these objects are very similar
in their mathematical structure to “solitons,” particle-like solutions of classical field
theories. However, unlike solitons, they are structures in time (albeit Euclidean time); thus
the “instant-.” As another etymological remark, note that the syllable “-on” in “instanton”
hints at an interpretation of these states as a kind of particle. The background is that, as a
function of the time coordinate, instantons are almost everywhere constant save for a short
region of variation (see below). Alluding to the interpretation of time as something akin to a
spatial dimension, these states can be interpreted as a well-localized excitation or, according
to standard field theoretical practice, a particle. 17

To proceed, we must first compute the classical action associated with a single-instanton
solution. Multiplying Eq. (3.34) by $\dot{q}_{\rm cl}$, integrating over time (i.e. performing the first
integral of the equation of motion), and using the fact that, at $q_{\rm cl} = \pm a$, $\partial_\tau q_{\rm cl} = V = 0$, one
finds that

$$ \frac{m}{2} \dot{q}_{\rm cl}^2 = V(q_{\rm cl}). \tag{3.35} $$

With this result, one obtains the instanton action

$$ S_{\rm inst} = \int_0^\tau d\tau' \left( \frac{m}{2} \dot{q}_{\rm cl}^2 + V(q_{\rm cl}) \right) = \int_0^\tau d\tau'\, \frac{dq_{\rm cl}}{d\tau'} (m\dot{q}_{\rm cl}) = \int_{-a}^{a} dq\, (2mV(q))^{1/2}. \tag{3.36} $$

Notice that $S_{\rm inst}$ is determined solely by the functional profile of the potential V (i.e. does
not depend on the structure of the solution $q_{\rm cl}$).
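The equality of the two forms of the instanton action can be illustrated on a concrete example. The sketch below assumes a quartic double well, $V(q) = \lambda (q^2 - a^2)^2$, which is not specified in the text; for this choice the first integral $m\dot{q}^2/2 = V$ is solved by $q_{\rm cl}(\tau) = a \tanh(\omega\tau/2)$ with $m\omega^2 = V''(\pm a) = 8\lambda a^2$. All names and parameter values are illustrative.

```python
import numpy as np

def trapezoid(y, x):
    """Composite trapezoidal rule, written out explicitly."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

def quartic_potential(q, lam, a):
    """Assumed model potential V(q) = lam * (q^2 - a^2)^2."""
    return lam * (q**2 - a**2) ** 2

def action_from_potential(lam, a, m=1.0, n=2001):
    """S_inst as the integral of sqrt(2 m V(q)) over q from -a to a."""
    q = np.linspace(-a, a, n)
    return trapezoid(np.sqrt(2.0 * m * quartic_potential(q, lam, a)), q)

def action_from_profile(lam, a, m=1.0, n=4001):
    """S_inst as the Euclidean action of the profile a*tanh(omega*tau/2)."""
    omega = np.sqrt(8.0 * lam * a**2 / m)
    tau = np.linspace(-20.0 / omega, 20.0 / omega, n)   # instanton centered at 0
    q_cl = a * np.tanh(0.5 * omega * tau)
    q_dot = np.gradient(q_cl, tau)                      # numerical derivative
    lagrangian = 0.5 * m * q_dot**2 + quartic_potential(q_cl, lam, a)
    return trapezoid(lagrangian, tau)

if __name__ == "__main__":
    lam, a = 1.0, 1.0
    print("from potential :", action_from_potential(lam, a))
    print("from profile   :", action_from_profile(lam, a))
    print("closed form    :", 4.0 / 3.0 * a**3 * np.sqrt(2.0 * lam))
```

For this potential the integral over q can be done by hand, giving $S_{\rm inst} = \frac{4}{3} a^3 \sqrt{2m\lambda}$, so the two numerical values can be compared against a closed form.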

Secondly, let us explore the structure of the instanton as a function of time. Defining the
second derivative of the potential at $\pm a$ by $V''(\pm a) = m\omega^2$, Eq. (3.35) implies that, for large
times (where the particle is close to the right maximum), $\dot{q}_{\rm cl} = -\omega(q_{\rm cl} - a)$, which integrates
to $q_{\rm cl}(\tau) \to a - e^{-\tau\omega}$ for $\tau \to \infty$. Thus the temporal extension of the instanton is set by the
oscillator frequencies of the local potential minima (the maxima of the inverted potential) and, in
cases where tunneling takes place on time scales much larger than that, can be regarded as
short (see Fig. 3.4).

Gerardus ’t Hooft 1946-
Nobel Laureate in Physics in 1999, with Martinus J. G. Veltman, “for elucidating the
quantum structure of electroweak interactions in physics.” Together, they were able to
identify the properties of the W and Z particles. The ’t Hooft-Veltman model allowed
scientists to calculate the physical properties of other particles, including the mass of the
top quark, which was directly observed in 1995. (Image © The Nobel Foundation.)

17 In addition to the original literature, the importance that has been attached to the instanton method has inspired
a variety of excellent and pedagogical reviews of the field. Of these, the following are highly recommended: A.
M. Polyakov, Quark confinement and topology of gauge theories, Nucl. Phys. B120 (1977), 429-58 - see also
A. M. Polyakov, Gauge Fields and Strings (Harwood, 1987); S. Coleman, Aspects of Symmetry - Selected
Erice Lectures (Cambridge University Press, 1985), Chapter 7.

Figure 3.4 Single-instanton configuration.

The confinement of the instanton configuration to a narrow interval of time has an impor- 
tant implication - there must exist approximate solutions of the stationary equation involv- 
ing further anti-instanton/instanton pairs (physically, the particle repeatedly bouncing to 
and fro in the inverted potential). According to the general philosophy of the saddle-point 
scheme, the path integral is obtained by summing over all solutions of the saddle-point 
equations and hence over all instanton configurations. The summation over multi-instanton 
configurations - termed the “instanton gas” - is substantially simplified by the fact that 
individual instantons have short temporal support (events of overlapping configurations are 
rare) and that not too many instantons can be accommodated in a finite time interval (the 
instanton gas is dilute). The actual density is dictated by the competition between the con- 
figurational “entropy” (favoring high density), and the “energetics,” the exponential weight 
implied by the action (favoring low density) - see the estimate below. 

In practice, multi-instanton configurations imply a transition amplitude

$$ G(a, \pm a; \tau) \simeq \sum_{n\ {\rm even/odd}} K^n \int_0^\tau d\tau_1 \int_0^{\tau_1} d\tau_2 \cdots \int_0^{\tau_{n-1}} d\tau_n\, A_n(\tau_1, \ldots, \tau_n), \tag{3.37} $$


where $A_n$ denotes the amplitude associated with n instantons, and we have taken into
account the fact that, in order to connect a with $\pm a$, the number of instantons must be
even/odd. The n instanton bounces contributing to each $A_n$ can take place at arbitrary times
$\tau_i \in [0, \tau]$, i = 1, ..., n, and all these possibilities have to be added (i.e. integrated). Here K
denotes a (dimensionful) constant absorbing the temporal dimension $[{\rm time}]^n$ introduced by
the time integrations, and $A_n(\tau_1, \ldots, \tau_n)$ is the transition amplitude, evaluated within the
semiclassical approximation around a configuration of n instanton bounces at times
$0 < \tau_n < \tau_{n-1} < \cdots < \tau_1 < \tau$ (see Fig. 3.5). In the following, we first focus on the transition
amplitude $A_n$, which controls the exponential dependence of the tunneling amplitude, returning
later to consider the prefactor K.

According to the general semiclassical principle, each amplitude $A_n = A_{n,{\rm cl}} \times A_{n,{\rm qu}}$
factorizes into two parts: a classical contribution $A_{n,{\rm cl}}$ accounting for the action of the
instanton configuration; and a quantum contribution $A_{n,{\rm qu}}$ resulting from quadratic fluctuations
around the classical path. Focusing initially on $A_{n,{\rm cl}}$ we note that, at intermediate
times, $\tau_i \ll \tau' \ll \tau_{i+1}$, where the particle rests on top of either of the maxima at $\pm a$, no
action accumulates (cf. the previous section). However, each instanton bounce has a finite
action $S_{\rm inst}$ (see Eq. (3.36)) and these contributions add up to give the full classical action,

$$ A_{n,{\rm cl}}(\tau_1, \ldots, \tau_n) = e^{-nS_{\rm inst}/\hbar}, \tag{3.38} $$

which is independent of the time coordinates $\tau_i$. (The individual instantons “do not know
of each other”; their action is independent of their relative position.)

Figure 3.5 Dilute instanton gas configuration.

As to the quantum factor $A_{n,{\rm qu}}$, there are, in principle, two contributions. Whilst the
particle rests on either of the hills (the straight segments in Fig. 3.5), quadratic fluctuations
around the classical (i.e. spatially constant) configuration play the same role as the quantum
fluctuations considered in the previous section, the only difference being that we are working
in a Wick rotated picture. There it was found that quantum fluctuations around a classical
configuration which stays for a (real) time t at the bottom of the well result in a factor
$\sqrt{1/\sin(\omega t)}$ (the remaining constants being absorbed into the prefactor $K^n$). Rotating to
imaginary times, $t \to -i\tau$, one can infer that the quantum fluctuation accumulated during
the stationary time $\tau_{i+1} - \tau_i$ is given by

$$ \sqrt{ \frac{1}{\sin(-i\omega(\tau_{i+1} - \tau_i))} } \sim e^{-\omega(\tau_{i+1} - \tau_i)/2}, $$

where we have used the fact that, for the dilute configuration, the typical separation times
between bounces are much larger than the inverse of the characteristic oscillator scales of
each of the minima. (It takes the particle much longer to tunnel through a high barrier than
to oscillate in either of the wells of the real potential.)

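Now, in principle, there are also fluctuations around the “bouncing” segments of the
path. However, due to the fact that a bounce takes a time of $O(\omega^{-1}) \ll \Delta\tau$, where $\Delta\tau$
represents the typical time between bounces, one can neglect these contributions (which is
to say that they can be absorbed into the prefactor K without explicit calculation). Within
this approximation, setting $\tau_0 \equiv 0$, $\tau_{n+1} \equiv \tau$, the overall quantum fluctuation correction is
given by

$$ A_{n,{\rm qu}}(\tau_1, \ldots, \tau_n) = \prod_{i=0}^{n} e^{-\omega(\tau_{i+1} - \tau_i)/2} = e^{-\omega\tau/2}, \tag{3.39} $$

again independent of the particular spacing configuration $\{\tau_i\}$. Combining Eq. (3.38) and
(3.39), one finds that

$$ G(a, \pm a; \tau) \simeq \sum_{n\ {\rm even/odd}} K^n e^{-nS_{\rm inst}/\hbar} e^{-\omega\tau/2} \int_0^\tau d\tau_1 \int_0^{\tau_1} d\tau_2 \cdots \int_0^{\tau_{n-1}} d\tau_n = e^{-\omega\tau/2} \sum_{n\ {\rm even/odd}} \frac{1}{n!} \left( \tau K e^{-S_{\rm inst}/\hbar} \right)^n. \tag{3.40} $$

Finally, performing the summation, one obtains the transition amplitude

$$ G(a, \pm a; \tau) \simeq C e^{-\omega\tau/2} \begin{cases} \cosh(\tau K e^{-S_{\rm inst}/\hbar}), \\ \sinh(\tau K e^{-S_{\rm inst}/\hbar}), \end{cases} \tag{3.41} $$

where C is some factor that depends in a non-exponential way on the transition time.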

Before we turn to a discussion of the physical content of this result, let us check the
self-consistency of our central working hypothesis - the diluteness of the instanton gas. To
this end, consider the representation of G in terms of the partial amplitudes (3.40). To
determine the typical number of instantons contributing to the sum, one may make use of
the fact that, for a general sum $\sum_n c_n$ of positive quantities $c_n > 0$, the “typical” value
of the summation index can be estimated as $\langle n \rangle \equiv \sum_n c_n n / \sum_n c_n$. With the abbreviation
$X \equiv \tau K e^{-S_{\rm inst}/\hbar}$, the application of this estimate to our current sum yields

$$ \langle n \rangle \equiv \frac{\sum_n n X^n/n!}{\sum_n X^n/n!} = X, $$

where we have used the fact that, as long as $\langle n \rangle \gg 1$, the even/odd distinction in the sum
is irrelevant. Thus, we can infer that the average instanton density, $\langle n \rangle/\tau = K e^{-S_{\rm inst}/\hbar}$, is
both exponentially small in the instanton action $S_{\rm inst}$, and independent of $\tau$, confirming the
validity of our diluteness assumptions above.
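The summation over instanton numbers and the estimate of the typical instanton number can be checked with a few lines of arithmetic. The sketch below uses an illustrative function name and treats $X = \tau K e^{-S_{\rm inst}/\hbar}$ as a plain number; it sums the series up to a cutoff `n_max` that is assumed to be large compared with X.

```python
import math

def instanton_gas_sums(x: float, n_max: int = 200):
    """Return (even-n sum, odd-n sum, mean n over all n, mean n over even n)
    for the weights c_n = x^n / n!."""
    term = 1.0                       # x^0 / 0!
    even_sum = odd_sum = 0.0
    weighted_all = weighted_even = 0.0
    for n in range(n_max + 1):
        if n > 0:
            term *= x / n            # x^n / n! built up recursively
        if n % 2 == 0:
            even_sum += term
            weighted_even += n * term
        else:
            odd_sum += term
        weighted_all += n * term
    mean_all = weighted_all / (even_sum + odd_sum)
    mean_even = weighted_even / even_sum
    return even_sum, odd_sum, mean_all, mean_even

if __name__ == "__main__":
    x = 5.0
    even_sum, odd_sum, mean_all, mean_even = instanton_gas_sums(x)
    print("even sum:", even_sum, " cosh(X):", math.cosh(x))
    print("odd sum :", odd_sum, " sinh(X):", math.sinh(x))
    print("<n> (all n):", mean_all, "  <n> (even n only):", mean_even)
```

Restricting to even n gives $\langle n \rangle = X \tanh X$ instead of X, so the even/odd distinction indeed becomes irrelevant once X is large.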

Finally, let us discuss how the form of the transition amplitude (3.41) can be understood
in physical terms. To this end, let us reconsider the basic structure of the problem we are
dealing with (see Fig. 3.6). While there is no coupling across the barrier, the Hamiltonian
has two independent, oscillator-like sets of low-lying eigenstates sitting in the two local
minima. Allowing for a weak inter-barrier coupling, the oscillator ground states (like all
higher states) split into a doublet of a symmetric and an antisymmetric eigenstate, $|S\rangle$ and
$|A\rangle$, with energies $\epsilon_S$ and $\epsilon_A$, respectively. Focusing on the low-energy sector formed by the
ground state doublet, we can express the transition amplitudes (3.32) as

$$ G(a, \pm a; \tau) \simeq \langle a | \left( |S\rangle e^{-\epsilon_S \tau/\hbar} \langle S| + |A\rangle e^{-\epsilon_A \tau/\hbar} \langle A| \right) | \pm a \rangle. $$

Setting $\epsilon_{A/S} = \hbar\omega/2 \pm \Delta\epsilon/2$, where $\Delta\epsilon$ represents the tunnel-splitting, the symmetry
properties $|\langle a|S\rangle|^2 = |\langle -a|S\rangle|^2 = C/2$ and $\langle a|A\rangle\langle A|-a\rangle = -|\langle a|A\rangle|^2 = -C/2$ imply that

$$ G(a, \pm a; \tau) \simeq \frac{C}{2} \left( e^{-(\hbar\omega - \Delta\epsilon)\tau/2\hbar} \pm e^{-(\hbar\omega + \Delta\epsilon)\tau/2\hbar} \right) = C e^{-\omega\tau/2} \begin{cases} \cosh(\Delta\epsilon\, \tau/2\hbar), \\ \sinh(\Delta\epsilon\, \tau/2\hbar). \end{cases} $$

Figure 3.6 Quantum states of the double well. The thick lines indicate energy levels of harmonic
oscillator states; the thin and dotted lines indicate exact symmetric (S) and antisymmetric (A)
eigenstates.

Comparing this expression with Eq. (3.41) the interpretation of the instanton calculation
becomes clear: at long times, the transition amplitude engages the two lowest states - the
symmetric and antisymmetric combinations of the two oscillator ground states. The energy
splitting $\Delta\epsilon$ accommodates the energy shift due to the tunneling between the two wells.
Remarkably, the effect of tunneling was obtained from a purely classical picture (formulated
in imaginary time!). The instanton calculation also produced a prediction for the tunnel
splitting of the energies, namely

$$ \Delta\epsilon = 2\hbar K \exp(-S_{\rm inst}/\hbar), $$

which, up to the prefactor, agrees with the result of a WKB-type analysis of the tunnel
process.
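The exponential sensitivity of the tunnel splitting to the instanton action can be observed by direct numerical diagonalization. The following is a rough sketch under assumptions not made in the text: a quartic double well $V(q) = \lambda (q^2 - a^2)^2$ with $\lambda = m\omega^2/(8a^2)$, so that the well frequency $\omega$ stays fixed while the well separation a varies; units with $\hbar = m = \omega = 1$; and a second-order finite-difference Hamiltonian on a finite grid. For this potential $S_{\rm inst} = \frac{2}{3} m\omega a^2$. Function names and grid parameters are illustrative.

```python
import numpy as np

def tunnel_splitting(a, omega=1.0, m=1.0, hbar=1.0, q_max=8.0, n_grid=1201):
    """Energy difference between the two lowest levels of an assumed quartic
    double well, from a finite-difference Hamiltonian on a uniform grid."""
    lam = m * omega**2 / (8.0 * a**2)          # fixes V''(+-a) = m * omega^2
    q = np.linspace(-q_max, q_max, n_grid)
    dq = q[1] - q[0]
    kin = hbar**2 / (2.0 * m * dq**2)
    potential = lam * (q**2 - a**2) ** 2
    hamiltonian = (np.diag(2.0 * kin + potential)
                   - kin * np.eye(n_grid, k=1)
                   - kin * np.eye(n_grid, k=-1))
    levels = np.linalg.eigvalsh(hamiltonian)   # ascending eigenvalues
    return float(levels[1] - levels[0])

def instanton_action(a, omega=1.0, m=1.0):
    """S_inst = (2/3) m omega a^2 for the assumed quartic double well."""
    return 2.0 / 3.0 * m * omega * a**2

if __name__ == "__main__":
    for a in (2.5, 3.0, 3.5):
        split = tunnel_splitting(a)
        s_inst = instanton_action(a)
        print(f"a = {a}: S_inst = {s_inst:.3f}, splitting = {split:.3e}, "
              f"splitting * exp(S_inst) = {split * np.exp(s_inst):.3f}")
```

The last column varies slowly compared with the splitting itself, which is the numerical counterpart of the statement that the exponential dependence is captured by the instanton action while the prefactor requires a separate calculation.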

Before leaving this section, some general remarks on instantons are in order: 

▷ In hindsight, was the approximation scheme used above consistent? In particular, terms
at second order in $\hbar$ were neglected, while terms non-perturbative in $\hbar$ (the instanton)
were kept. Yet the former typically give rise to a larger correction to the energy than
the latter. However, the large perturbative shift affects the energies of the symmetric and
antisymmetric states equally, whereas the instanton contribution gives the leading correction to
the splitting of the levels. It is the latter that is likely to be of more physical significance.
▷ Secondly, it may - legitimately - appear as though the development of the machinery
above was a bit of an “overkill” for describing a simple tunneling process. As a matter
of fact, the basic result Eq. (3.41) could have been obtained in a simpler way by more
elementary means (using, for example, the WKB method). Why then did we discuss
instantons at such length? One reason is that, even within a purely quantum mechanical
framework, the instanton formulation of tunneling is much stronger than WKB. The
latter represents, by and large, an uncontrolled approximation. In general it is hard to
tell whether WKB results are accurate or not. In contrast, the instanton approximation
to the path integral is controlled by a number of well-defined expansion parameters. For
example, by going beyond the semiclassical approximation and/or softening the diluteness
assumption, the calculation of the transition amplitudes can, in principle, be driven to
arbitrary accuracy.

▷ A second and, for our purposes, more important motivation is that instanton techniques
are of crucial importance within higher-dimensional field theories (here we regard the path 
integral formulation of quantum mechanics as a (0 space + 1 time) = 1-dimensional field 
theory). The reason is that instantons are intrinsically non-perturbative objects, which is 
to say that instanton solutions to stationary phase equations describe a type of physics 
that cannot be obtained by a perturbative expansion around a non-instanton sector of the 
theory. (For example, the bouncing orbits in the example above cannot be incorporated 
into the analysis by doing a kind of perturbative expansion around a trivial orbit.) This 
non-perturbative nature of instantons can be understood by topological reasoning. 

Relatedly, one of the features of the instanton analysis above was that the number of instantons
involved was a stable quantity; “stable” in the sense that by including perturbative
fluctuations around the $n$-instanton sector, say, one does not connect with the $n + 2$ sector.
Although no rigorous proof of this statement has been given, it should be heuristically
clear: a trajectory involving n bounces between the hills of the inverted potential cannot be 
smoothly connected with one of a different number. Suppose for instance we should forcibly 
attempt to interpolate between two paths with different bounce numbers. Inevitably, some 
of the intermediate configurations would be charged with actions that are far apart from 
any stationary phase-like value. Thus, the different instanton sectors are separated by an 
energetic barrier that cannot be penetrated by smooth interpolation and, in this sense, they 
are topologically distinct. 

INFO Fluctuation determinant: Our analysis above provided a method to extract the tun- 
neling rate between the quantum wells to a level of exponential accuracy. However, in some 
applications, it is useful to compute the exponential prefactor K. Although such a computation 
follows the general principles outlined above and implemented explicitly for the single well, there 
are some idiosyncrasies in the tunneling system that warrant discussion. 

According to the general principles outlined in Section 3.2, integrating over Gaussian fluctuations
around the saddle-point field configurations, the contribution to the transition amplitude
from the $n$-instanton sector is given by

$$G_n = J \det\left(-m\partial_\tau^2 + V''(q_{{\rm cl},n})\right)^{-1/2} e^{-nS_{\rm inst}},$$

where $q_{{\rm cl},n}(\tau)$ represents an $n$-instanton configuration and $J$ the normalization. Now, in the
zero-instanton sector, the evaluation of the functional determinant recovers the familiar harmonic
oscillator result, $G(a,a,\tau) = (m\omega/\pi\hbar)^{1/2}\exp[-\omega\tau/2]$. Let us now consider the one-instanton
sector of the theory. To evaluate the functional determinant, one must consider the spectrum
of the operator $-m\partial_\tau^2 + V''(q_{{\rm cl},1})$. Differentiating the defining equation for $q_{{\rm cl},1}$, Eq. (3.34), one
may confirm that

$$\left(-m\partial_\tau^2 + V''(q_{{\rm cl},1})\right)\partial_\tau q_{{\rm cl},1} = 0,$$

i.e. the function $\partial_\tau q_{{\rm cl},1}$ presents a zero mode of the operator! Physically, the origin of the zero
mode is elucidated by noting that a translation of the instanton along the time axis, $q_{{\rm cl},1}(\tau) \to
q_{{\rm cl},1}(\tau + \delta\tau)$, should leave the action approximately invariant. However, for small $\delta\tau$, $q_{{\rm cl},1}(\tau + \delta\tau) \simeq
q_{{\rm cl},1}(\tau) + \delta\tau\,\partial_\tau q_{{\rm cl},1}$, i.e. to first order, the addition of the increment function $\partial_\tau q_{{\rm cl},1}$ leaves the
action invariant, and $\delta\tau$ is a “zero mode coordinate.”

With this interpretation, it becomes clear how to repair the formula for the fluctuation determinant.
While the Gaussian integral over fluctuations is controlled for the non-zero eigenvalues,
its execution for the zero mode must be rethought. Indeed, by integrating over the coordinate
of the instanton, that is $\int_0^\tau d\tau_0 = \tau$, one finds that the contribution to the transition amplitude
in the one-instanton sector is given by

$$J\tau \sqrt{\frac{S_{\rm inst}}{2\pi\hbar}}\; {\det}'\left[-m\partial_\tau^2 + V''(q_{{\rm cl},1})\right]^{-1/2} e^{-S_{\rm inst}},$$

where the prime indicates the exclusion of the zero mode from the determinant, and the factor
$\sqrt{S_{\rm inst}/2\pi\hbar}$ reflects the Jacobian associated with the change to a new set of integration variables
which contains the zero mode coordinate $\tau$ as one of its elements. 18 To fix the, as yet, undetermined
coupling constant $J$, we normalize by the fluctuation determinant of the (imaginary
time) harmonic oscillator, i.e. we use the fact that (cf. Section 3.3), for the harmonic oscillator,
the return amplitude evaluates to $G(a,a,\tau) = J \det(m(-\partial_\tau^2 + \omega^2)/2)^{-1/2} = (m\omega/\pi\hbar)^{1/2} e^{-\omega\tau/2}$,
where the first/second representation is the imaginary time variant of Eq. (3.30)/Eq. (3.31). Using
this result, and noting that the zero mode analysis above generalizes to the $n$-instanton sector,
we find that the pre-exponential constant $K$ used in our analysis of the double-well problem
above affords the explicit representation

$$K = \sqrt{\frac{S_{\rm inst}}{2\pi\hbar}} \left[\frac{m\omega^2\, {\det}'\left[-m\partial_\tau^2 + V''(q_{{\rm cl},1})\right]}{\det\left[-m\partial_\tau^2 + m\omega^2\right]}\right]^{-1/2}.$$

Naturally, the instanton determinant depends sensitively on the particular nature of the potential
$V(q)$. For the quartic potential $V(q) = m\omega^2(q^2 - a^2)^2/8a^2$, it may be confirmed that

$$\frac{m\omega^2\, {\det}'\left[-m\partial_\tau^2 + V''(q_{{\rm cl},1})\right]}{\det\left[-m\partial_\tau^2 + m\omega^2\right]} = \frac{1}{12},$$

while $S_{\rm inst} = (2/3)m\omega a^2$. For further details of the calculation, we refer to, e.g., Zinn-Justin
(1993).
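
The value of the instanton action quoted for the quartic potential can be checked numerically. The following is a minimal Python sketch, assuming the standard zero-energy relation $S_{\rm inst} = \int_{-a}^{a} dq\,\sqrt{2mV(q)}$ for the instanton in the inverted potential. The function names `quartic_potential` and `instanton_action`, and the default parameter values, are illustrative choices and not part of the text.

```python
import math


def quartic_potential(q, m=1.0, omega=1.0, a=1.0):
    """Double-well potential V(q) = m*omega^2*(q^2 - a^2)^2 / (8*a^2)."""
    return m * omega**2 * (q**2 - a**2) ** 2 / (8.0 * a**2)


def instanton_action(m=1.0, omega=1.0, a=1.0, n=2000):
    """S_inst = integral of sqrt(2*m*V(q)) from -a to a (composite Simpson rule)."""
    if n % 2:
        n += 1  # Simpson's rule needs an even number of intervals
    h = 2.0 * a / n
    total = 0.0
    for i in range(n + 1):
        q = -a + i * h
        # Simpson weights: 1 at the end points, then alternating 4, 2, 4, ...
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * math.sqrt(2.0 * m * quartic_potential(q, m, omega, a))
    return total * h / 3.0
```

For this potential the integrand reduces to the quadratic polynomial $m\omega(a^2 - q^2)/2a$, so the Simpson rule should agree with $(2/3)m\omega a^2$ up to rounding.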

Escape from a metastable minimum: bounces 
The instanton gas approximation for the double- well system can be easily adapted to explore 
the problem of quantum mechanical tunneling from a metastable state such as that pre- 
sented by an unstable nucleus. In particular, suppose one wishes to estimate the “survival 
probability” of a particle captured in a metastable minimum of a one-dimensional potential 
such as that shown in Fig. 3.7. 

Figure 3.7 Effective potential showing a metastable minimum together with the inverted potential
and a sketch of a bounce solution. To obtain the tunneling rate it is necessary to sum over a dilute
gas of bounce trajectories.

18 See J. Zinn-Justin, Quantum Field Theory and Critical Phenomena (Oxford University Press, 1993) for an
explicit calculation of this Jacobian.

According to the path integral scheme, the survival probability, defined by the probability
amplitude of remaining at the potential minimum $q_m$, i.e. the propagator $G(q_m, q_m; t)$, can
be evaluated by making use of the Euclidean time formulation of the Feynman path integral.
As with the double well, in the Euclidean time formalism the dominant contribution to the 
transition probability arises from the classical path minimizing the action corresponding 
to the inverted potential (see Fig. 3.7). However, in contrast to the double- well potential, 
the classical solution takes the form of a “bounce” (i.e. the particle spends only a short 
time away from the potential minimum - there is only one metastable minimum of the 
potential). As with the double well, one can expect multiple bounce trajectories to present 
a significant contribution. Summing over all bounce trajectories (note that in this case we 
have an exponential series - no even/odd parity effect), one obtains the survival probability 

$$G(q_m, q_m; \tau) = C e^{-\omega\tau/2} \exp\left[\tau K e^{-S_{\rm bounce}/\hbar}\right].$$

Applying an analytic continuation to real time, one finds $G(q_m, q_m; t) = C e^{-i\omega t/2} \exp[-\Gamma t/2]$,
where the decay rate is given by $\Gamma/2 = |K| e^{-S_{\rm bounce}/\hbar}$. (Note that on physical grounds we
can see that $K$ must be imaginary. 19 )
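
The remark on the exponential series can be made concrete with a short numerical sketch. It assumes, as in the dilute gas picture, that the sector with $n$ bounces contributes $x^n/n!$ with $x = \tau K e^{-S_{\rm bounce}/\hbar}$, and that for the double well only sectors of one parity connect a given pair of end points. The function name `dilute_gas_sum` and the argument `parity` are hypothetical, introduced only for this illustration.

```python
import math


def dilute_gas_sum(x, parity=None, n_max=60):
    """Sum x**n / n! for n = 0..n_max.

    parity=None keeps every n (bounces from a metastable minimum),
    parity=0 keeps even n, parity=1 keeps odd n (double-well instantons).
    """
    total = 0.0
    term = 1.0  # x**0 / 0!
    for n in range(n_max + 1):
        if parity is None or n % 2 == parity:
            total += term
        term *= x / (n + 1)  # build x**(n+1) / (n+1)! recursively
    return total
```

Summing all sectors reproduces `math.exp(x)`, whereas the even and odd restrictions reproduce `math.cosh(x)` and `math.sinh(x)`. This is the distinction between the bounce problem (pure exponential decay) and the double well (level splitting).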

EXERCISE Consider a heavy nucleus having a finite rate of $\alpha$-decay. The nuclear forces can
be considered very short-range so that the rate of $\alpha$-particle emission is controlled by tunneling
under a Coulomb barrier. Taking the effective potential to be spherically symmetric with a deep
minimum core of radius $r_0$ beyond which it decays as $U(r) = 2(Z - 1)e^2/r$ where $Z$ is the nuclear
charge, find the temperature of the nuclei above which $\alpha$-decay will be thermally assisted if the
energy of the emitted particles is $E_0$. Estimate the mean energy of the $\alpha$ particles as a function
of temperature.

EXERCISE A uniform electric field E is applied perpendicular to the surface of a metal with 
work function W . Assuming that the electrons in the metal describe a Fermi gas of density n, with 
exponential accuracy, find the tunneling current at zero temperature (“cold emission”). Show 
that, effectively, only electrons with energy near the Fermi level are tunneling. With the same 
accuracy, find the current at finite temperature (“hot emission”). What is the most probable 
energy of tunneling electrons as a function of temperature? 


19 In fact, a more careful analysis shows that this estimate of the decay rate is too large by a factor of 2 (for
further details see, e.g., the discussion in Coleman; see note 17).






Figure 3.8 Snapshot of a field configuration $\phi(x, t = {\rm const.})$ in a potential landscape with two
nearly degenerate minima. For further discussion, see the text.

Tunneling of quantum fields: “fate of the false vacuum” 

Hitherto we have focused on applications of the Feynman path integral to the quantum 
mechanics of isolated point-like particles. In this setting, the merit of the path integral 
scheme over, say, standard perturbative methods or the “WKB” approach is perhaps not 
compelling. Therefore, by way of motivation, let us present an example that builds on the 
structures elucidated above and illustrates the power of the path integral method. 

To this end, let us consider a theory involving a continuous classical field that can adopt 
two homogeneous equilibrium states with different energy densities. To be concrete, one may 
consider a harmonic chain confined to one or other minimum of an asymmetric quasi-one- 
dimensional “gutter-like” double- well potential (see Fig. 3.8). When quantized, the state of 
higher energy density becomes unstable through barrier penetration - it is said to be a “false
vacuum”. 20 Specifically, drawing on our discussion of the harmonic chain in Chapter 1, let
us consider a quantum system specified by the Hamiltonian density

$$\hat{\mathcal H} = \frac{\hat\pi^2}{2m} + \frac{k_s a^2}{2}(\partial_x\hat\phi)^2 + V(\hat\phi), \qquad (3.42)$$

where $[\hat\pi(x), \hat\phi(x')] = -i\hbar\delta(x - x')$. Here we have included a potential $V(\phi)$ which, in the
present case, assumes the form of a double well. The inclusion of a weak bias $-f\phi$ in $V(\phi)$
identifies a stable and a metastable potential minimum. Previously, we have seen that, in
the absence of the confining potential, the quantum string exhibits low-energy collective
wave-like excitations - phonons. In the confining potential, these harmonic fluctuations are
rendered massive. However, drawing on the quantum mechanical principles established in
the single-particle system, one might assume that the string tunnels freely between the two
potential minima. To explore the capacity of the system to tunnel, let us suppose that, at
some time $t = 0$, the system adopts a field configuration in which the string is located in
the (metastable) minimum of the potential at, say, $\phi = -a$. What is the probability that
the entire string of length $L$ will tunnel across the barrier into the potential minimum at
$\phi = a$ in a time $t$?

20 For a detailed discussion of the history and ramifications of this idea, we refer to the original insightful paper by
Sidney Coleman, Fate of the false vacuum: semiclassical theory, Phys. Rev. D 15 (1977), 2929-36. In fact, many
of the ideas developed in this work were anticipated in an earlier analysis of metastability in the context of
classical field theories by J. S. Langer, Theory of the condensation point, Ann. Phys. (NY) 41 (1967), 108-57.

INFO The tunneling of fields between nearly degenerate ground states plays a role in numerous
physical contexts. By way of example, consider a superheated liquid. In this context, the 
“false” vacuum is the liquid state, the true one the gaseous phase. The role of the field is taken by 
the local density distribution in the liquid. Thermodynamic fluctuations trigger the continuous 
appearance of vapor bubbles in the liquid. For bubbles of too small a diameter, the gain in 
volume energy is outweighed by the surface energy cost - the bubble will collapse. However, 
for bubbles beyond a certain critical size, the energy balance is positive. The bubble will grow 
and, eventually, swallow the entire mass density of the system; the liquid has vaporized or, more 
formally, the density field tunneled 21 from the false ground state into the true one.

More speculative (but also potentially more damaging) manifestations of the phenomenon
have been suggested in the context of cosmology: 22 what if the big bang released our universe
not into its true vacuum configuration but into a state separated by a huge barrier from a more
favorable sector of the energy landscape? In this case, everything would depend on the tunneling
rate:

If this time scale is of the order of milliseconds, the universe is still hot when the false vacuum
decays... if this time is of the order of years, the decay will lead to a sort of secondary big bang
with interesting cosmological consequences. If this time is of the order of $10^9$ years, we have
occasion for anxiety.

(S. Coleman) 

Previously, for the point-particle system, we have seen that the transition probability 
between the minima of the double well is most easily accessed by exploring the classical 
field configurations of the Euclidean time action. In the present case, anticipating to some 
extent our discussion of the quantum field integral in the next chapter, the Euclidean time 
action associated with the Hamiltonian density (3.42) assumes the form 23

$$S[\phi] = \int_0^T d\tau \int_0^L dx \left[\frac{m}{2}(\partial_\tau\phi)^2 + \frac{k_s a^2}{2}(\partial_x\phi)^2 + V(\phi)\right],$$

where the time integral runs over the interval $[0, T = it]$. Here, for simplicity, let us assume
that the string obeys periodic boundary conditions in space, namely $\phi(x + L, \tau) = \phi(x, \tau)$. To
estimate the tunneling amplitude, we will explore the survival probability of the metastable
state, imposing the boundary conditions $\phi(x, \tau = 0) = \phi(x, \tau = T) = -a$ on the path
integral. Once again, when the potential barrier is high, and the time $T$ is long, one may
assume that the path integral is dominated by the saddle-point field configuration of the
Euclidean action. In this case, varying the action with respect to the field $\phi(x, \tau)$, one
obtains the classical equation of motion

$$m\partial_\tau^2\phi + k_s a^2\partial_x^2\phi = \partial_\phi V(\phi),$$

which must be solved subject to the boundary conditions above.

21 At this point, readers should no longer be confused regarding the mentioning of “tunneling” in the context of
a classical system. Within the framework of the path integral, the classical partition sum maps onto the path
integral of a fictitious quantum system. It is this tunneling that we have in mind.

22 See note 25.

23 Those readers who wish to develop a more rigorous formulation of the path integral for the string may either
turn to the discussion of the field integral in the next chapter or, alternatively, may satisfy themselves of the
validity of the Euclidean action by (re-)discretizing the harmonic chain, presenting the transition amplitude as
a series of Feynman path integrals for each element of the string and, finally, taking the continuum limit.

Now, motivated by our consideration of the point-particle problem, one might seek a 
solution in which the string tunnels as a single rigid entity without “flexing.” However, it 
is evident from the spatial translational invariance of the system that the instanton action 
would scale with the system size $L$. In the infinite system $L \to \infty$, such a trajectory would
therefore not contribute significantly to the tunneling amplitude. Instead, one must consider
a different type of field configuration in which the transfer of the chain is by degree. In this,
elements of the string cross the barrier in a consecutive sequence as two outwardly propagating
“domain walls” (see the figure, where the emergence of such a double-kink configuration
is shown as a function of space and time; notice the circular shape of the resulting space-time
droplet - a consequence of the rotational symmetry of the rescaled problem). Such a field
configuration can be motivated from symmetry considerations by noting that, after rescaling
$x \to v_s x$ (where $v_s = \sqrt{k_s a^2/m}$ denotes the classical sound wave velocity), the saddle-point
equation assumes the isotropic form $m\partial^2\phi = \partial_\phi V(\phi)$, where $\partial^2 = \partial_\tau^2 + \partial_x^2$. Then, setting
$r = \sqrt{x^2 + (\tau - T/2)^2}$ and sending $(T, L) \to \infty$, the space-time rotational symmetry suggests
a solution of the form $\phi = \phi(r)$, where $\phi(r)$ obeys the radial diffusion equation

$$m\partial_r^2\phi + \frac{m}{r}\partial_r\phi = \partial_\phi V,$$

with the boundary condition $\lim_{r\to\infty}\phi(r) = -a$. This equation describes the one-dimensional
motion of a particle in a potential $-V$ and subject to a strange “friction force”
$-mr^{-1}\partial_r\phi$ whose strength is inversely proportional to “time” $r$.

To understand the profile of the bounce solution, suppose that at time r = 0 the particle 
has been released from rest at a position slightly to the left of the (inverted) potential 
maximum at a. After rolling through the potential minimum it will climb the potential hill 
at —a. Now, the initial position may be fine-tuned in such a way that the viscous damping 
of the particle compensates for the excess potential energy (which would otherwise make the 
particle overshoot and disappear to infinity): there exists a solution where the particle starts
close to $\phi = a$ and eventually winds up at $\phi = -a$, in accord with the imposed boundary
conditions. In general, the analytical solution for the bounce depends sensitively on the form
of the confining potential. However, if we assume that the well asymmetry imposed by the
external potential $-f\phi$ is small, the radial equation may be considerably simplified. In this
limit, one may invoke a “thin-wall” approximation in which one assumes that the bounce
configuration is described by a domain wall of thickness $\Delta r$, at a radius $r_0 \gg \Delta r$, separating
an inner region where $\phi(r < r_0) = a$ from the outer region where $\phi(r > r_0) = -a$. In this
case, and to lowest order in an expansion in $f$, the action of the friction force is immaterial,
i.e. we may set $m\partial_r^2\phi = \partial_\phi V$ - the very instanton equation formulated earlier for the
point-particle system!

Then, substituting back into $S$, one finds that the bounce (or kink-like) solution is characterized
by the Euclidean action

$$S = v_s\left[2\pi r_0 S_{\rm inst} - \pi r_0^2\, 2af\right],$$

where $S_{\rm inst}$ denotes the action of the instanton associated with the point-particle system
Eq. (3.36), and the last term accommodates the effect of the potential bias on the field
configuration. Crucially, one may note that the instanton contribution to the action scales
with the circumference of the domain wall in the space-time, while that of the potential
bias scales with the area of the domain. From this scaling dependence, it is evident that,
however small the external force $f$, at large enough $r_0$ the contribution of the second term
will always outweigh the first and the string will tunnel from the metastable minimum to
the global minimum of the potential. More precisely, the optimal size of domain is found by
extremizing the action with respect to $r_0$. In doing so, one finds that $r_0 = S_{\rm inst}/2af$. Then,
substituting back into the action, one obtains the tunneling rate

$$\Gamma \sim \exp\left[-\frac{1}{\hbar}\frac{\pi v_s S_{\rm inst}^2}{2af}\right].$$


From this result, one can conclude that, in the absence of an external force $f$, the tunneling
of the string across the barrier is completely quenched! In the zero-temperature unbiased
system, the symmetry of the quantum Hamiltonian is broken: the ground state exhibits a 
two-fold degeneracy in which the string is confined to one potential minimum or another. 
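
The thin-wall estimate of the critical droplet can be reproduced numerically. The sketch below takes the thin-wall action $S(r_0) = v_s[2\pi r_0 S_{\rm inst} - \pi r_0^2\, 2af]$ from the text as given and locates its stationary point by bisection on a finite-difference derivative. The function names `thin_wall_action` and `stationary_radius`, and the choice of bisection, are assumptions made for this illustration. Note that the stationary point is a maximum of $S(r_0)$: droplets smaller than the critical radius shrink, larger ones grow.

```python
import math


def thin_wall_action(r0, s_inst, a, f, v_s=1.0):
    """Thin-wall bounce action S(r0) = v_s*[2*pi*r0*S_inst - pi*r0**2 * 2*a*f]."""
    return v_s * (2.0 * math.pi * r0 * s_inst - math.pi * r0**2 * 2.0 * a * f)


def stationary_radius(s_inst, a, f, v_s=1.0, iterations=200):
    """Solve dS/dr0 = 0 by bisection on a central-difference derivative."""

    def slope(r):
        h = 1e-6 * max(1.0, abs(r))
        return (thin_wall_action(r + h, s_inst, a, f, v_s)
                - thin_wall_action(r - h, s_inst, a, f, v_s)) / (2.0 * h)

    lo, hi = 0.0, 1.0
    while slope(hi) > 0.0:  # enlarge the bracket until the bias term dominates
        hi *= 2.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if slope(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

With, say, `s_inst = 2/3`, `a = 1` and `f = 0.05`, the radius returned by `stationary_radius` should agree with $S_{\rm inst}/2af$, and the action evaluated there with $\pi v_s S_{\rm inst}^2/2af$, the exponent of the tunneling rate.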

The ramifications of the tunneling amplitude suppression can be traced to the statistical
mechanics of the corresponding classical system. As emphasized in Section 3.2, any
Euclidean time path integral of a $d$-dimensional system can be identified with the statistical
mechanics of a classical $(d + 1)$-dimensional system. In the double-well system,
the Euclidean time action of the point-particle quantum system is isomorphic to the one-dimensional
realization of the classical Ising ferromagnet, namely

$$\beta H_{\rm Ising} = \int dx \left[\frac{t}{2}m^2 + u m^4 + \frac{K}{2}(\nabla m)^2\right]. \qquad (3.43)$$

Translated into this context, the saddle-point (or mean-field) analysis suggests that the
system will exhibit a spontaneous symmetry breaking to an ordered phase ($m \neq 0$) when
the parameter $t$ (the reduced temperature) becomes negative. However, drawing on our
analysis of the quantum point-particle system, in the thermodynamic limit, we see that fluctuations
(non-perturbative in temperature) associated with instanton field configurations
of the Hamiltonian $m(x)$ may restore the symmetry of the system and destroy long-range
order at any finite temperature $1/\beta$. Whether this happens or not depends on the competition
between the energy cost of instanton creation and the entropy gained by integrating
over the instanton zero-mode coordinates. It turns out that, in $d = 1$, the latter wins, i.e.
the system is “disordered” at any finite temperature. In contrast, for $d \geq 2$, the creation
of instantons is too costly, i.e. the system will remain in its energetically preferred ground
state. For further discussion of these issues, we refer to the Info block in Section 8.1.1.
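
The statement that the one-dimensional system is disordered at any finite temperature can be illustrated on a lattice stand-in. The sketch below does not use the continuum model (3.43); it assumes instead the nearest-neighbour Ising chain with coupling $J$ in zero field, whose transfer matrix has eigenvalues $2\cosh\beta J$ and $2\sinh\beta J$. The function names are hypothetical. The second function encodes the dilute domain-wall estimate, in which each wall (the lattice counterpart of an instanton) costs an energy $2J$.

```python
import math


def ising_chain_correlation_length(beta_J):
    """Correlation length (lattice units) of the zero-field Ising chain, beta_J > 0."""
    lam_plus = 2.0 * math.cosh(beta_J)   # largest transfer-matrix eigenvalue
    lam_minus = 2.0 * math.sinh(beta_J)  # second transfer-matrix eigenvalue
    return 1.0 / math.log(lam_plus / lam_minus)


def domain_wall_estimate(beta_J):
    """Dilute domain-wall estimate of the correlation length, exp(2*beta*J)/2."""
    return 0.5 * math.exp(2.0 * beta_J)
```

The correlation length grows exponentially on cooling but stays finite for every finite `beta_J`, so long-range order is absent; at low temperature it approaches the domain-wall estimate.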


Tunneling in a dissipative environment 

In the condensed matter context it is, of course, infeasible to completely divorce a system 
from its environment. Indeed, in addition to the dephasing effect of thermal fluctuations, 
the realization of quantum mechanical phenomena depends sensitively on the strength and 
nature of the coupling to the external degrees of freedom. For example, the tunneling of an 
atom from one interstitial site in a crystal to another is likely to be heavily influenced by its 
coupling to the phonon degrees of freedom that characterize the crystal lattice. By exchang- 
ing energy with the phonons, which act in the system as an external bath, a quantum 
particle can lose its phase coherence and with it, its quantum mechanical character. Begin- 
ning with the seminal work of Caldeira and Leggett, 24 there have been numerous theoretical 
investigations of the effect of an environment on the quantum mechanical properties of a 
system. Such effects are particularly acute in systems where the quantum mechanical degree 
of freedom is macroscopic, such as the magnetic flux trapped in a superconducting quantum
interference device (SQUID). In the following, we show that the Feynman path integral 
provides a natural (and almost unique) setting in which the effects of the environment on 
a microscopic or macroscopic quantum mechanical degree of freedom can be explored. For 
further discussion of the response of quantum wave coherence to environmental coupling, 
we refer to Chapter 11. 

Before we begin, let us note that the phenomenon of macroscopic quantum tunneling 
represents an extensive and still active area of research recently reinvigorated by the bur- 
geoning field of quantum computation. By contrast, our discussion here will be necessarily 
limited in scope, targeting a particular illustrative application, and highlighting only the 
guiding principles. For a more thorough and detailed discussion, we refer the reader to one 
of the many comprehensive reviews. 25 


Caldeira-Leggett model 

Previously, we have discussed the ability of the Feynman path integral to describe quantum
mechanical tunneling of a particle $q$ across a potential barrier $V(q)$. In the following, we will
invoke the path integral to explore the capacity for quantum mechanical tunneling when the
particle is coupled to degrees of freedom of an external environment.

24 A. O. Caldeira and A. J. Leggett, Influence of dissipation on quantum tunneling in macroscopic systems, Phys.
Rev. Lett. 46 (1981), 211-14.

25 See, e.g., A. J. Leggett et al., Dynamics of the dissipative two-state system, Rev. Mod. Phys. 59 (1987), 1-85,
and U. Weiss, Quantum Dissipative Systems (World Scientific Publishing, 1993).

Following Caldeira and Leggett’s original formulation, let us represent the environment by a bath
of quantum harmonic oscillators characterized by a set of frequencies $\{\omega_\alpha\}$,

$$\hat H_{\rm bath}[q_\alpha] = \sum_\alpha^N \left[\frac{\hat p_\alpha^2}{2m_\alpha} + \frac{m_\alpha}{2}\omega_\alpha^2 q_\alpha^2\right],$$

where $N$ is the number of bath-oscillators. For simplicity, let us suppose that, in the leading
approximation, the particle-bath coupling is linear in the bath coordinates and such that
$\hat H_c[q, q_\alpha] = -\sum_\alpha^N f_\alpha[q] q_\alpha$, where $f_\alpha[q]$ represents some function of the particle coordinate
$q$. Expressed as a Feynman path integral, the survival probability of a particle confined to
a metastable minimum at a position $q = a$, and coupled to an external environment, can
then be expressed as ($\hbar = 1$)

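$$\langle a|e^{-i\hat Ht/\hbar}|a\rangle = \int_{q(0)=q(t)=a} Dq\, e^{iS_{\rm part}[q]} \int Dq_\alpha\, e^{iS_{\rm bath}[q_\alpha] + iS_c[q, q_\alpha]},$$

where $\hat H = \hat H_{\rm part} + \hat H_{\rm bath} + \hat H_c$ denotes the total Hamiltonian of the system,

$$S_{\rm part}[q] = \int_0^t dt' \left[\frac{m}{2}\dot q^2 - V(q)\right], \qquad S_{\rm bath}[q_\alpha] = \int_0^t dt' \sum_\alpha \frac{m_\alpha}{2}\left[\dot q_\alpha^2 - \omega_\alpha^2 q_\alpha^2\right],$$

denote, respectively, the actions of the particle and bath, while

$$S_c[q, q_\alpha] = \int_0^t dt' \sum_\alpha \left[f_\alpha[q]\, q_\alpha - \frac{f_\alpha[q]^2}{2m_\alpha\omega_\alpha^2}\right]$$

represents their coupling. 26 Here we assume that the functional integral over $q_\alpha(t)$ is taken
over all field configurations of the bath while, as before, the path integral on $q(t)$ is subject
to the boundary conditions $q(0) = q(t) = a$.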

To reveal the effect of the bath on the capacity for tunneling of the particle, it is instructive
to integrate out fluctuations $q_\alpha$ and thereby obtain an effective action for $q$. Being Gaussian
in the coordinates $q_\alpha$, the integration can be performed straightforwardly. Although not
crucial, since we are dealing with quantum mechanical tunneling, it is useful to transfer to
the Euclidean time representation. Taking the boundary conditions on the fields $q_\alpha(\tau)$ to
be periodic on the interval $[0, T^{-1} = \beta]$, it may be confirmed that the Gaussian functional
integral over $q_\alpha$ induces a time non-local interaction of the particle (exercise),
$\langle a|e^{-i\hat Ht/\hbar}|a\rangle = \int Dq\, e^{-S_{\rm eff}[q]}$, where a constant of integration has been absorbed into the measure and

$$S_{\rm eff}[q] = S_{\rm part}[q] + \frac{1}{2T}\sum_{\omega_n, \alpha} \frac{\omega_n^2\, f_\alpha[q(\omega_n)]\, f_\alpha[q(-\omega_n)]}{m_\alpha\omega_\alpha^2(\omega_\alpha^2 + \omega_n^2)}.$$

26 The second term in the coupling action has been added to keep the effect of the environment minimally invasive
(purely dissipative). If it were not present, the coupling to the oscillator degrees of freedom would effectively shift
the extremum of the particle potential, i.e. change its potential landscape. (Exercise: Substitute the solutions of
the Euler-Lagrange equations $\delta_{q_\alpha} S[q, q_\alpha] = 0$ [computed for a fixed realization of $q$] into the action to obtain
the said shift.)
Here, the sum $\sum_{\omega_n}$ runs over the discrete set of Fourier frequencies $\omega_n = 2\pi nT$ with $n$
integer. 27 By integrating out the bath degrees of freedom, the particle action acquires an
induced contribution. To explore its effect on dissipation and tunneling, it is necessary to
specialize our discussion to a particular form of coupling.
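
The Gaussian integration over the bath, left as an exercise above, can be outlined for a single mode $\alpha$ as follows. This is a sketch under stated assumptions rather than the full derivation: it uses the Fourier convention $q(\tau) = \sum_n q_n e^{i\omega_n\tau}$, for which $\int_0^\beta d\tau\, g(\tau)h(\tau) = T^{-1}\sum_n g_n h_{-n}$, it abbreviates $f_{\alpha,n} \equiv f_\alpha[q(\omega_n)]$, and it assumes that the Euclidean bath-plus-coupling action contains the counter-term $+f_\alpha^2/2m_\alpha\omega_\alpha^2$. Because the action is quadratic in $q_\alpha$, its value at the saddle point gives the exact exponent.

```latex
\begin{align}
S_\alpha[q, q_\alpha]
  &= \frac{1}{T}\sum_n \left[ \frac{m_\alpha}{2}(\omega_n^2 + \omega_\alpha^2)\,
     q_{\alpha,n}\, q_{\alpha,-n} - f_{\alpha,n}\, q_{\alpha,-n}
     + \frac{f_{\alpha,n} f_{\alpha,-n}}{2 m_\alpha \omega_\alpha^2} \right], \\
q_{\alpha,n}^{\rm saddle}
  &= \frac{f_{\alpha,n}}{m_\alpha(\omega_n^2 + \omega_\alpha^2)}, \\
S_\alpha\big|_{\rm saddle}
  &= \frac{1}{2T}\sum_n f_{\alpha,n} f_{\alpha,-n}
     \left[ \frac{1}{m_\alpha\omega_\alpha^2}
          - \frac{1}{m_\alpha(\omega_\alpha^2 + \omega_n^2)} \right]
   = \frac{1}{2T}\sum_n \frac{\omega_n^2\, f_{\alpha,n} f_{\alpha,-n}}
     {m_\alpha\omega_\alpha^2(\omega_\alpha^2 + \omega_n^2)} .
\end{align}
```

Summing over $\alpha$ gives the induced term of the effective action. The factor $\omega_n^2$ in the numerator, which removes any static ($\omega_n = 0$) shift of the potential, is entirely due to the counter-term.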

In the particular case that the coupling to the bath is linear, i.e. $f_\alpha[q(\tau)] = c_\alpha q(\tau)$, the
effective action assumes the form (exercise)

$$S_{\rm eff}[q] = S_{\rm part}[q] - T\int_0^\beta d\tau\, d\tau'\, K(\tau - \tau')\, q(\tau)\, q(\tau'),$$

where $K(\tau) = \int_0^\infty \frac{d\omega}{\pi} J(\omega) D_\omega(\tau)$, $J(\omega) = \frac{\pi}{2}\sum_\alpha \frac{c_\alpha^2}{m_\alpha\omega_\alpha}\delta(\omega - \omega_\alpha)$, and

$$D_\omega(\tau) = -\sum_{\omega_n} \frac{\omega_n^2}{\omega(\omega^2 + \omega_n^2)}\, e^{i\omega_n\tau}$$

resembles the Green function of a boson with energy $\hbar\omega$. Physically, the non-locality of the
action is easily understood. By exchanging fluctuations with the external bath, a particle
can effect a self-interaction, retarded in time. Taken as a whole, the particle and the bath
maintain quantum phase coherence. However, when projected onto the particle degree of
freedom, the total energy of the system appears to fluctuate and the phase coherence of
the particle transport is diminished. To explore the properties of the dissipative action,
it is helpful to separate the non-local interaction according to the identity $q(\tau)q(\tau') =
[q^2(\tau) + q^2(\tau')]/2 - [q(\tau) - q(\tau')]^2/2$. The first square-bracketed contribution presents an
innocuous renormalization of the potential $V(q)$ and, applying equally to the classically
allowed motion and quantum tunneling, presents an unobservable perturbation. Therefore,
we will suppose that its effect has been absorbed into a redefinition of the particle potential
$V(q)$. By contrast, the remaining contribution is always positive.

The particular form of the “spectral function” $J(\omega)$ may be obtained either from an a
priori knowledge of the microscopic interactions of the bath or, phenomenologically, it can
be inferred from the structure of the classical damped equations of motion. For example, for
a system subject to an “ohmic” dissipation (where, in real time, the classical equations of
motion obtain a dissipative term $-\eta\dot q$ with a “friction coefficient” $\eta$), one has $J(\omega) = \eta|\omega|$
for all frequencies smaller than some characteristic cut-off (at the scale of the inverse Drude
relaxation time of the environment). By contrast, for a defect in a three-dimensional crystal,
interaction with acoustic phonons presents a frequency dependence of $\omega^3$ or $\omega^5$ depending
on whether $\omega$ is below or above the Debye frequency.

27 More precisely, anticipating our discussion of the Matsubara frequency representation, we have defined
the Fourier decomposition on the Euclidean time interval $[0, \beta]$, namely $q(\tau) = \sum_m q_m e^{i\omega_m\tau}$, $q_m =
T\int_0^\beta d\tau\, q(\tau) e^{-i\omega_m\tau}$, where $\omega_m = 2\pi m/\beta$ with $m$ integer.

INFO Consider, for example, the coupling of a particle to a continuum of bosonic modes whose
spectral density $J(\omega) = \eta\omega$ grows linearly with frequency. In this case,

$$K(\omega_n) = -\frac{\omega_n^2}{\pi}\int_0^\infty d\omega\, \frac{J(\omega)}{\omega(\omega^2 + \omega_n^2)} = -\frac{\eta}{2}|\omega_n|$$

describes Ohmic dissipation of the particle. Fourier transforming this expression we obtain

$$K(\tau) = \frac{\pi T\eta}{2}\frac{1}{\sin^2(\pi T\tau)} \simeq \frac{\eta}{2\pi T}\frac{1}{\tau^2} \quad (\tau \ll T^{-1}), \qquad (3.44)$$

i.e. a strongly time non-local “self-interaction” of the particle.
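
The Fourier transform leading to Eq. (3.44) can be checked numerically. The sketch below assumes the convention $K(\tau) = \sum_n K(\omega_n) e^{i\omega_n\tau}$ with $K(\omega_n) = -(\eta/2)|\omega_n|$, and regularizes the divergent Matsubara sum with a convergence factor $e^{-\epsilon|n|}$. The regulator, the cut-off `n_max` and the function names are choices made for this illustration and are not taken from the text.

```python
import math


def kernel_closed_form(tau, T, eta):
    """Ohmic kernel K(tau) = pi*T*eta / (2*sin(pi*T*tau)**2)."""
    return math.pi * T * eta / (2.0 * math.sin(math.pi * T * tau) ** 2)


def kernel_short_time(tau, T, eta):
    """Short-time form K(tau) ~ eta / (2*pi*T*tau**2), valid for tau << 1/T."""
    return eta / (2.0 * math.pi * T * tau**2)


def kernel_matsubara_sum(tau, T, eta, eps=1e-2, n_max=5000):
    """Regularized sum over n of K(omega_n)*exp(i*omega_n*tau), K(omega_n) = -(eta/2)|omega_n|."""
    total = 0.0
    for n in range(1, n_max + 1):
        omega_n = 2.0 * math.pi * n * T
        # the +n and -n terms combine into a cosine; the n = 0 term vanishes
        total += -eta * omega_n * math.cos(omega_n * tau) * math.exp(-eps * n)
    return total
```

For $T\tau$ of order one the regularized sum should approach the closed form as `eps` is reduced, while for $T\tau \ll 1$ the closed form reduces to the $1/\tau^2$ law, which is the origin of the long-range “self-interaction” in imaginary time.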


Dissipative quantum tunneling

To return to the particular problem at hand, previously we have seen that the tunneling rate 
of a particle from a metastable potential minimum can be inferred from the extremal field 
configurations of the Euclidean action: the bounce trajectory. To explore the effect of the 
dissipative coupling, it is necessary to understand how it revises the structure of the bounce 
solution. Now, in general, the non-local character of the interaction inhibits access to an 
exact solution of the classical equation of motion. In such cases, the effect of the dissipative 
coupling can be explored perturbatively or with the assistance of the renormalization group 
(see the discussion in Section 8.2). However, by tailoring our choice of potential V(q), we 
can gain some intuition about the more general situation. 

To this end, let us consider a particle of mass $m$ confined in a metastable minimum by a
(semi-infinite) harmonic potential trap (see figure),

$$V(q) = \begin{cases} m\omega^2 q^2/2, & 0 < |q| < a, \\ -\infty, & |q| > a. \end{cases}$$

Further, let us assume that the environment imparts an ohmic dissipation with a damping
or viscosity $\eta$. To keep our discussion general, let us consider the combined impact of
dissipation and temperature on the rate of tunneling from the potential trap. To do so,
following Langer 28 it is natural to investigate the “quasi-equilibrium” quantum partition
function $Z$ of the combined system. In this case, the tunneling rate appears as an imaginary
contribution to the free energy $F = -T\ln Z$, namely $\Gamma = -(2/\hbar)\,{\rm Im}\, F$.

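By drawing on the path integral, the quantum partition function of the system can be presented as a functional integral $Z = \int_{q(\beta)=q(0)} Dq\, e^{-S_{\rm eff}/\hbar}$ where, as we have seen above, for ohmic coupling, the Euclidean action assumes the form

$$S_{\rm eff}[q] = \int_0^\beta d\tau\,\left(\frac{m}{2}\dot q^2 + V(q)\right) + \frac{\eta}{4\pi}\int_0^\beta d\tau\, d\tau'\,\left(\frac{q(\tau)-q(\tau')}{\tau-\tau'}\right)^2.$$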

Once again, to estimate the tunneling rate, we will suppose that the barrier is high and the temperature is low, so that the path integral is dominated by stationary configurations of the action. In this case, one may identify three distinct solutions. In the first place, the particle may remain at $q = 0$ poised precariously on the maximum of the inverted harmonic potential. Contributions from this solution and the associated harmonic fluctuations reproduce terms in the quantum partition function associated with states of the closed harmonic potential trap. Secondly, there exists a singular solution in which the particle remains at the minimum of the inverted potential, i.e. perched on the potential barrier. The latter presents a negligible contribution to the quantum partition function and can be neglected. Finally, there exists a bounce solution in which the particle injected at a position $q$ inside the well accelerates down the inverted potential gradient, is reflected from the potential barrier, and returns to the initial position $q$ in a time $\beta$. While, in the limit $\beta\to\infty$, the path integral singles out the boundary condition $q(0) = q(\beta) \to 0$, at finite $\beta$ the boundary condition will depart from 0 in a manner that depends non-trivially on the temperature. It is this general bounce solution that governs the decay rate.

[28] J. S. Langer, Theory of the condensation point, Ann. Phys. (NY) 41 (1967), 108-57.

Since, in the inverted potential, the classical bounce trajectory stays within the interval over which the potential is quadratic, a variation of the Euclidean action with respect to $q(\tau)$ yields the classical equation of motion

$$-m\ddot q + m\omega_c^2 q + \frac{\eta}{\pi}\int_0^\beta d\tau'\,\frac{q(\tau)-q(\tau')}{(\tau-\tau')^2} = A\,\delta(\tau-\beta/2),$$

where the term on the right-hand side of the equation imparts an impulse that changes discontinuously the velocity of the particle, while the coefficient $A$ is chosen to ensure symmetry of the bounce solution on the Euclidean time interval. Turning to the Fourier representation, the solution of the saddle-point equation then assumes the form

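$$q_n = A\,T\,e^{-i\omega_n\beta/2}\,g(\omega_n), \qquad g(\omega_n) \equiv \left[m(\omega_n^2+\omega_c^2)+\eta|\omega_n|\right]^{-1}. \qquad (3.45)$$

Imposing the condition that $q(\tau=\beta/2) = a$, one finds that $A = a/f$ where $f \equiv T\sum_n g(\omega_n)$. Finally, the action of the bounce is given by

$$S_{\rm bounce} = \frac{1}{2T}\sum_n \left(m(\omega_n^2+\omega_c^2)+\eta|\omega_n|\right)|q_n|^2 = \frac{a^2}{2f}. \qquad (3.46)$$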
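The intermediate steps leading to Eqs. (3.45) and (3.46) can be sketched as follows. We assume the Fourier convention $q(\tau)=\sum_n q_n e^{i\omega_n\tau}$, and that the non-local dissipative term contributes $\eta|\omega_n|q_n$ in frequency space (which follows from $\int_{-\infty}^{\infty}ds\,(1-\cos\omega s)/s^2=\pi|\omega|$ when the $\tau'$ integration may be extended over the real axis); these are our assumptions for the sketch.

```latex
\begin{align*}
\delta(\tau-\beta/2) &= T\sum_n e^{i\omega_n(\tau-\beta/2)}, \\
\left[m(\omega_n^2+\omega_c^2)+\eta|\omega_n|\right] q_n &= A\,T\,e^{-i\omega_n\beta/2}
  \;\Rightarrow\; q_n = A\,T\,e^{-i\omega_n\beta/2}\,g(\omega_n), \\
q(\beta/2) &= \sum_n q_n\,e^{i\omega_n\beta/2} = A\,T\sum_n g(\omega_n) = A f = a
  \;\Rightarrow\; A = a/f, \\
S_{\rm bounce} &= \frac{1}{2T}\sum_n g(\omega_n)^{-1}\,|q_n|^2
  = \frac{A^2 T}{2}\sum_n g(\omega_n) = \frac{A^2 f}{2} = \frac{a^2}{2f}.
\end{align*}
```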

1. To make sense of these expressions, as a point of reference, let us first determine the zero-temperature tunneling rate in the absence of dissipation, that is $\eta\to0$ and $\beta\to\infty$. In this case, the (Matsubara) frequency summation translates to the continuous integral, $f = \int_{-\infty}^{\infty}(d\omega/2\pi)\,g(\omega) = (2m\omega_c)^{-1}$. Using this result, the bounce action (3.46) takes the form $S_{\rm bounce} = m\omega_c a^2$. As one would expect, the tunneling rate $\Gamma\sim e^{-S_{\rm bounce}}$ is controlled by the ratio of the potential barrier height $m\omega_c^2a^2/2$ to the attempt frequency $\omega_c$. Also notice that the bounce trajectory is given by

$$q(\tau) = \frac{a}{f}\int_{-\infty}^{\infty}\frac{d\omega}{2\pi}\,e^{i\omega(\tau-\beta/2)}\,g(\omega) = a\,e^{-\omega_c|\tau-\beta/2|},$$

i.e., as expected from our discussion in Section 3.3, the particle spends only a time $1/\omega_c$ in the under-barrier region.

2. Now, restricting attention to the zero-temperature limit, let us consider the influence of dissipation on the nature of the bounce solution and the capacity for tunneling. Focusing on the limit in which the dynamics of the particle is overdamped, $\eta\gg m\omega_c$, $f = \int_{-\infty}^{\infty}(d\omega/2\pi)\,g(\omega) \simeq (2/\pi\eta)\ln(\eta/m\omega_c)$, which implies $S_{\rm bounce} = \pi\eta a^2/(4\ln[\eta/(m\omega_c)])$. In particular, this result shows that, in the limit $\eta\to\infty$, the coupling of the particle to the ohmic bath leads to an exponential suppression of the tunneling rate while only a weak dependence on the jump frequency persists. Physically, this result is easy to rationalize: under-barrier tunneling is a feature of the quantum mechanical system. In the transfer of energy to and from the external bath, the phase coherence of the particle is lost. At zero temperature, the tunneling rate becomes suppressed and the particle confined.

3. Let us now consider the influence of temperature on the tunneling rate when the dissipative coupling is inactive, $\eta\to0$. In this case, the discrete frequency summation takes the form [29] $f = T\sum_n g(\omega_n) = \coth(\beta\omega_c/2)/(2\omega_c m)$. Using this result, one obtains the action $S_{\rm bounce} = m\omega_c a^2\tanh(\beta\omega_c/2)$. In the low temperature limit $\beta\to\infty$, $S_{\rm bounce} = m\omega_c a^2$ as discussed above. At high temperatures $\beta\to0$, as expected, one recovers a classical activated dependence of the escape rate, namely $S_{\rm bounce}\simeq\beta m\omega_c^2a^2/2$.

4. Finally, let us briefly remark on the interplay of thermal activation with ohmic dissipation. Applying the Euler-Maclaurin formula $\sum_{m=0}^{\infty} f(m) = \int_0^\infty dx\,f(x) + \frac{f(0)}{2} - \frac{f'(0)}{12} + \cdots$ to relate discrete sums over Matsubara frequencies to their zero-temperature integral limits, one finds that $S_{\rm bounce}(T=0) - S_{\rm bounce}(T) \propto \eta T^2$. This shows that, in the dissipative regime, an increase in temperature lowers the bounce action, and hence enhances the tunneling rate, by an amount proportional to the damping (consistent with the $\tanh$ dependence found in the dissipationless case above).
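The four limits discussed above can be checked numerically from $f = T\sum_n g(\omega_n)$ and $S_{\rm bounce} = a^2/(2f)$. The sketch below is a minimal illustration, not the authors' procedure: the function names, the cutoff `n_max`, and the tail estimate are our assumptions, and units are chosen such that $\hbar = 1$.

```python
import math

def g(w, m, wc, eta):
    """g(w) = [m (w^2 + wc^2) + eta |w|]^(-1), cf. Eq. (3.45)."""
    return 1.0 / (m * (w * w + wc * wc) + eta * abs(w))

def f_finite_T(T, m, wc, eta, n_max=200_000):
    """f = T * sum_n g(omega_n) with omega_n = 2*pi*n*T.

    The sum is cut off at |n| = n_max; the neglected tail is estimated from
    the large-frequency form g ~ 1/(m w^2) (an approximation of this sketch).
    """
    h = 2.0 * math.pi * T
    total = g(0.0, m, wc, eta)
    for n in range(1, n_max + 1):
        total += 2.0 * g(n * h, m, wc, eta)
    tail = 1.0 / (math.pi * m * h * (n_max + 0.5))
    return T * total + tail

def f_zero_T(m, wc, eta):
    """f = integral dw/(2 pi) g(w) over the real axis, in closed form."""
    disc = eta * eta - 4.0 * m * m * wc * wc
    if disc > 0.0:                       # overdamped: two real roots
        s = math.sqrt(disc)
        return math.log((eta + s) / (eta - s)) / (math.pi * s)
    if disc < 0.0:                       # underdamped (includes eta = 0)
        s = math.sqrt(-disc)
        return (2.0 / (math.pi * s)) * (0.5 * math.pi - math.atan(eta / s))
    return 2.0 / (math.pi * eta)         # critically damped

def bounce_action(a, f):
    """S_bounce = a^2 / (2 f), Eq. (3.46)."""
    return a * a / (2.0 * f)

if __name__ == "__main__":
    m, wc, a = 1.0, 1.0, 1.0
    print("item 1:", bounce_action(a, f_zero_T(m, wc, 0.0)), "vs", m * wc * a * a)
    eta = 1000.0
    print("item 2:", f_zero_T(m, wc, eta), "vs", 2.0 / (math.pi * eta) * math.log(eta / (m * wc)))
    T = 0.5
    print("item 3:", f_finite_T(T, m, wc, 0.0), "vs", 1.0 / math.tanh(wc / (2 * T)) / (2 * wc * m))
    eta, T = 3.0, 0.01
    print("item 4:", f_finite_T(T, m, wc, eta) - f_zero_T(m, wc, eta),
          "vs", math.pi * eta * T * T / (3.0 * m * m * wc ** 4))
```

The last line compares the temperature-induced shift of $f$ with the leading Euler-Maclaurin correction $-(\pi T^2/3)\,g'(0^+) = \pi\eta T^2/(3m^2\omega_c^4)$.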

This concludes our cursory discussion of the application of the Feynman path integral to 
dissipative quantum tunneling. As mentioned above, our brief survey was able only to 
touch upon the broad field of research. Those interested in learning more about the field of 
macroscopic quantum tunneling are referred to the wider literature. To close this chapter, 
we turn now to our penultimate application of the path integral - quantum mechanical spin. 


Path integral for spin 

The quantum mechanics of a spin-1/2 particle is a standard example in introductory courses. Indeed, there is hardly any other system whose quantum mechanics is as easy to formulate. Given that, it is perhaps surprising that for a long time the spin problem defied all attempts to cast it in path integral form: Feynman, the architect of the path integral, did not succeed in incorporating spin into the new formalism. It took several decades to fill this gap (for a review of the early history up to 1980, see Schulman’s text [1]), and a fully satisfactory formulation of the subject was obtained no earlier than 1988. (The present exposition follows closely the lines of the review by Michael Stone [30].)

Why then is it so difficult to find a path integral of spin? In hindsight it turns out that the spin path integral is in fact no more complex than any other path integral; it merely appears to be a bit unfamiliar. The reason is that, on the one hand, the integrand of the path integral is essentially the exponentiated classical action whilst, on the other, the classical mechanics of spin is a subject that is not standard in introductory or even advanced courses. In other words, the path integral approach must, by necessity, lead to an unusual object. The fact that the classical mechanics of spin is hardly ever mentioned is related not only to the common view that spin is something “fundamentally quantum” but also to the fact that the mechanics of a classical spin (see below) cannot be expressed within the standard formulation of Hamiltonian mechanics, i.e. there is no formulation in terms of a set of globally defined coordinates and equally many global momenta. It is therefore inevitable that one must resort to the (less widely applied) symplectic formulation of Hamiltonian mechanics [31]. However, as we will see below, the classical mechanics of spin can nevertheless be quite easily understood physically.

[29] For details on how to implement the discrete frequency summation, see the Info block on page 170 below.
[30] M. Stone, Supersymmetry and the quantum mechanics of spin, Nucl. Phys. B 314 (1989), 557-86.

Besides attempting to elucidate the connections between quantum and classical mechanics 
of spin, there is yet another motivation for discussing the spin path integral. Pretending that 
we have forgotten essential quantum mechanics, we will formulate the path integral ignoring 
the fact that spin quantum numbers are half integer or integer. The quantization of spin 
will then be derived in hindsight, by way of a geometric consideration. In other words, the 
path integral formulation demonstrates how quantum mechanical results can be obtained 
by geometric rather than standard algebraic reasoning. Finally, the path integral of spin will 
serve as a basic platform on which our analysis of higher-dimensional spin systems below 
will be based. 


A reminder of finite-dimensional SU(2)-representation theory
In order to formulate the spin path integral, it is necessary to recapitulate some facts regarding the role of SU(2) in quantum mechanics. The special unitary group in two dimensions, SU(2), is defined by ${\rm SU}(2) = \{g\in{\rm Mat}(2\times2,\mathbb{C})\,|\,g^\dagger g = \mathbb{1}_2,\ \det g = 1\}$, where $\mathbb{1}_2$ is the two-dimensional unit matrix. Counting independent components one finds that the group has three free real parameters or, equivalently, that its Lie algebra, su(2), is three-dimensional. As we have seen, the basis vectors of the algebra - the group generators - $\hat S^i$, $i = x,y,z$, satisfy the closure relation $[\hat S^i,\hat S^j] = i\epsilon_{ijk}\hat S^k$, where $\epsilon_{ijk}$ is the familiar fully antisymmetric tensor. Another useful basis representation of su(2) is given by the spin raising and lowering operators, $\hat S^\pm = \hat S^x \pm i\hat S^y$. Again, as we have seen earlier, the algebra $\{\hat S^+,\hat S^-,\hat S^z\}$ is defined by the commutation relations $[\hat S^+,\hat S^-] = 2\hat S^z$, $[\hat S^z,\hat S^\pm] = \pm\hat S^\pm$.
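The commutation relations quoted above can be verified explicitly for the lowest spin representations. The sketch below builds the matrices from the standard ladder-operator matrix elements $\langle m+1|\hat S^+|m\rangle = \sqrt{S(S+1)-m(m+1)}$ (a textbook fact not spelled out in this section); the helper names are hypothetical and only the Python standard library is used.

```python
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def spin_matrices(two_s):
    """Return (Sx, Sy, Sz, S+, S-) for spin S = two_s/2 in the basis m = S, ..., -S."""
    s = two_s / 2.0
    dim = two_s + 1
    m_vals = [s - k for k in range(dim)]
    sz = [[complex(m_vals[i]) if i == j else 0j for j in range(dim)] for i in range(dim)]
    sp = [[0j] * dim for _ in range(dim)]
    for col in range(1, dim):
        m = m_vals[col]
        # <m+1| S+ |m> = sqrt(S(S+1) - m(m+1)); |m+1> sits one row above |m>
        sp[col - 1][col] = complex(math.sqrt(s * (s + 1) - m * (m + 1)))
    sm = [[sp[j][i].conjugate() for j in range(dim)] for i in range(dim)]
    sx = [[(sp[i][j] + sm[i][j]) / 2 for j in range(dim)] for i in range(dim)]
    sy = [[(sp[i][j] - sm[i][j]) / 2j for j in range(dim)] for i in range(dim)]
    return sx, sy, sz, sp, sm

if __name__ == "__main__":
    for two_s in (1, 2, 3):
        sx, sy, sz, sp, sm = spin_matrices(two_s)
        print(two_s / 2, close(commutator(sx, sy), scale(1j, sz)),
              close(commutator(sp, sm), scale(2, sz)))
```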

Each group element can be uniquely parameterized in terms of the exponentiated algebra. For example, in the Euler angle representation, the group is represented as

$${\rm SU}(2) = \left\{ g(\phi,\theta,\psi) = e^{-i\phi\hat S_3}\,e^{-i\theta\hat S_2}\,e^{-i\psi\hat S_3} \;\middle|\; \phi,\psi\in[0,2\pi],\ \theta\in[0,\pi] \right\}.$$

The Hilbert spaces $\mathcal{H}_S$ of quantum spin are irreducible representation spaces of SU(2). Within the spaces $\mathcal{H}_S$, SU(2) acts in terms of representation matrices (which will be denoted by $g$) and the matrix representations of its generators $\hat S_i$. The index $S$ is the so-called weight of the representation (physically: the total spin) [32]. Within each $\mathcal{H}_S$, there is a distinguished state, a state of highest weight $|\!\uparrow\rangle$, which is defined as the (normalized) eigenstate of $\hat S_3$ with maximum eigenvalue, $S$ (physically: a spin state polarized in the 3-direction). Owing to the irreducibility of the representation, each (normalized) state of the Hilbert space $\mathcal{H}_S$ can be obtained by applying the Euler-angle-parameterized elements of the representation to the maximum weight state.

[31] Within this formulation, the phase space is regarded as a differential manifold with a symplectic structure (cf. Arnold’s text on classical mechanics: V. I. Arnold, Mathematical Methods of Classical Mechanics [Springer-Verlag, 1978]). In the case of spin, this manifold is the 2-sphere $S^2$.

Being a compact group, SU(2) can be integrated over, i.e. it makes sense to define objects like $\int_{{\rm SU}(2)} dg\, f(g)$, where $f$ is some function of $g$ and $dg$ is a realization of a group measure [33]. Among the variety of measures that can be defined in principle, the (unique) Haar measure plays a distinguished role. It has the convenient property that it is invariant under left and right multiplication of $g$ by fixed group elements, i.e.

$$\forall h\in{\rm SU}(2):\quad \int dg\, f(gh) = \int dg\, f(hg) = \int dg\, f(g),$$

where, for notational simplicity, we have omitted the subscript in $\int_{{\rm SU}(2)}$.

Construction of the path integral 

With this background, we are now in a position to formulate the Feynman path integral. To be specific, let us consider a particle of spin $S$ subject to the Hamiltonian

$$\hat H = \mathbf{B}\cdot\hat{\mathbf{S}},$$

where $\mathbf{B}$ is a magnetic field and $\hat{\mathbf{S}} = (\hat S_1,\hat S_2,\hat S_3)$ is a vector of spin operators in the spin-$S$ representation. Our aim is to calculate the imaginary time path integral representation of the partition function $Z = {\rm tr}\,e^{-\beta\hat H}$. In constructing the path integral we will follow the general strategy outlined at the end of Section 3.2, i.e. the first step is to represent $Z$ as $Z = {\rm tr}\,(e^{-\epsilon\hat H})^N$, where $\epsilon = \beta/N$. Next, we have - the most important step in the construction - to insert a suitably chosen resolution of identity between each of the factors $e^{-\epsilon\hat H}$. A representation that will lead us directly to the final form of the path integral is specified by

$${\rm id} = C\int dg\,|g\rangle\langle g|, \qquad (3.47)$$

where “id” represents the unit operator in $\mathcal{H}_S$, $\int dg$ is a group integral over the Haar measure, $C$ is some constant, and $|g\rangle \equiv g|\!\uparrow\rangle$ is the state obtained by letting the representation matrix $g$ act on the maximum weight state $|\!\uparrow\rangle$ (cf. the glossary of SU(2) representation theory above).

Of course it remains to be verified that the integral (3.47) is indeed proportional to the unit operator. That this is so follows from Schur’s lemma, which states that, if, and only if, an operator $\hat A$ commutes with all representation matrices of an irreducible group representation (in our case the $g$s acting in the Hilbert space $\mathcal{H}_S$), $\hat A$ is proportional to the unit matrix. That the group integral above fulfills the global commutativity criterion follows from the properties of the Haar measure: $\forall h\in{\rm SU}(2)$:

$$h\int dg\,|g\rangle\langle g| = \int dg\,|hg\rangle\langle g| \;\overset{\rm Haar}{=}\; \int dg\,|hh^{-1}g\rangle\langle h^{-1}g| = \int dg\,|g\rangle\langle g|\,h.$$

Thus, $\int dg\,|g\rangle\langle g|$ is, indeed, proportional to the unit operator. The proportionality constant appearing in Eq. (3.47) will not be of any concern to us - apart from the fact that it is non-zero [34].

[32] The index $S$ is defined in terms of the eigenvalues of the Casimir operator (physically: the total angular momentum operator) $\hat{\mathbf{S}}^2 \equiv \sum_i \hat S_i^2$ according to the relation $\forall\,|s\rangle\in\mathcal{H}_S:\ \hat{\mathbf{S}}^2|s\rangle = S(S+1)|s\rangle$.

[33] To define group measures in a mathematically clean way, one makes use of the fact that (as a Lie group) SU(2) is a three-dimensional differentiable manifold. Group measures can then be defined in terms of the associated volume form (see the primer in differential geometry on page 537).
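For $S = 1/2$, the statement that the coherent-state projectors integrate to a multiple of the unit operator can be illustrated by direct quadrature. The sketch below makes two simplifying assumptions of ours: the phase $\psi$ drops out of $|g\rangle\langle g|$, so only the angles $(\phi,\theta)$ are integrated with the sphere measure $\sin\theta\,d\theta\,d\phi$, and the normalization is taken as $(2S+1)/4\pi$. The function names are hypothetical.

```python
import cmath
import math

def coherent_state(phi, theta):
    """Spin-1/2 state exp(-i phi S3) exp(-i theta S2) |up> as (up, down) amplitudes."""
    return (cmath.exp(-0.5j * phi) * math.cos(0.5 * theta),
            cmath.exp(+0.5j * phi) * math.sin(0.5 * theta))

def sphere_average_projector(n_theta=200, n_phi=64):
    """(2S+1)/(4 pi) times the sphere integral of |g><g| for S = 1/2 (midpoint rule)."""
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    acc = [[0j, 0j], [0j, 0j]]
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        weight = math.sin(theta) * d_theta * d_phi
        for k in range(n_phi):
            g = coherent_state((k + 0.5) * d_phi, theta)
            for a in range(2):
                for b in range(2):
                    acc[a][b] += weight * g[a] * g[b].conjugate()
    c = 2.0 / (4.0 * math.pi)
    return [[c * x for x in row] for row in acc]

if __name__ == "__main__":
    for row in sphere_average_projector():
        print([round(abs(x), 6) for x in row])
```

Up to the discretization error of the midpoint rule, the result is the $2\times2$ unit matrix.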

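Substituting the resolution of identity into the time-sliced partition function and making use of the fact that

$$\langle g_{i+1}|e^{-\epsilon\mathbf{B}\cdot\hat{\mathbf{S}}}|g_i\rangle \simeq \langle g_{i+1}|g_i\rangle - \epsilon\langle g_{i+1}|\mathbf{B}\cdot\hat{\mathbf{S}}|g_i\rangle \;\overset{\langle g_i|g_i\rangle=1}{=}\; 1 - \langle g_i|g_i\rangle + \langle g_{i+1}|g_i\rangle - \epsilon\langle g_{i+1}|\mathbf{B}\cdot\hat{\mathbf{S}}|g_i\rangle$$

$$\simeq \exp\left[\langle g_{i+1}|g_i\rangle - \langle g_i|g_i\rangle - \epsilon\langle g_{i+1}|\mathbf{B}\cdot\hat{\mathbf{S}}|g_i\rangle\right],$$

one obtains

$$Z = \lim_{N\to\infty}\int_{g_N=g_0}\prod_{i=0}^{N} dg_i\,\exp\left[-\epsilon\sum_{i=0}^{N-1}\left(-\frac{\langle g_{i+1}|g_i\rangle - \langle g_i|g_i\rangle}{\epsilon} + \langle g_{i+1}|\mathbf{B}\cdot\hat{\mathbf{S}}|g_i\rangle\right)\right].$$

By taking the limit $N\to\infty$, this can be cast in path integral form,

$$Z = \int Dg\,\exp\left[-\int_0^\beta d\tau\,\left(-\langle\partial_\tau g|g\rangle + \langle g|\mathbf{B}\cdot\hat{\mathbf{S}}|g\rangle\right)\right], \qquad (3.48)$$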


where the $\mathcal{H}_S$-valued function $|g(\tau)\rangle$ is the continuum limit of $|g_i\rangle$. Equation (3.48) is our final, albeit somewhat over-compact, representation of the path integral. In order to give this expression a physical interpretation, we need to examine more thoroughly the meaning of the states $|g\rangle$.

In the literature, the states $|g\rangle$ expressed in the Euler-angle representation

$$|\tilde g(\phi,\theta,\psi)\rangle \equiv e^{-i\phi\hat S_3}\,e^{-i\theta\hat S_2}\,e^{-i\psi\hat S_3}|\!\uparrow\rangle$$

are referred to as spin coherent states. Before discussing the origin of this terminology, it is useful to explore the algebraic structure of these states. Firstly, note that the maximum weight state $|\!\uparrow\rangle$ is, by definition, an eigenstate of $\hat S_3$ with maximum eigenvalue $S$. Thus, $|\tilde g(\phi,\theta,\psi)\rangle = e^{-i\phi\hat S_3}e^{-i\theta\hat S_2}|\!\uparrow\rangle\,e^{-i\psi S}$ and the angle $\psi$ enters the coherent state merely as a phase or gauge factor. By contrast, the two remaining angles $\theta$ and $\phi$ act through true rotations. Now, the angular variables $\phi\in[0,2\pi)$ and $\theta\in[0,\pi]$ define a standard representation of the 2-sphere. In view of the fact that the states $|\tilde g(\phi,\theta,\psi)\rangle$ cover the entire Hilbert space $\mathcal{H}_S$, we are led to suspect that the latter bears similarity with a sphere [35]. To substantiate this view, let us compute the expectation values

$$n_i \equiv \langle\tilde g(\phi,\theta,\psi)|\hat S_i|\tilde g(\phi,\theta,\psi)\rangle, \qquad i = 1,2,3. \qquad (3.49)$$

[34] Actually, the constant $C$ can be straightforwardly computed by taking the trace of Eq. (3.47), which leads to $C$ = (dimension of the representation space)/(volume of the group).

To this end, we first derive an auxiliary identity which will spare us much of the trouble that would arise in expanding the exponentials appearing in the definition of $|\tilde g\rangle$. By making use of the identity ($i\neq j$)

$$e^{-i\phi\hat S_i}\,\hat S_j\,e^{i\phi\hat S_i} = e^{-i\phi[\hat S_i,\;\cdot\;]}\,\hat S_j = \hat S_j\cos\phi + \epsilon_{ijk}\hat S_k\sin\phi, \qquad (3.50)$$

where the last equality follows from the fact that $\cos x$ ($\sin x$) contains $x$ in even (odd) orders and $[\hat S_i,\;\cdot\;]^2\hat S_j = \hat S_j$, it is straightforward to obtain (exercise) $\mathbf{n} = S(\sin\theta\cos\phi, \sin\theta\sin\phi, \cos\theta)$, i.e. $\mathbf{n}$ is the product of $S$ and a unit vector parameterized in terms of spherical coordinates.
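Both the rotation identity (3.50) and the resulting expectation values can be checked explicitly for $S = 1/2$, where $e^{-i\alpha\hat S_i} = \cos(\alpha/2) - i\sigma_i\sin(\alpha/2)$ in terms of the Pauli matrices $\sigma_i$. The sketch below is an illustration under that assumption; all function names are ours.

```python
import math

PAULI = {
    1: [[0, 1], [1, 0]],
    2: [[0, -1j], [1j, 0]],
    3: [[1, 0], [0, -1]],
}

def spin_op(i):
    """Spin-1/2 operator S_i = sigma_i / 2."""
    return [[0.5 * x for x in row] for row in PAULI[i]]

def rotation(i, angle):
    """exp(-i * angle * S_i) = cos(angle/2) - i sigma_i sin(angle/2) for S = 1/2."""
    c, s = math.cos(0.5 * angle), math.sin(0.5 * angle)
    return [[c * (a == b) - 1j * s * PAULI[i][a][b] for b in range(2)] for a in range(2)]

def matmul(a, b):
    return [[sum(a[r][k] * b[k][c] for k in range(2)) for c in range(2)] for r in range(2)]

def dagger(a):
    return [[complex(a[c][r]).conjugate() for c in range(2)] for r in range(2)]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def levi_civita(i, j, k):
    """Fully antisymmetric tensor for indices in {1, 2, 3}."""
    return (i - j) * (j - k) * (k - i) // 2

def lhs_350(i, j, phi):
    u = rotation(i, phi)
    return matmul(matmul(u, spin_op(j)), dagger(u))

def rhs_350(i, j, phi):
    out = [[math.cos(phi) * x for x in row] for row in spin_op(j)]
    for k in (1, 2, 3):
        eps = levi_civita(i, j, k)
        if eps:
            sk = spin_op(k)
            out = [[out[r][c] + eps * math.sin(phi) * sk[r][c] for c in range(2)]
                   for r in range(2)]
    return out

def n_vector(phi, theta):
    """<g|S_i|g> in the state |g> = exp(-i phi S_3) exp(-i theta S_2) |up>."""
    u = matmul(rotation(3, phi), rotation(2, theta))
    g = [u[0][0], u[1][0]]  # first column = image of |up>
    return tuple(
        sum(complex(g[r]).conjugate() * spin_op(i)[r][c] * g[c]
            for r in range(2) for c in range(2)).real
        for i in (1, 2, 3)
    )

if __name__ == "__main__":
    print(close(lhs_350(3, 1, 0.7), rhs_350(3, 1, 0.7)))
    print(n_vector(0.4, 1.1))
```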

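This is the key to understanding the terminology “spin coherent states.” The vectors $|\tilde g(\phi,\theta,\psi)\rangle$ represent the closest approximation of a classical angular momentum state one can formulate in Hilbert space (see figure).

Let us now see what happens if we employ the Euler angle representation in formulating the path integral. A first and important observation is that the path integral is gauge invariant - in the sense that it does not depend on the U(1)-phase, $\psi$. As to the $\mathbf{B}$-dependent part of the action, the gauge invariance is manifest: Eq. (3.49) implies that

$$S_B[\phi,\theta] = \int_0^\beta d\tau\,\langle\tilde g|\mathbf{B}\cdot\hat{\mathbf{S}}|\tilde g\rangle = \int_0^\beta d\tau\,\langle g|\mathbf{B}\cdot\hat{\mathbf{S}}|g\rangle = S\int_0^\beta d\tau\,\mathbf{n}\cdot\mathbf{B} = SB\int_0^\beta d\tau\,\cos\theta.$$

Here, and in the following, $\mathbf{n}$ denotes the unit vector $(\sin\theta\cos\phi,\sin\theta\sin\phi,\cos\theta)$, the field has been taken to point in the 3-direction, $\mathbf{B} = B\,\mathbf{e}_3$, and we have introduced the gauge-independent part $|g\rangle$ of the state vector by setting $|\tilde g\rangle \equiv |g\rangle\exp(-iS\psi)$ or, equivalently, $|g(\phi,\theta)\rangle \equiv e^{-i\phi\hat S_3}e^{-i\theta\hat S_2}|\!\uparrow\rangle$. Substituting this representation into the first term of the action of Eq. (3.48), one obtains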



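$$-\int_0^\beta d\tau\,\langle\partial_\tau\tilde g|\tilde g\rangle = -\int_0^\beta d\tau\,\langle\partial_\tau(e^{-iS\psi}g)|g\,e^{-iS\psi}\rangle = -\int_0^\beta d\tau\,\left(\langle\partial_\tau g|g\rangle + iS\,\partial_\tau\psi\,\langle g|g\rangle\right) = -\int_0^\beta d\tau\,\langle\partial_\tau g|g\rangle, \qquad (3.51)$$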


where the last equality holds because $\langle g|g\rangle = 1$ is constant and $\psi$ is periodic in $\beta$. We thus conclude that the path integral does not depend on the gauge phase, i.e. it effectively extends over paths living on the 2-sphere (rather than the entire group manifold SU(2)). This finding is reassuring in the sense that a degree of freedom living on a sphere comes close to what one might intuitively expect to be the classical counterpart of a quantum particle of definite angular momentum.

[35] There is a group theoretical identity behind this observation, namely the identification $S^2 \simeq {\rm SU}(2)/{\rm U}(1)$ (locally, ${\rm SU}(2) \simeq S^2\times{\rm U}(1)$), where U(1) is the “gauge” subgroup contained in SU(2).

Let us now proceed by exploring the action of the path integral. By using the auxiliary identity (3.50) it is a straightforward matter to show that

$$S_{\rm top}[\phi,\theta] \equiv -\int_0^\beta d\tau\,\langle\partial_\tau g|g\rangle = -iS\int_0^\beta d\tau\,\partial_\tau\phi\,\cos\theta = iS\int_0^\beta d\tau\,\partial_\tau\phi\,(1-\cos\theta). \qquad (3.52)$$

Combining this with the $\mathbf{B}$-dependent term discussed above, one obtains

$$S[\theta,\phi] = S_B[\phi,\theta] + S_{\rm top}[\phi,\theta] = S\int_0^\beta d\tau\,\left[B\cos\theta + i(1-\cos\theta)\,\partial_\tau\phi\right] \qquad (3.53)$$

for the action of the path integral for spin.


EXERCISE Derive the Euler-Lagrange equations associated with this action. Show that they are equivalent to the Bloch equations $i\partial_\tau\mathbf{n} = \mathbf{B}\times\mathbf{n}$ of a spin with expectation value $\langle\hat{\mathbf{S}}\rangle = S\mathbf{n}$ subject to a magnetic field. Here, $\mathbf{n}(\phi,\theta)\in S^2$ is the unit vector defined by the two angles $\phi,\theta$.
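One possible route through this exercise is sketched below. It assumes, as in the action (3.53), a field pointing in the 3-direction, $\mathbf{B} = B\,\mathbf{e}_3$, and writes $L = S[B\cos\theta + i(1-\cos\theta)\,\partial_\tau\phi]$ for the Lagrangian; the notation $L$ is ours.

```latex
\begin{align*}
\frac{\partial L}{\partial\theta} - \frac{d}{d\tau}\frac{\partial L}{\partial(\partial_\tau\theta)}
  &= S\sin\theta\,\left(-B + i\,\partial_\tau\phi\right) = 0
  \;\Rightarrow\; i\,\partial_\tau\phi = B \quad (\sin\theta\neq0), \\
\frac{\partial L}{\partial\phi} - \frac{d}{d\tau}\frac{\partial L}{\partial(\partial_\tau\phi)}
  &= -\,iS\,\frac{d}{d\tau}(1-\cos\theta) = -\,iS\sin\theta\,\partial_\tau\theta = 0
  \;\Rightarrow\; \partial_\tau\theta = 0, \\
\partial_\tau\mathbf{n}\Big|_{\partial_\tau\theta=0}
  &= \partial_\tau\phi\,(-\sin\theta\sin\phi,\ \sin\theta\cos\phi,\ 0)
  = \partial_\tau\phi\;\mathbf{e}_3\times\mathbf{n}, \\
i\,\partial_\tau\mathbf{n} &= (i\,\partial_\tau\phi)\,\mathbf{e}_3\times\mathbf{n}
  = B\,\mathbf{e}_3\times\mathbf{n} = \mathbf{B}\times\mathbf{n}.
\end{align*}
```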


Analysis of the action 

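To formulate the second term in the action (3.53) in a more illuminating way, we note that the velocity of the point $\mathbf{n}$ moving on the unit sphere is given by $\dot{\mathbf{n}} = \dot\theta\,\mathbf{e}_\theta + \dot\phi\sin\theta\,\mathbf{e}_\phi$, where $(\mathbf{e}_r,\mathbf{e}_\theta,\mathbf{e}_\phi)$ form a spherical orthonormal system. We can thus rewrite Eq. (3.52) as

$$S_{\rm top}[\phi,\theta] = iS\int_0^\beta d\tau\,\dot{\mathbf{n}}\cdot\mathbf{A} = iS\oint_\gamma d\mathbf{n}\cdot\mathbf{A}, \qquad (3.54)$$

where

$$\mathbf{A} = \frac{1-\cos\theta}{\sin\theta}\,\mathbf{e}_\phi. \qquad (3.55)$$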


Notice that, in spite of its compact appearance, Eq. (3.54) does not represent a coordinate invariant formulation of the action $S_{\rm top}$. (The field $\mathbf{A}(\phi,\theta)$ explicitly depends on the coordinates $(\phi,\theta)$.) In fact, we will see in Chapter 9 that the action $S_{\rm top}$ cannot be expressed in a coordinate invariant manner, for reasons deeply rooted in the topology of the 2-sphere.

A second observation is that Eq. (3.54) can be read as the (Euclidean time) action of a particle of charge $S$ moving under the influence of a vector potential $\mathbf{A}$ [36]. Using standard formulae of vector calculus [37] one finds $\mathbf{B}_m \equiv \nabla\times\mathbf{A} = \mathbf{e}_r$, i.e. our particle moves in a radial magnetic field of constant strength unity. Put differently, the particle experiences the field of a magnetic “charge” of strength $4\pi$ centered at the origin of the sphere.
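For completeness, the curl can be worked out in spherical coordinates. To do so one has to extend $\mathbf{A}$ away from the unit sphere; the extension $A_\phi = (1-\cos\theta)/(r\sin\theta)$ used below is our assumption (it reduces to Eq. (3.55) at $r = 1$), and the computation holds away from the line $\theta = \pi$.

```latex
\begin{align*}
\mathbf{A} &= \frac{1-\cos\theta}{r\sin\theta}\,\mathbf{e}_\phi, \\
(\nabla\times\mathbf{A})_r &= \frac{1}{r\sin\theta}\,\partial_\theta\!\left(\sin\theta\,A_\phi\right)
  = \frac{1}{r\sin\theta}\,\partial_\theta\!\left(\frac{1-\cos\theta}{r}\right) = \frac{1}{r^2}, \\
(\nabla\times\mathbf{A})_\theta &= -\frac{1}{r}\,\partial_r\!\left(r A_\phi\right) = 0, \qquad
(\nabla\times\mathbf{A})_\phi = 0, \\
\oint_{S^2} d\mathbf{S}\cdot\mathbf{B}_m &= \int_0^{2\pi}\! d\phi\int_0^{\pi}\! d\theta\,\sin\theta = 4\pi .
\end{align*}
```

At $r = 1$ this gives $\mathbf{B}_m = \mathbf{e}_r$, and the total flux $4\pi$ identifies the strength of the magnetic “charge.”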


INFO If you find this statement difficult to reconcile with the Maxwell equation $\nabla\cdot\mathbf{B} = 0 \Rightarrow \oint_S \mathbf{B}\cdot d\mathbf{S} = 0$ for any closed surface $S$, notice that $\nabla\cdot\mathbf{B} = \nabla\cdot(\nabla\times\mathbf{A}) = 0$ holds only if $\mathbf{A}$ is non-singular. However, the vector potential Eq. (3.55) is manifestly singular along the line $(r,\theta=\pi)$ through the south pole of the sphere. The physical picture behind this singularity is as follows: imagine an infinitely thin solenoid running from $r = \infty$ through the south pole of the sphere to its center. Assuming that the solenoid contains a magnetic flux $4\pi$, the center of the sphere becomes a source of magnetic flux, the so-called Dirac monopole. This picture is consistent with the presence of a field $\mathbf{B} = \mathbf{e}_r$. It also explains the singularity of $\mathbf{A}$ along the string. (Of course, the solenoidal construction does not lead to the prediction of a genuine monopole potential: somewhere, at $r = \infty$, our auxiliary magnetic coil has to end, and this is where the flux lines emanating from the point $r = 0$ terminate.) The postulate of a flux line at the singularity of $\mathbf{A}$ merely helps to reconcile the presence of a radial magnetic field with the principles of electrodynamics. However, as far as our present discussion goes, this extra structure is not essential, i.e. we may simply interpret $r = 0$ as the position of a magnetic “charge.”

[36] See, e.g., H. Goldstein, Classical Mechanics (Addison-Wesley, 1981).
[37] See, e.g., J. D. Jackson, Classical Electrodynamics (Wiley, 1975).


To explore the consequences of this phenomenon, we apply Stokes’ theorem to write

$$S_{\rm top}[\mathbf{n}] = iS\oint_\gamma d\mathbf{n}\cdot\mathbf{A} = iS\int_{A_{\gamma,n}} d\mathbf{S}\cdot(\nabla\times\mathbf{A}) = iS\int_{A_{\gamma,n}} d\mathbf{S}\cdot\mathbf{e}_r = iS\,A_{\gamma,n}. \qquad (3.56)$$

Here, $A_{\gamma,n}$ is the domain on the 2-sphere which (a) has the curve $\gamma$ as its boundary, and (b) contains the north pole (see figure). The integral produces the area of this surface which we again denote by $A_{\gamma,n}$. Curiously, the action $S_{\rm top}$ is but a measure of the area bounded by the curve $\gamma:\tau\mapsto\mathbf{n}(\tau)$. However, simple as it is, this result should raise some suspicion: by assigning a designated role to the northern hemisphere of the sphere some symmetry breaking, not present in the original problem, has been introduced. Indeed, we might have defined our action by $S_{\rm top}[\phi,\theta] = iS\oint_\gamma d\mathbf{n}\cdot\mathbf{A}'$ where $\mathbf{A}' = -\frac{1+\cos\theta}{\sin\theta}\,\mathbf{e}_\phi = \mathbf{A} - 2\nabla\phi$ differs from $\mathbf{A}$ only by a gauge transformation [38]. The newly defined vector potential is non-singular in the southern hemisphere, so that application of Stokes’ theorem leads to the conclusion $S_{\rm top}[\mathbf{n}] = -iS\int_{A_{\gamma,s}} d\mathbf{S}\cdot\mathbf{B}_m = -iS\,A_{\gamma,s}$. Here, $A_{\gamma,s}$ is the area of a surface bounded by $\gamma$ but covering the south pole of the sphere. The absolute minus sign is due to the outward orientation of the surface $A_{\gamma,s}$.

Sir George Gabriel Stokes 1819-1903
As Lucasian Professor of Mathematics at Cambridge, Stokes established the science of hydrodynamics with his law of viscosity (1851), describing the velocity of a small sphere through a viscous fluid. Furthermore, he investigated the wave theory of light, named and explained the phenomenon of fluorescence, and theorized an explanation of the Fraunhofer lines in the solar spectrum. (Figure reproduced from Sir George Gabriel Stokes, Memoirs presented to the Cambridge Philosophical Society on the occasion of the Jubilee of Stokes, volume XVIII of the Transactions of the Cambridge Philosophical Society, Cambridge University Press, 1900.)


[38] You may, with some justification, feel uneasy about the fact that $\phi$ is not a true “function” on the sphere (or, alternatively, about the fact that $\oint d\mathbf{n}\cdot\nabla\phi = \phi(\beta) - \phi(0)$ may be a non-vanishing multiple of $2\pi$). We will return to the discussion of this ambiguity shortly. (Notice that a similarly hazardous manipulation is performed in the last equality of Eq. (3.52).)

One has to concede that the result obtained for the action $S_{\rm top}$ depends on the chosen gauge of the monopole vector potential! The difference between the northern and the southern variant of our analysis is given by

$$iS\int_{A_{\gamma,n}} d\mathbf{S}\cdot\mathbf{B}_m + iS\int_{A_{\gamma,s}} d\mathbf{S}\cdot\mathbf{B}_m = iS\int_{S^2} d\mathbf{S}\cdot\mathbf{e}_r = 4\pi iS,$$

where we have made use of the fact that $A_{\gamma,n}\cup A_{\gamma,s} = S^2$ is the full sphere. At first sight, it looks as if our analysis has led us to a gauge-dependent, and therefore pathological, result. Let us recall, however, that physical quantities are determined by the exponentiated action $\exp(-S[\mathbf{n}])$ and not by the action itself. Now, $S$ is either integer or half integer, which implies that the factor $\exp(4\pi iS) = 1$ is irrelevant. In the operator representation of the theory, spin quantization follows from the representation theory of the algebra su(2). It is a “non-local” feature, in the sense that the action of the spin operators on all eigenstates has to be considered to fix the dimensionality $2S+1$ of $\mathcal{H}_S$. In hindsight, it is thus not too surprising that the same information is encapsulated in a “global” condition (gauge invariance) imposed on the action of the path integral.
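The gauge argument can be made concrete numerically. The sketch below evaluates the loop integral $\oint d\mathbf{n}\cdot\mathbf{A}$ in the “northern” and “southern” gauges for a sample closed curve that winds once around the 3-axis, and compares the resulting weights $\exp(-S_{\rm top})$ for several values of $S$. The curve, the function names, and the midpoint discretization are illustrative assumptions of ours.

```python
import cmath
import math

def loop_integral(theta_of_t, phi_of_t, gauge, n_steps=20_000):
    """Midpoint-rule estimate of the loop integral of A . dn for t in [0, 2 pi].

    gauge = "north": A_phi = (1 - cos theta) / sin theta   (regular at the north pole)
    gauge = "south": A_phi = -(1 + cos theta) / sin theta  (regular at the south pole)
    Only the e_phi component of dn contributes: dn . A = sin(theta) * A_phi * dphi.
    """
    dt = 2.0 * math.pi / n_steps
    total = 0.0
    for k in range(n_steps):
        t0, t1 = k * dt, (k + 1) * dt
        d_phi = phi_of_t(t1) - phi_of_t(t0)
        c = math.cos(theta_of_t(0.5 * (t0 + t1)))
        total += ((1.0 - c) if gauge == "north" else -(1.0 + c)) * d_phi
    return total

def weight(spin, integral):
    """exp(-S_top) with S_top = i * spin * (loop integral of A . dn)."""
    return cmath.exp(-1j * spin * integral)

if __name__ == "__main__":
    theta = lambda t: 1.0 + 0.3 * math.sin(3.0 * t)   # sample curve (hypothetical)
    phi = lambda t: t
    i_n = loop_integral(theta, phi, "north")
    i_s = loop_integral(theta, phi, "south")
    print("difference / 4 pi:", (i_n - i_s) / (4.0 * math.pi))
    for spin in (0.5, 1.0, 1.5, 0.3):
        print(spin, abs(weight(spin, i_n) - weight(spin, i_s)))
```

For integer and half-integer $S$ the two gauges give the same weight; for a generic value such as $S = 0.3$ they do not, which is the path integral manifestation of spin quantization.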

Summarizing, we have found that the classical dynamics of a spin is that of a massless point particle on a sphere coupled to a monopole field $\mathbf{B}_m$. We have seen that the vector potential of the field cannot be globally continuous on the full sphere. More generally, the phase space $S^2$ cannot be represented in terms of a global system of “coordinates and momenta” which places it outside the scope of traditional treatments of classical mechanics. This probably explains the failure of early attempts to describe the spin in terms of a path integral or, equivalently, in terms of a Hamiltonian action.

In Chapter 9 we will use the path action (3.53) as a building block for our construction of the field theory of higher-dimensional spin systems. However, before concluding this section, let us make some more remarks on the curious properties of the monopole action $S_{\rm top}$. Unlike all other Euclidean actions encountered thus far, the action (3.54) is imaginary. In fact, it will stay imaginary upon Wick rotation $\tau\to it$ back to real times. More generally, $S_{\rm top}$ is invariant under the rescaling $\tau\to c\tau$, and invariant even under arbitrary reparameterizations $\tau\to g(\tau)\equiv\tau'$. This invariance is a hallmark of a topological term. Loosely speaking (see Chapter 9 for a deeper discussion), a topological term is a contribution to the action of a field theory that depends on the global geometry of a field configuration rather than on its local structure. In contrast, “conventional” operators in field theoretical actions measure the energy cost of dynamical or spatial field fluctuations. In doing so they must relate to a specific spatio-temporal reference frame, i.e. they cannot be invariant under reparameterization.

Summarizing our results, we have found that: 






1. The classical action of a spin is one of a massless particle (there is no standard kinetic energy term in Eq. (3.48)) moving on a unit sphere. The particle carries a magnetic moment of magnitude $S$. It is coupled (a) to a conventional magnetic field via its magnetic moment, and (b) to a monopole field via its orbital motion. Note that we have come, finally, to a position which hints at the difficulties plaguing attempts to formulate a classical mechanics of spin. The vector potential of a monopole, $\mathbf{A}$, cannot be globally defined on the entire sphere. The underlying physical reason is that, by the very nature of the monopole (flux going radially outwards everywhere), the associated vector potential must be singular at one point of the surface [39]. As a consequence, the classical phase space of the system, the sphere, cannot be covered by a global choice of coordinate system. (Unlike most standard problems of classical mechanics there is no system of globally defined “p”s and “q”s.) This fact largely spoils a description within the standard - coordinate-oriented - formulation of Hamiltonian mechanics (cf. the discussion in the article by Stone [40]).

2. Terms akin to the monopole contribution to the spin action appear quite frequently within path integral formulations of systems with non-trivial topology (like the two-sphere above). Depending on the particular context under consideration, one distinguishes between Wess-Zumino-Witten (WZW) terms, $\theta$-terms, Chern-Simons terms and a few other terms of topological origin. What makes these contributions generally important is that the value taken by these terms depends only on the topology of a field configuration but not on structural details. For further discussion of phenomena driven by non-trivial topological structures we refer to Chapter 9 below.


Trace formulae and quantum chaos 

As a final application of the path integral, we turn now to the consideration of problems 
in which the dynamics of the classical system is, itself, non-trivial. Introductory courses 
on classical mechanics usually convey the impression that dynamical systems behave in 
a regular and, at least in principle, mathematically predictable way. However, experience 
shows that the majority of dynamical processes in nature do not conform to this picture. 
Partly, or even fully, chaotic motion (i.e. motion that depends in a singular and, thereby, in 
an essentially unpredictable way on initial conditions) is the rule rather than the exception. 
In view of the drastic differences in the observable behavior of classically integrable and 
chaotic systems, an obvious question arises: in what way does the quantum phenomenology 
of chaotic systems differ from that associated with integrable dynamics? This question 
defines the field of quantum chaos. 


[39] To better understand this point, consider the integral of $\mathbf{A}$ along an infinitesimal closed curve $\gamma$ on the sphere. If $\mathbf{A}$ were globally continuous, we would have two choices to transform the integral into a surface integral over $\mathbf{B}$: an integral over the “large” or the “small” surface area bounded by $\gamma$. The monopole nature of $\mathbf{B}$ would demand that both integrals are proportional to the respective areas of the integration domain which, by assumption, are different: a contradiction. The resolution of this paradox is that $\mathbf{A}$ must be discontinuous at one point on the sphere, i.e. we cannot globally set $\mathbf{B} = \nabla\times\mathbf{A}$, and the choice of the integration area is prescribed by the condition that it must not encompass the singular point.

[40] M. Stone, Supersymmetry and the quantum mechanics of spin, Nucl. Phys. B 314 (1989), 557-86.

Understanding signatures of classically chaotic motion in quantum mechanics is an issue not only of conceptual, but also of great practical relevance, impinging on areas such as quantum electron transport in condensed matter systems. The inevitable presence of impurities and imperfections in any macroscopic solid renders the long-time dynamics of electronic charge carriers chaotic. Relying on a loose interpretation of the Heisenberg principle, $\Delta t\sim\hbar/\Delta E$, i.e. the relation between long-time dynamical behavior and small-scale structures in energy, one would expect that signatures of chaotic quantum dynamics are especially important in the low-energy response in which one is usually interested. This expectation has been confirmed for innumerable observables related to low-temperature electronic transport in solid state systems.

Disordered conducting media represent but one example of a wide class of dynamical
systems with long-time chaotic dynamics. Indeed, recent experimental advances have made
it possible to realize a plethora of effectively non-disordered chaotic dynamical systems in
condensed matter devices. For example, by employing modern semiconductor device technology,
it has become possible to manufacture small two-dimensional conducting systems,
of a size $O(< 1\,\mu\mathrm{m})$ and of almost any geometric shape. Here, the number of imperfections
can be reduced to a negligible minimum, i.e. electrons propagate ballistically along straight
trajectories, as with a billiard system. The smallness of the devices further implies that the
ratio between Fermi wavelength and system size is of $O(10^{-1}\text{-}10^{-3})$, so, while semiclassical
concepts will surely be applicable, the wave aspects of quantum propagation remain visible.
In recent years, the experimental and theoretical study of electron transport in such
quantum billiards has emerged as a field in its own right.

How then can signatures of chaotic dynamics in quantum systems be sought? The most 
fundamental characteristic of a quantum system is its spectrum. Although not a direct 
observable, it determines the majority of properties accessible to measurement. On the other 
hand, it is clear that the manifestations of chaos we are looking for must relate back to the 
classical dynamical properties of the system. The question then is, how can a link between 
classical mechanics and quantum spectra be drawn? This problem is tailor-made for analysis 
by path integral techniques. 


Semiclassical approximation to the density of states 
The close connection between the path integral and classical mechanics should be evident 
from the previous sections. However, to address the problem raised above, we still need to 
understand how the path integral can be employed to analyze the spectrum of a quantum 
system. The latter is described by the (single-particle) density of states 

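$$\rho(\epsilon) = \mathrm{tr}\,\delta(\epsilon - \hat H) = \sum_a \delta(\epsilon - \epsilon_a), \qquad (3.57)$$

where $\{\epsilon_a\}$ represents the complete set of energy levels. To compute the sum, one commonly
employs a trick based on the Dirac identity,

$$\lim_{\delta \searrow 0} \frac{1}{x + i\delta} = -i\pi\delta(x) + \mathcal{P}\frac{1}{x}, \qquad (3.58)$$

where $\mathcal{P}(1/x)$ denotes the principal part of $1/x$. By taking the imaginary part of Eq. (3.58),
Eq. (3.57) can be represented as $\rho(\epsilon) = -\frac{1}{\pi}\,\mathrm{Im}\sum_a \frac{1}{\epsilon^+ - \epsilon_a} = -\frac{1}{\pi}\,\mathrm{Im}\,\mathrm{tr}\big(\frac{1}{\epsilon^+ - \hat H}\big)$, where $\epsilon^+ =
\epsilon + i\delta$ and the limit $\lim_{\delta \searrow 0}$ is implicit. Using the identity $1/x^+ = -i\int_0^\infty dt\, e^{i x^+ t}$, and
representing the trace $\mathrm{tr}\,\hat A = \int dq\,\langle q|\hat A|q\rangle$ as a real space integral,

$$\rho(\epsilon) = \frac{1}{\pi\hbar}\int_0^\infty dt\,\mathrm{Re}\,\mathrm{tr}\big(e^{i(\epsilon^+ - \hat H)t/\hbar}\big) = \frac{1}{\pi\hbar}\,\mathrm{Re}\int_0^\infty dt\, e^{i\epsilon^+ t/\hbar}\int dq\,\langle q|e^{-i\hat H t/\hbar}|q\rangle, \qquad (3.59)$$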

we have made the connection between the density of states and the quantum propagation 
amplitude explicit. 
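The step from Eq. (3.57) to the resolvent representation can be checked numerically. The following is a minimal sketch, assuming a hypothetical five-level spectrum and a small but finite broadening `delta` in place of the limit $\delta \searrow 0$; the function name `density_of_states` is introduced here for illustration only.

```python
import numpy as np

def density_of_states(energies, levels, delta):
    """rho(eps) = -(1/pi) Im sum_a 1/(eps + i*delta - eps_a), at finite broadening delta."""
    eps_plus = np.asarray(energies, dtype=float)[:, None] + 1j * delta
    eps_a = np.asarray(levels, dtype=float)[None, :]
    resolvent_trace = np.sum(1.0 / (eps_plus - eps_a), axis=1)
    return -resolvent_trace.imag / np.pi

# Hypothetical spectrum (assumption): five equally spaced levels.
levels = np.arange(5) + 0.5
energies = np.linspace(-50.0, 55.0, 210001)
rho = density_of_states(energies, levels, delta=0.05)

# Each level contributes a Lorentzian of unit weight, so the integral counts the levels.
step = energies[1] - energies[0]
level_count = np.sum(rho) * step

# The maximum of rho sits on one of the levels.
peak_energy = energies[np.argmax(rho)]
```

With a finite `delta` each $\delta$-function is replaced by a Lorentzian of width `delta`, so `level_count` approaches the number of levels only up to the weight lost in the tails outside the energy window.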

Without going into full mathematical detail (see, for example, the work by Haake$^{41}$ for a
modern discourse) we now outline how this integral is evaluated by path integral techniques
within the semiclassical approximation. Although, for brevity, some of the more tricky steps
of the calculation are swept under the carpet, the sketch will be accurate enough to make
manifest some aesthetic connections between the spectral theory of chaotic quantum systems
and classically chaotic dynamics. (For a more formal and thorough discussion, we refer to
Haake$^{41}$ and Gutzwiller.$^{42}$)

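Making use of the semiclassical approximation (3.28) established earlier, and substituting
into Eq. (3.59), one obtains

$$\rho(\epsilon) \simeq \frac{1}{\pi\hbar}\,\mathrm{Re}\int_0^\infty dt\, e^{i\epsilon^+ t/\hbar}\int dq\, A[q_{\rm cl}]\, e^{\frac{i}{\hbar}S[q_{\rm cl}]}\Big|_{q(t)=q(0)=q},$$

where, following our discussion in Section 3.2, we have defined $A[q_{\rm cl}] \equiv \det\big(\frac{i}{2\pi\hbar}\frac{\partial^2 S[q_{\rm cl}]}{\partial q(0)\,\partial q(t)}\big)^{1/2}$ and $q_{\rm cl}$ represents
a closed classical path that begins at $q$ at time zero and ends at the same coordinate
at time $t$. Again relying on the semiclassical condition $S[q_{\rm cl}] \gg \hbar$, the integrals over $q$ and
$t$ can be performed in a stationary phase approximation. Beginning with the time integral,
and noticing that $\partial_t S[q_{\rm cl}] = -\epsilon_{q_{\rm cl}}$ is the (conserved) energy of the path $q_{\rm cl}$, we obtain the
saddle-point condition $\epsilon = \epsilon_{q_{\rm cl}}$ and

$$\rho(\epsilon) \simeq \frac{1}{\pi\hbar}\,\mathrm{Re}\int dq\, A[q_{{\rm cl},\epsilon}]\, e^{\frac{i}{\hbar}S[q_{{\rm cl},\epsilon}]},$$

where the symbol $q_{{\rm cl},\epsilon}$ indicates that only paths $q \to q$ of energy $\epsilon$ are taken into account,
and the contribution coming from the quadratic integration around the saddle-point has
been absorbed into a redefinition of $A[q_{{\rm cl},\epsilon}]$.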

Turning to the $q$-integration, making use of the fact that
$\partial_{q_i} S[q_{\rm cl}] = -p_i$, $\partial_{q_f} S[q_{\rm cl}] = p_f$, where $q_{i,f}$ are the initial and
final coordinates of a path $q_{\rm cl}$, and $p_{i,f}$ are the initial and final
momentum, the stationary phase condition assumes the form
$0 = d_q S[q_{{\rm cl},\epsilon}] = (\partial_{q_i} + \partial_{q_f}) S[q_{{\rm cl},\epsilon}]|_{q_i = q_f = q} = p_f - p_i$, i.e. the stationarity
of the integrand under the $q$-integration requires that
the initial and final momentum of the path $q_{{\rm cl},\epsilon}$ be identical.
We thus find that the paths contributing to the integrated transition amplitude are periodic
not only in coordinate space but even in phase space. Such paths are called periodic
orbits: “periodic” because the path comes back to its initial phase space coordinate after
a certain revolution time. As such, the orbit will be traversed repeatedly as time goes by
(see the figure, where a periodic orbit $\alpha$ with initial coordinates $x = (p,q)$ is shown).

According to our analysis above, each coordinate point $q$ lying on a periodic orbit is a
stationary phase point of the $q$-integral. The stationary phase approximation of the integral
can thus be formulated as

$$\rho(\epsilon) \simeq \frac{1}{\pi\hbar}\,\mathrm{Re}\sum_{n=1}^\infty \sum_\alpha \int_\alpha dq\, A_\alpha\, e^{\frac{i}{\hbar} n S_\alpha},$$

where $\sum_\alpha$ stands for a sum over all periodic orbits (of energy $\epsilon$) and $S_\alpha$ is the action
corresponding to one traversal of the orbit (all at fixed energy $\epsilon$). The index $n$ accounts
for the fact that, owing to its periodicity, the orbit can be traversed repeatedly, with total
action $nS_\alpha$. Furthermore, $\int_\alpha dq$ is an integral over all coordinates lying on the orbit and
we have again absorbed a contribution coming from the quadratic integration around the
stationary phase points in the pre-exponential amplitude $A_\alpha$.

Finally, noting that $\int_\alpha dq \propto T_\alpha$, where $T_\alpha$ is the period of one traversal of the orbit $\alpha$ (at
energy $\epsilon$), we arrive at the result

$$\rho(\epsilon) \simeq \frac{1}{\pi\hbar}\,\mathrm{Re}\sum_{n=1}^\infty \sum_\alpha T_\alpha A_\alpha\, e^{\frac{i}{\hbar} n S_\alpha}. \qquad (3.60)$$


This is a (simplified, see info block below) representation of the famous Gutzwiller trace 
formula. The result is actually quite remarkable: the density of states, an observable of 
quantum mechanical significance, has been expressed entirely in terms of classical quantities. 
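As an illustration of how a sum over repetitions of a classical orbit can encode a quantum spectrum, consider a particle in a one-dimensional box of length $L$, where the only periodic orbit bounces back and forth with action $S/\hbar = 2kL$ at wavenumber $k$. The sketch below makes several simplifying assumptions that are not part of the derivation above: the amplitude $T_\alpha A_\alpha$ is replaced by a constant, the phases of the two hard-wall reflections are taken to cancel, a smooth constant term is added, and the density is expressed as a function of $k$ rather than $\epsilon$. The names `orbit_sum_density` and `peak_near` are hypothetical.

```python
import numpy as np

def orbit_sum_density(k, length, n_max):
    """Truncated orbit sum (L/pi) * [1 + 2 * sum_{n=1}^{n_max} cos(n * S / hbar)], S / hbar = 2 k L."""
    k = np.asarray(k, dtype=float)
    action_over_hbar = 2.0 * k * length           # one traversal of the bouncing orbit
    n = np.arange(1, n_max + 1)[:, None]          # repetition index
    oscillatory = 2.0 * np.sum(np.cos(n * action_over_hbar[None, :]), axis=0)
    return (length / np.pi) * (1.0 + oscillatory)

length = 1.0
k_grid = np.linspace(0.5, 10.5, 20001)
rho_k = orbit_sum_density(k_grid, length, n_max=50)

def peak_near(k0, half_width=0.5):
    """Location of the largest value of rho_k within a window around k0."""
    window = np.abs(k_grid - k0) < half_width
    return k_grid[window][np.argmax(rho_k[window])]

# Exact wavenumbers of the box, k_j = j * pi / L, compared with the peaks of the orbit sum.
exact_k = np.pi * np.arange(1, 4) / length
peaks = np.array([peak_near(kj) for kj in exact_k])
```

Including more repetitions (larger `n_max`) sharpens the peaks; in this special case the full sum is just the Poisson resummation of the exact level density, which is why the peak positions coincide with the quantized wavenumbers.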


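EXERCISE Making use of the Feynman path integral, show that the propagator for a particle
of mass $m$ confined by a square well potential of infinite strength is given by

$$G(q_f, q_i; t) = \sqrt{\frac{m}{2\pi i\hbar t}}\,\sum_{n=-\infty}^{\infty}\left\{\exp\left[\frac{im(q_f - q_i + 2na)^2}{2\hbar t}\right] - \exp\left[\frac{im(q_f + q_i + 2na)^2}{2\hbar t}\right]\right\}.$$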


INFO Had we carefully kept track of all determinants arising from the stationary phase integrals,
the prefactor $A_\alpha$ would have read

$$A_\alpha = \frac{e^{i\frac{\pi}{2}\nu_\alpha n}}{|\det(M_\alpha^n - 1)|^{1/2}},$$

where $\nu_\alpha$ is known as the Maslov index (an integer-valued factor associated with the singular
points on the orbit, i.e. the classical turning points; the meaning of this object can be understood,
e.g., by applying the path integral to the problem of a quantum particle in a box: to
correctly reproduce the spectrum, the contribution of each path must be weighted by $(-)^n =
\exp(i\pi n)$, where $n$ is the number of its turning points in the box potential), and $M_\alpha$ represents
the monodromy matrix. To understand the meaning of this object, notice that a phase space
point $x$ on a periodic orbit can be interpreted as a fixed point of the classical time evolution
operator $U(T_\alpha)$: $U(T_\alpha, x) = x$, which is just to say that the orbit is periodic. As with any other
smooth mapping, $U$ can be linearized in the vicinity of its fixed points, $U(T_\alpha, x + y) = x + M_\alpha y$,
where the linear operator $M_\alpha$ is the monodromy matrix. Evidently, $M_\alpha$ determines the stability
of the orbit under small distortions, which makes it plausible that it appears as a controlling
prefactor of the stationary phase approximation to the density of states.


3.4 Summary and outlook 

In this chapter we have introduced the path integral formulation of quantum mechanics, 
an approach independent of, yet (modulo certain mathematical imponderabilities related to 
continuum functional integration) equivalent to, the standard route of canonical operator 
quantization. While a few precious exactly solvable quantum problems (e.g. the evolution 
of a free particle, the harmonic oscillator, and, perhaps intriguingly, quantum mechanical 
spin) are more efficiently formulated by the standard approach, a spectrum of unique fea- 
tures makes the path integral an indispensable tool of modern quantum mechanics. The 
path integral approach is highly intuitive, powerful in the treatment of non-perturbative 
problems, and tailor-made to formulation of semiclassical limits. Perhaps most importantly, 
we have seen that it provides a unifying link whereby quantum problems can be related 
to classical statistical mechanics. Indeed, we have found that the path integral of a quan- 
tum point particle is, in many respects, equivalent to the partition function of a classical 
one-dimensional continuum system. We have hinted at a generalization of this principle, i.e. 
an equivalence principle relating $d$-dimensional quantum field theory to $(d+1)$-dimensional
statistical mechanics. However, before exploring this bridge further, we first need to gener- 
alize the concept of path integration to problems involving quantum fields. This will be the 
subject of the next chapter. 


3.5 Problems 

Quantum harmonic oscillator 

As emphasized in the main text, the quantum harmonic oscillator provides a valuable environment in 
which to explore the Feynman path integral and methods of functional integration. In this, along with a 
small number of other precious examples, the path integral may be computed exactly, and the Feynman 
propagator explored rigorously. 


(a) Starting with the Feynman path integral, show that the propagator for the one-dimensional
quantum harmonic oscillator, $\hat H = \hat p^2/2m + m\omega^2\hat q^2/2$, takes the form

$$\langle q_f|e^{-i\hat H t/\hbar}|q_i\rangle = \left(\frac{m\omega}{2\pi i\hbar\sin\omega t}\right)^{1/2}\exp\left[\frac{i}{2\hbar}m\omega\left((q_i^2 + q_f^2)\cot\omega t - \frac{2q_iq_f}{\sin\omega t}\right)\right].$$

Suggest why the propagator varies periodically on the time interval $t$, and explain the
origin of the singularities at $t = n\pi/\omega$, $n = 1, 2, \ldots$ Taking the frequency $\omega \to 0$, show
that the propagator for the free particle is recovered.

(b) Show that the wavepacket $\psi(q, t = 0) = (2\pi a)^{-1/4}\exp[-q^2/4a]$ remains Gaussian at all
subsequent times. Obtain the width $a(t)$ as a function of time.

(c) Semiclassical limit: Taking the initial wavepacket to be of the form

$$\psi(q, t = 0) = (2\pi a)^{-1/4}\exp\left[\frac{i}{\hbar}mvq - \frac{q^2}{4a}\right]$$

(which corresponds to a wavepacket centered at an initial position $q = 0$ with a velocity
$v$), find the wavepacket at times $t > 0$, and determine the mean position, mean velocity,
and mean width as functions of time.

Answer: 


(a) Making use of the Feynman path integral, the propagator can be expressed as the
functional integral,

$$\langle q_f|e^{-i\hat H t/\hbar}|q_i\rangle = \int_{q(0)=q_i}^{q(t)=q_f} Dq\; e^{iS[q]/\hbar}, \qquad S[q] = \int_0^t dt'\,\frac{m}{2}\left(\dot q^2 - \omega^2 q^2\right).$$

The evaluation of the functional integral over field configurations $q(t')$ is facilitated by
parameterizing the path in terms of fluctuations around the classical trajectory. Setting
$q(t') = q_{\rm cl}(t') + r(t')$ where $q_{\rm cl}(t')$ satisfies the classical equation of motion $m\ddot q_{\rm cl} =
-m\omega^2 q_{\rm cl}$, and applying the boundary conditions, one obtains the solution $q_{\rm cl}(t') =
A\sin(\omega t') + B\cos(\omega t')$, with the coefficients $B = q_i$ and $A = q_f/\sin(\omega t) - q_i\cot(\omega t)$.
Being Gaussian in $q$, the action separates as $S[q] = S[q_{\rm cl}] + S[r]$, where

$$S[q_{\rm cl}] = \frac{m\omega^2}{2}\left[(A^2 - B^2)\frac{\sin(2\omega t)}{2\omega} + 2AB\,\frac{\cos(2\omega t) - 1}{2\omega}\right] = \frac{m\omega}{2}\left[(q_i^2 + q_f^2)\cot(\omega t) - \frac{2q_iq_f}{\sin(\omega t)}\right].$$

Finally, integrating over the fluctuations and applying the identity

$$\frac{z}{\sin z} = \prod_{n=1}^{\infty}\left(1 - \frac{z^2}{\pi^2n^2}\right)^{-1},$$

one obtains the required result, periodic in $t$ with frequency $\omega$, and singular at $t = n\pi/\omega$.
In particular, a careful regularization of the expression for the path integral shows that

$$\langle q_f|e^{-i\hat H t/\hbar}|q_i\rangle \to \begin{cases}\delta(q_f - q_i), & t = 2\pi n/\omega,\\ \delta(q_f + q_i), & t = \pi(2n+1)/\omega.\end{cases}$$

Physically, the origin of the singularity is clear. The harmonic oscillator is peculiar in
having a spectrum with energies uniformly spaced in units of $\hbar\omega$. Noting the eigenfunction
expansion $\langle q_f|e^{-i\hat H t/\hbar}|q_i\rangle = \sum_n \langle q_f|n\rangle\langle n|q_i\rangle e^{-i\omega n t}$, this means that when $\hbar\omega \times t/\hbar =
2\pi \times$ integer there is a coherent superposition of the states and the initial state is recovered.
Furthermore, since the ground state and its even integer descendants are symmetric
while the odd states are antisymmetric, it is straightforward to prove the identity for
the odd periods (exercise).
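The closed form of the classical action quoted in part (a) can be checked against a direct numerical integration of the Lagrangian along the classical path. This is a minimal sketch; the parameter values and the function names `classical_action_closed_form` and `classical_action_numeric` are assumptions made for illustration.

```python
import numpy as np

def classical_action_closed_form(q_i, q_f, t, m=1.0, omega=1.0):
    """S[q_cl] = (m*omega/2) * [(q_i^2 + q_f^2) * cot(omega t) - 2 q_i q_f / sin(omega t)]."""
    s, c = np.sin(omega * t), np.cos(omega * t)
    return 0.5 * m * omega * ((q_i**2 + q_f**2) * c / s - 2.0 * q_i * q_f / s)

def classical_action_numeric(q_i, q_f, t, m=1.0, omega=1.0, n_steps=200001):
    """Trapezoidal integral of (m/2)(qdot^2 - omega^2 q^2) along q_cl = A sin + B cos."""
    b = q_i
    a = q_f / np.sin(omega * t) - q_i / np.tan(omega * t)
    tp = np.linspace(0.0, t, n_steps)
    q = a * np.sin(omega * tp) + b * np.cos(omega * tp)
    qdot = omega * (a * np.cos(omega * tp) - b * np.sin(omega * tp))
    lagrangian = 0.5 * m * (qdot**2 - omega**2 * q**2)
    dt = tp[1] - tp[0]
    # Trapezoidal rule: full weight inside, half weight at the two end points.
    return dt * (np.sum(lagrangian) - 0.5 * (lagrangian[0] + lagrangian[-1]))

# Hypothetical boundary data, with t chosen away from the singular times n*pi/omega.
s_closed = classical_action_closed_form(0.3, -1.1, 0.7)
s_numeric = classical_action_numeric(0.3, -1.1, 0.7)
```

Both evaluations break down as $t \to n\pi/\omega$, where the coefficient $A$ diverges; this is the same singularity that appears in the propagator.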

(b) Given the initial condition $\psi(q, t = 0)$, the time evolution of the wavepacket can be
determined from the propagator as $\psi(q, t) = \int_{-\infty}^{\infty} dq'\,\langle q|e^{-i\hat H t/\hbar}|q'\rangle\psi(q', 0)$, from which
one obtains

$$\psi(q,t) = J(t)\int_{-\infty}^{\infty} dq'\,\frac{1}{(2\pi a)^{1/4}}\,e^{-q'^2/4a}\exp\left[\frac{i}{\hbar}\frac{m\omega}{2}\left((q^2 + q'^2)\cot(\omega t) - \frac{2qq'}{\sin(\omega t)}\right)\right],$$

where $J(t)$ represents the time-dependent contribution arising from the fluctuations
around the classical trajectory. Being Gaussian in $q'$, the integral can be performed
explicitly. Setting $\alpha = 1/2a - im\omega\cot(\omega t)/\hbar$, $\beta = im\omega q/(\hbar\sin(\omega t))$, and performing the
Gaussian integral over $q'$, one obtains

$$\psi(q,t) = J(t)\,\frac{1}{(2\pi a)^{1/4}}\sqrt{\frac{2\pi}{\alpha}}\,e^{\beta^2/2\alpha}\exp\left[\frac{i}{2\hbar}m\omega q^2\cot(\omega t)\right],$$

where $\beta^2/2\alpha = -(1 + i\kappa\cot(\omega t))\,q^2/4a(t)$. Rearranging terms, it is straightforward
to show that $\psi(q,t) = (2\pi a(t))^{-1/4}\exp\left[-q^2/4a(t)\right]e^{i\varphi(q,t)}$, where $a(t) = a[\cos^2(\omega t) +
\kappa^{-2}\sin^2(\omega t)]$, $\kappa = 2am\omega/\hbar$, and $\varphi(q,t)$ represents a pure phase.$^{43}$ As required, under the
action of the propagator the normalization of the wavepacket is preserved. (A graphical
representation of the time evolution is shown in Fig. 3.9(a).) Note that, if $a = \hbar/2m\omega$
(i.e. $\kappa = 1$), $a(t) = a$ for all times, i.e. it is a pure eigenstate.

(c) Still of a Gaussian form, the integration can again be performed explicitly for the new
initial condition. In this case, we obtain an expression of the form above but with
$\beta = \frac{im\omega}{\hbar\sin(\omega t)}\times\left(q - \frac{v}{\omega}\sin(\omega t)\right)$. Reading off the coefficients, we find that the position
and velocity of the wavepacket have the forms $q_0(t) = (v/\omega)\sin(\omega t)$, $v(t) = v\cos(\omega t)$,
coinciding with those of classical dynamics. Note that, as above, the width $a(t)$ of the
wavepacket oscillates at frequency $\omega$. (A graphical representation of the time evolution
is shown in Fig. 3.9(b).)
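The breathing of the width $a(t) = a[\cos^2(\omega t) + \kappa^{-2}\sin^2(\omega t)]$ can be compared with a brute-force evolution of the Gaussian on a grid. The sketch below works in units $\hbar = m = \omega = 1$ and uses a finite-difference Hamiltonian; the grid size, box length, initial width `a0`, and all function names are assumptions chosen for illustration. Since $|\psi|^2 \propto \exp[-q^2/2a(t)]$, the quantity $a(t)$ is read off as the position variance.

```python
import numpy as np

# Units with hbar = m = omega = 1 (an assumption made for illustration).
n_grid, q_max = 400, 10.0
q = np.linspace(-q_max, q_max, n_grid)
dq = q[1] - q[0]

# Finite-difference Hamiltonian H = -(1/2) d^2/dq^2 + q^2/2 on the grid.
kinetic = (np.eye(n_grid)
           - 0.5 * np.diag(np.ones(n_grid - 1), 1)
           - 0.5 * np.diag(np.ones(n_grid - 1), -1)) / dq**2
hamiltonian = kinetic + np.diag(0.5 * q**2)
energies, modes = np.linalg.eigh(hamiltonian)

def evolve(psi0, t):
    """Expand psi0 in eigenmodes and attach the phases exp(-i E_n t)."""
    coeffs = modes.T @ psi0
    return modes @ (np.exp(-1j * energies * t) * coeffs)

def variance(psi):
    """Position variance of the normalized probability density |psi|^2."""
    prob = np.abs(psi)**2
    prob = prob / (np.sum(prob) * dq)
    mean = np.sum(q * prob) * dq
    return np.sum((q - mean)**2 * prob) * dq

a0 = 0.25                     # initial width parameter a
kappa = 2.0 * a0              # kappa = 2 a m omega / hbar in these units
psi0 = (2.0 * np.pi * a0)**-0.25 * np.exp(-q**2 / (4.0 * a0))

def width_formula(t):
    return a0 * (np.cos(t)**2 + np.sin(t)**2 / kappa**2)

t_check = 0.5 * np.pi         # a quarter of the oscillator period
numeric_width = variance(evolve(psi0, t_check))
```

For $\kappa = 1$ (here `a0 = 0.5`) the same code returns a time-independent variance, in line with the remark that the wavepacket is then an eigenstate.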




Figure 3.9 (a) Variation of a “stationary” Gaussian wavepacket in the harmonic oscillator taken 
from the solution, and (b) variation of the moving wavepacket. 


43 For completeness, we note that $\varphi(q,t) = -\frac{1}{2}\tan^{-1}\left(\frac{1}{\kappa}\cot(\omega t)\right) - \ldots$
Density matrix

Using the results derived in the previous section as an example, we explore how real-time dynamical
information can be converted into quantum statistical information.

Using the results of the previous question, obtain the density matrix $\rho(q, q') = \langle q|e^{-\beta\hat H}|q'\rangle$
for the harmonic oscillator at finite temperature, $\beta = 1/T$ ($k_{\rm B} = 1$). Obtain and comment
on the asymptotics: (i) $T \ll \hbar\omega$ and (ii) $T \gg \hbar\omega$. (Hint: In the high-temperature case, be
sure to carry out the expansion in $\hbar\omega/T$ to second order.)


Answer: 

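The density matrix can be deduced from the general solution of the previous question.
Turning to the Euclidean time formulation,

$$\rho(q,q') = \langle q|e^{-\beta\hat H}|q'\rangle = \left(\frac{m\omega}{2\pi\hbar\sinh(\beta\hbar\omega)}\right)^{1/2}\exp\left[-\frac{m\omega}{2\hbar}\left((q^2 + q'^2)\coth(\beta\hbar\omega) - \frac{2qq'}{\sinh(\beta\hbar\omega)}\right)\right].$$

(i) In the low-temperature limit $T \ll \hbar\omega$ ($\beta\hbar\omega \gg 1$), $\coth(\beta\hbar\omega) \to 1$, $\sinh(\beta\hbar\omega) \to e^{\beta\hbar\omega}/2$,
and

$$\rho(q,q') \simeq \left(\frac{m\omega}{\pi\hbar\,e^{\beta\hbar\omega}}\right)^{1/2}\exp\left[-\frac{m\omega}{2\hbar}(q^2 + q'^2)\right] = \langle q|n = 0\rangle\,e^{-\beta E_0}\,\langle n = 0|q'\rangle.$$

(ii) Using the relations $\coth(x) = 1/x + x/3 + \cdots$ and $1/\sinh(x) = 1/x - x/6 + \cdots$, the
high-temperature expansion ($T \gg \hbar\omega$) of the density operator gives

$$\rho(q,q') \simeq \left(\frac{m}{2\pi\beta\hbar^2}\right)^{1/2}e^{-m(q-q')^2/2\beta\hbar^2}\exp\left[-\frac{\beta m\omega^2}{6}(q^2 + q'^2 + qq')\right] \simeq \delta(q - q')\,e^{-\beta m\omega^2q^2/2},$$

i.e. one recovers the classical Maxwell-Boltzmann distribution!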
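A quick consistency check of the Euclidean-time density matrix is that its trace reproduces the oscillator partition function $\sum_n e^{-\beta\hbar\omega(n+1/2)} = 1/(2\sinh(\beta\hbar\omega/2))$. The following is a minimal sketch with $m = \omega = \hbar = 1$ by default; the function name `density_matrix`, the grid, and the value of `beta` are assumptions for illustration.

```python
import numpy as np

def density_matrix(q, qp, beta, m=1.0, omega=1.0, hbar=1.0):
    """rho(q, q') of the harmonic oscillator in the Euclidean-time (coth/sinh) form."""
    x = beta * hbar * omega
    prefactor = np.sqrt(m * omega / (2.0 * np.pi * hbar * np.sinh(x)))
    exponent = -(m * omega / (2.0 * hbar)) * (
        (q**2 + qp**2) / np.tanh(x) - 2.0 * q * qp / np.sinh(x)
    )
    return prefactor * np.exp(exponent)

beta = 0.7
q_grid = np.linspace(-12.0, 12.0, 24001)
dq = q_grid[1] - q_grid[0]

# Trace: integrate the diagonal rho(q, q) over q.
trace_numeric = np.sum(density_matrix(q_grid, q_grid, beta)) * dq
trace_exact = 1.0 / (2.0 * np.sinh(0.5 * beta))      # sum_n exp(-beta (n + 1/2))

# Low-temperature check at the origin: rho(0, 0) -> |psi_0(0)|^2 exp(-beta E_0).
beta_cold = 20.0
rho_cold = density_matrix(0.0, 0.0, beta_cold)
ground_state_weight = np.exp(-0.5 * beta_cold) / np.sqrt(np.pi)
```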


Depinning transition and bubble nucleation 

In Section 3.3 we explored the capacity for a quantum field to tunnel from the metastable minimum of 
a potential, the “false vacuum." Yet, prior to the early work of Coleman on the quantum mechanical 
problem, similar ideas had been developed by Langer in the context of classical bubble nucleation. 
The following problem is an attempt to draw the connections between the classical and the quantum 
problem. As posed, the quantum formulation describes the depinning of a flux line in a superconductor 
from a columnar defect.

Consider a quantum elastic string embedded in a three-dimensional
space and “pinned” by a columnar defect potential $V$ oriented parallel
to the $z$-axis. The corresponding Euclidean time action is given by

$$S[\mathbf{u}] = \int d\tau\int dz\left[\frac{1}{2}\rho\,\dot{\mathbf{u}}^2 + \frac{1}{2}\sigma(\partial_z\mathbf{u})^2 + V(|\mathbf{u}|)\right],$$

where the two-dimensional vector field $\mathbf{u}(z,\tau)$ denotes the string displacement
within the $xy$-plane, $\rho$ represents the density per unit length,
and $\sigma$ defines the tension in the string. On this system (see figure), let us
suppose that an external in-plane field $f$ is imposed along the $x$-direction,
$S_{\rm ext} = -f\int d\tau\int dz\;\mathbf{u}\cdot\mathbf{e}_x$. Following the steps below, determine the probability
(per unit time and per unit length) for the string to detach from
the defect:

(a) Derive a saddle-point equation in the two-dimensional $z\tau$-space. Rescaling the coordinates,
transform the equation of motion to a problem with circular symmetry.

(b) If the field is weak, one can invoke a “thin-wall” or “bubble” approximation to describe
the saddle-point solution $\mathbf{u}(z,\tau)$ by specifying two regions of space-time, where the string
is free, or is completely locked to the defect, respectively. In this approximation, find
$\mathbf{u}(z,\tau)$. (Hint: Use the fact that, in either case, complete locking or complete freedom,
the potential does not exert a net force on the string.)

(c) With exponential accuracy, determine the detaching probability. You may assume that,
for all values of $u_x$ obtained in (b), $V(|\mathbf{u}|) \simeq V_0 = {\rm const}$.

(Exercise: Think how the quantum model can be related to the classical system.) 

Answer: 



(a) Varying the action with respect to $u_x$, the saddle-point equation assumes the form $\rho\ddot u_x +
\sigma\partial_z^2u_x = -f + V'(u)(u_x/u)$, where $u = |\mathbf{u}|$. Applying the rescaling $\tau = (\rho/\sigma)^{1/4}\tilde\tau$ and
$z = (\sigma/\rho)^{1/4}\tilde z$, the equation takes the symmetrized form $\sqrt{\sigma\rho}\,\partial^2u_x = -f + V'(u)(u_x/u)$,
where $\partial^2 = \partial_{\tilde\tau}^2 + \partial_{\tilde z}^2$ and boundary conditions

$$u_x(r) = \begin{cases}0, & r > R,\\ g(r), & r < R,\end{cases}$$

on the radial coordinate $(\tilde\tau, \tilde z) \mapsto (r, \phi)$ are imposed.

(b) In the thin-wall approximation, the potential gradient can be neglected. In this case
the saddle-point equation assumes the form $\partial^2g = -f/\sqrt{\sigma\rho}$, with the solution $g =
(R^2 - r^2)f/(4\sqrt{\sigma\rho})$.

(c) With the result, the tunneling rate can be estimated from the saddle-point action

$$S_{\rm bubble} = \int d^2r\left[\frac{\sqrt{\sigma\rho}}{2}(\partial g)^2 + V_0 - fg\right] = -\pi R^2\left(\frac{f^2R^2}{16(\sigma\rho)^{1/2}} - V_0\right).$$

Extremizing over $R$, one obtains the optimal radius $R_*^2 = 8V_0(\sigma\rho)^{1/2}/f^2$. As a result, we obtain
the estimate for the tunneling rate $W \propto e^{-S(R_*)} = \exp\left[-4\pi V_0^2(\sigma\rho)^{1/2}/f^2\right]$.
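The bubble action of part (c), $S(R) = \pi R^2V_0 - \pi f^2R^4/(16\sqrt{\sigma\rho})$, has a single stationary point as a function of $R$ (a maximum, i.e. the barrier that the nucleating bubble has to overcome). The sketch below locates it on a grid and compares with the closed-form radius and action obtained from $dS/dR = 0$; the parameter values and the function name `bubble_action` are hypothetical.

```python
import numpy as np

def bubble_action(radius, f, v0, sigma, rho):
    """S(R) = pi R^2 V0 - pi f^2 R^4 / (16 sqrt(sigma rho)) in the thin-wall approximation."""
    stiffness = np.sqrt(sigma * rho)
    return np.pi * radius**2 * v0 - np.pi * f**2 * radius**4 / (16.0 * stiffness)

# Hypothetical parameter values (weak field f).
f, v0, sigma, rho = 0.2, 1.5, 2.0, 0.5

radii = np.linspace(0.0, 40.0, 400001)
actions = bubble_action(radii, f, v0, sigma, rho)
r_star_numeric = radii[np.argmax(actions)]
s_star_numeric = actions.max()

# Closed forms following from dS/dR = 0.
r_star_closed = np.sqrt(8.0 * v0 * np.sqrt(sigma * rho)) / f
s_star_closed = 4.0 * np.pi * v0**2 * np.sqrt(sigma * rho) / f**2
```

The saddle-point action scales as $1/f^2$, so the detachment rate is exponentially suppressed for weak fields.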


Tunneling in a dissipative environment 

In Section 3.3 we considered the impact of dissipation on the action of a point-particle in a quantum 
well. There a model was chosen in which the degrees of freedom of the environment were represented 
phenomenologically by a bath of harmonic oscillators. In the following we will explore a model in which 
the particle is coupled to the fluctuations of a quantum mechanical "string.” Later, in Section 8.2, we 
will see that this model provides a description of tunneling through a single impurity in a Luttinger 
liquid. 


(a) A quantum particle of mass $m$ is confined by a sinusoidal potential $U(q) =
2g\sin^2(\pi q/q_0)$. Employing the Euclidean (imaginary time) Feynman path integral

$$Z = \int Dq(\tau)\,e^{-S_{\rm part}[q(\tau)]/\hbar}, \qquad S_{\rm part}[q(\tau)] = \int_{-\infty}^{\infty}d\tau\left[\frac{m}{2}\dot q^2 + U(q)\right],$$

confirm by direct substitution that the extremal contribution to the propagator connecting
two neighboring degenerate minima ($q(\tau = -\infty) = 0$ and $q(\tau = \infty) = q_0$) is given by
the instanton trajectory $q_{\rm cl}(\tau) = (2q_0/\pi)\arctan(\exp[\omega_0\tau])$, where $\omega_0 = (2\pi/q_0)\sqrt{g/m}$.
Show that $S_{\rm part}[q_{\rm cl}] = (2/\pi^2)mq_0^2\omega_0$. (Note: Although the equation of motion associated
with the minimum of the Euclidean path integral is nonlinear, the solution above is
exact. It is known in the literature as a soliton configuration.)

(b) If the quantum particle is coupled at one point to an infinite “string,” the path integral
is given by

$$Z = \int Du(x,\tau)\int Dq(\tau)\,\delta\big(q(\tau) - u(\tau, x = 0)\big)\,e^{-S_{\rm string}[u(x,\tau)]/\hbar - S_{\rm part}[q(\tau)]/\hbar},$$

where the classical action of the string is given by (cf. the action functional for phonons
discussed in Section 1.1)

$$S_{\rm string}[u(x,\tau)] = \int d\tau\int_{-\infty}^{\infty}dx\left[\frac{\rho}{2}\dot u^2 + \frac{\sigma}{2}\left(\frac{\partial u}{\partial x}\right)^2\right].$$

Here $\delta(q(\tau) - u(\tau, x = 0))$ represents a functional $\delta$-function which enforces the condition
$q(\tau) = u(\tau, x = 0)$ for all times $\tau$. Operationally, it can be understood from the
discretized form $\prod_n\delta(q(\tau_n) - u(\tau_n, x = 0))$. By representing the functional $\delta$-function
as the functional integral

$$\delta\big(q(\tau) - u(\tau, 0)\big) = \int Df(\tau)\exp\left[i\int_{-\infty}^{\infty}d\tau\,f(\tau)\big(q(\tau) - u(\tau, 0)\big)\right],$$

and integrating over the fluctuations of the string, show that the dynamics of the particle
is governed by the effective action $S_{\rm eff}[q] = S_{\rm part}[q] + (\eta/2)\int(d\omega/2\pi)\,|\omega||q(\omega)|^2$,
where $\eta = \sqrt{\rho\sigma}$. How does this result compare with the dissipative action discussed in
Section 3.3?

(c) Treating the correction to the particle action as a perturbation, use your result from
(a) to show that the effective action for an instanton-anti-instanton pair $q(\tau) = q_{\rm cl}(\tau +
\bar\tau/2) - q_{\rm cl}(\tau - \bar\tau/2)$, where $\omega_0\bar\tau \gg 1$, is given approximately by

$$S_{\rm eff}[q] = 2S_{\rm part}[q_{\rm cl}] + \frac{\eta q_0^2}{\pi}\ln(\omega_0\bar\tau).$$

(Hint: Note that, in finding the Fourier decomposition of $q_{\rm cl}(\tau)$, a crude estimate is
sufficient.)

(d) Using this result, estimate the typical separation of the pair (i.e. interpret the overall
action as an effective probability distribution function for $\bar\tau$, and evaluate $\langle\bar\tau\rangle =
\int d\bar\tau\,\bar\tau\,e^{-S_{\rm eff}}$). Comment on the implications of your result for the nature of the tunneling
probability.

Answer: 


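(a) Varying the Euclidean time path integral with respect to $q(\tau)$ one finds that the extremal
field configuration obeys the classical sine-Gordon equation

$$m\ddot q - \frac{2\pi g}{q_0}\sin\left(\frac{2\pi q}{q_0}\right) = 0.$$

Applying the trial solution, one finds that the equation of motion is satisfied if $\omega_0 =
(2\pi/q_0)\sqrt{g/m}$. From this result one obtains the classical action

$$S_{\rm part}[q_{\rm cl}] = \int_{-\infty}^{\infty}d\tau\left[\frac{m}{2}\dot q_{\rm cl}^2 + U(q_{\rm cl})\right] = \int_{-\infty}^{\infty}d\tau\,m\dot q_{\rm cl}^2 = m\int_0^{q_0}dq\,\dot q_{\rm cl} = \frac{2}{\pi^2}mq_0^2\omega_0.$$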
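The instanton solution and its action can be verified numerically. The sketch below evaluates $q_{\rm cl}(\tau) = (2q_0/\pi)\arctan(e^{\omega_0\tau})$ on a grid, checks the zero-energy first integral $\frac{m}{2}\dot q^2 = U(q)$ of the Euclidean equation of motion, and integrates the action; the parameter values `m`, `g`, `q0` and the grid are assumptions chosen for illustration.

```python
import numpy as np

# Hypothetical parameter values.
m, g, q0 = 1.3, 0.8, 2.0
omega0 = (2.0 * np.pi / q0) * np.sqrt(g / m)

# Imaginary-time grid, wide enough that the instanton has saturated at both ends.
tau = np.linspace(-40.0 / omega0, 40.0 / omega0, 400001)
dtau = tau[1] - tau[0]
q_cl = (2.0 * q0 / np.pi) * np.arctan(np.exp(omega0 * tau))

# Velocity (finite differences) and potential energy along the trajectory.
q_dot = np.gradient(q_cl, dtau)
potential = 2.0 * g * np.sin(np.pi * q_cl / q0)**2

# Euclidean action: integral of (m/2) qdot^2 + U(q) over the whole trajectory.
action_numeric = np.sum(0.5 * m * q_dot**2 + potential) * dtau
action_closed = (2.0 / np.pi**2) * m * q0**2 * omega0

# Zero-energy first integral of the Euclidean equation of motion.
first_integral_residual = np.max(np.abs(0.5 * m * q_dot**2 - potential))
```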

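(b) In Fourier space, the action of the classical string takes the form

$$S_{\rm string} = \int\frac{d\omega}{2\pi}\int\frac{dk}{2\pi}\,\frac{\rho\omega^2 + \sigma k^2}{2}\,|u(\omega,k)|^2.$$

Representing the functional $\delta$-function as the functional integral

$$\int Df\exp\left[i\int\frac{d\omega}{2\pi}f(\omega)\left(q(-\omega) - \int\frac{dk}{2\pi}u(-\omega,-k)\right)\right],$$

and performing the integral over the degrees of freedom of the string, one obtains

$$\int Du\;e^{-S_{\rm string} - i\int d\tau\,f(\tau)u(\tau,0)} \propto \exp\left[-\int\frac{d\omega}{2\pi}\int\frac{dk}{2\pi}\,\frac{1}{2}\,\frac{1}{\rho\omega^2 + \sigma k^2}\,|f(\omega)|^2\right].$$

Integrating over $k$, and performing the Gaussian functional integral over the Lagrange
multiplier field $f(\omega)$, one obtains the effective action as required.

(c) Approximating the instanton/anti-instanton pair $q(\tau) = q_{\rm cl}(\tau + \bar\tau/2) - q_{\rm cl}(\tau - \bar\tau/2)$ by a “top-hat”
function, one finds that $q(\omega) = \int_{-\bar\tau/2}^{\bar\tau/2}d\tau\,q_0e^{i\omega\tau} = q_0\bar\tau\sin(\omega\bar\tau/2)/(\omega\bar\tau/2)$. Treating
the dissipative term as a perturbation, the action then takes the form

$$S_{\rm eff} - 2S_{\rm part} = \frac{\eta}{2}\int\frac{d\omega}{2\pi}\,|\omega|\,(q_0\bar\tau)^2\,\frac{\sin^2(\omega\bar\tau/2)}{(\omega\bar\tau/2)^2} \simeq \frac{q_0^2}{\pi}\,\eta\ln(\omega_0\bar\tau),$$

where $\omega_0$ serves as a high-frequency cut-off.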
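The logarithm in part (c) originates from the integral $\int d\omega\,\sin^2(\omega\bar\tau/2)/\omega$ cut off at $\omega_0$. After the substitution $x = \omega\bar\tau/2$ this becomes $\int_0^{X}dx\,\sin^2(x)/x$ with $X = \omega_0\bar\tau/2$, which grows as $\frac{1}{2}\ln X$ for large $X$. The following minimal sketch checks this growth numerically; the function name `dissipative_integral` and the two cut-off values are assumptions for illustration.

```python
import numpy as np

def dissipative_integral(x_max, n_points=2_000_000):
    """Midpoint-rule estimate of int_0^{x_max} dx sin^2(x) / x, with x = omega * tau_bar / 2."""
    dx = x_max / n_points
    x = (np.arange(n_points) + 0.5) * dx      # midpoints avoid the removable point x = 0
    return np.sum(np.sin(x)**2 / x) * dx

# Increase the dimensionless cut-off X = omega_0 * tau_bar / 2 by a factor of ten.
i_small = dissipative_integral(100.0)
i_large = dissipative_integral(1000.0)

# Slope with respect to ln(X); the large-X asymptotics predict 1/2.
slope = (i_large - i_small) / np.log(10.0)
```

The remaining deviation of `slope` from 1/2 comes from oscillatory corrections that decay as $1/X$, which is why only a crude estimate of the Fourier transform is needed to capture the logarithm.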

(d) Interpreted as a probability distribution for the instanton separation, one finds

$$\langle\bar\tau\rangle = \int d\bar\tau\,\bar\tau\,\exp\left[-\frac{q_0^2}{\pi}\,\eta\ln(\omega_0\bar\tau)\right] \sim \int d\bar\tau\,\bar\tau^{\,1 - q_0^2\eta/\pi}.$$

The integral diverges at large $\bar\tau$ unless $\eta > 2\pi/q_0^2$. For $\eta > 2\pi/q_0^2$, instanton-anti-instanton pairs
are therefore confined and particle tunneling is deactivated. Later, in Chapter 8 we revisit the
dissipative phase transition from the standpoint of the renormalization group.


Winding numbers 

In the main text, we considered the application of the Feynman path integral to model systems where 
trajectories could be parameterized in terms of their harmonic (Fourier) expansion. However, very often, 
one is interested in applications of the path integral to spaces that are not simply connected. In this 
case, one must include classes of trajectories which cannot be simply continued. Rather, trajectories are 
classified by their “winding number" on the space. To illustrate the point, let us consider the application 
of the path integral to a particle on a ring. 

(a) Starting with the Hamiltonian $\hat H = -(1/2I)(\partial^2/\partial\theta^2)$, where $\theta$ denotes an angle variable,
show from first principles that the quantum partition function $Z = \mathrm{tr}\,e^{-\beta\hat H}$ is given by

$$Z = \sum_{n=-\infty}^{\infty}\exp\left[-\beta\frac{n^2}{2I}\right]. \qquad (3.61)$$

(b) Formulated as a Feynman path integral, show that the quantum partition function can
be cast in the form

$$Z = \int_0^{2\pi}d\theta\sum_{m=-\infty}^{\infty}\int_{\theta(0)=\theta}^{\theta(\beta)=\theta(0)+2\pi m}D\theta(\tau)\,\exp\left[-\frac{I}{2}\int_0^\beta d\tau\,\dot\theta^2\right].$$

(c) Varying the Euclidean action with respect to $\theta$, show that the path integral is minimized
by the classical trajectories $\bar\theta(\tau) = \theta + 2\pi m\tau/\beta$. Parameterizing a general path as
$\theta(\tau) = \bar\theta(\tau) + \eta(\tau)$, where $\eta(\tau)$ is a path with no net winding, show that

$$Z = Z_0\sum_{m=-\infty}^{\infty}\exp\left[-\frac{I}{2}\frac{(2\pi m)^2}{\beta}\right], \qquad (3.62)$$

where $Z_0$ represents the quantum partition function for a free particle with open boundary
conditions. Making use of the free particle propagator, show that $Z_0 = \sqrt{I/2\pi\beta}$.

(d) Finally, making use of Poisson’s summation formula, $\sum_mh(m) = \sum_n\int_{-\infty}^{\infty}d\phi\,h(\phi)e^{2\pi in\phi}$,
show that Eq. (3.62) coincides with Eq. (3.61).

Answer: 


(a) Solving the Schrödinger equation, the wavefunctions obeying periodic boundary conditions
take the form $\psi_n = e^{in\theta}$, $n$ integer, and the eigenvalues are given by $E_n =
n^2/2I$. Cast in the eigenbasis representation, the partition function assumes the form
Eq. (3.61).

(b) Interpreted as a Feynman path integral, the quantum partition function takes the form
of a propagator with

$$Z = \int_0^{2\pi}d\theta\int_{\theta(\beta)=\theta(0)=\theta}D\theta(\tau)\,\exp\left[-\int_0^\beta d\tau\,\frac{I}{2}\dot\theta^2\right].$$

The trace implies that paths $\theta(\tau)$ must start and finish at the same point. However,
to accommodate the invariance of the field configuration $\theta$ under translation by $2\pi$ we
must impose the boundary conditions shown in the question.

(c) Varying the action with respect to $\theta$ we obtain the classical equation $I\ddot\theta = 0$. Solving
this equation subject to the boundary conditions, we obtain the solution given in the
question. Evaluating the Euclidean action, we find that

$$\int_0^\beta d\tau\,(\partial_\tau\theta)^2 = \int_0^\beta d\tau\left(\frac{2\pi m}{\beta} + \partial_\tau\eta\right)^2 = \beta\left(\frac{2\pi m}{\beta}\right)^2 + \int_0^\beta d\tau\,(\partial_\tau\eta)^2.$$

Thus, we obtain the partition function (3.62), where

$$Z_0 = \int D\eta\,\exp\left[-\frac{I}{2}\int_0^\beta d\tau\,(\partial_\tau\eta)^2\right] = \sqrt{\frac{I}{2\pi\beta}}$$

denotes the free particle partition function. This can be obtained from direct evaluation
of the free particle propagator.

(d) Applying the Poisson summation formula to the sum over winding numbers $m$ leaves a Gaussian
integral for each $n$. Multiplication by $Z_0$ obtains the result.
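The equivalence of the sum over energy levels and the sum over winding numbers can be checked numerically. The sketch below compares $\sum_n e^{-\beta n^2/2I}$ with the winding-number sum, using the Poisson summation identity $\sum_n e^{-\beta n^2/2I} = \sqrt{2\pi I/\beta}\,\sum_m e^{-(I/2)(2\pi m)^2/\beta}$. The prefactor $\sqrt{2\pi I/\beta} = 2\pi\sqrt{I/2\pi\beta}$ is written here on the assumption that the angular integration $\int_0^{2\pi}d\theta$ is counted together with the free-propagator factor; the function names and parameter values are hypothetical.

```python
import numpy as np

def z_energy_sum(beta, inertia, n_max=200):
    """Partition function as a sum over angular momentum levels E_n = n^2 / (2 I)."""
    n = np.arange(-n_max, n_max + 1)
    return np.sum(np.exp(-beta * n**2 / (2.0 * inertia)))

def z_winding_sum(beta, inertia, m_max=200):
    """Partition function as a sum over winding numbers m of classical trajectories."""
    m = np.arange(-m_max, m_max + 1)
    prefactor = np.sqrt(2.0 * np.pi * inertia / beta)   # assumed to include the theta-integral
    return prefactor * np.sum(np.exp(-0.5 * inertia * (2.0 * np.pi * m)**2 / beta))

beta, inertia = 0.8, 1.7
z_levels = z_energy_sum(beta, inertia)
z_windings = z_winding_sum(beta, inertia)
```

The two representations converge in opposite regimes: the level sum is dominated by a few terms at low temperature (large $\beta$), the winding sum at high temperature (small $\beta$).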



Particle in a periodic potential 

In Section 3.3 it was shown that the quantum probability amplitude for quantum mechanical tunneling 
can be expressed as a sum over instanton field configurations of the Euclidean action. By generalizing 
this approach, the aim of the present problem is to explore quantum mechanical tunneling in a periodic 
potential. Such an analysis allows us to draw a connection to the problem of the Bloch spectrum.

(a) A quantum mechanical particle moves in a periodic lattice potential $V$ with period $a$.
Taking the Euclidean action for the instanton connecting two neighboring minima to be
$S_{\rm inst}$, express the Euclidean time propagator $G(ma, na; \tau)$, with $m$ and $n$ integer, as a
sum over instanton and anti-instanton field configurations.

(b) Making use of the identity $\delta_{qq'} = \int_0^{2\pi}d\theta\,e^{i(q-q')\theta}/(2\pi)$, show that

$$G(ma, na; \tau) \simeq e^{-\omega\tau/2}\int_0^{2\pi}\frac{d\theta}{2\pi}\,e^{-i(n-m)\theta}\exp\left[\frac{2\Delta\epsilon\,\tau}{\hbar}\cos\theta\right],$$

where our notation is taken from Section 3.3.

(c) Keeping in mind that, in the periodic system, the eigenfunctions are Bloch states
$\psi_{p\alpha}(q) = e^{ipq}u_{p\alpha}(q)$ where $u_{p\alpha}(q + ma) = u_{p\alpha}(q)$ denotes the periodic part of the Bloch
function, show that the propagator is compatible with a spectrum of the lowest band
$\alpha = 0$, $\epsilon_p = \hbar\omega/2 - 2\Delta\epsilon\cos(pa)$.

Answer: 


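(a) In the double-well potential, the extremal field configurations of the Euclidean action
involve consecutive sequences of instanton-anti-instanton pairs. However, in the periodic
potential, the $q$ instantons and $q'$ anti-instantons can appear in any sequence provided
only that $q - q' = n - m$. In this case, the Feynman amplitude takes the form

$$G(ma, na; \tau) \simeq \sum_{q=0}^{\infty}\sum_{q'=0}^{\infty}\frac{\delta_{q-q',\,n-m}}{q!\,q'!}\left(\tau Ke^{-S_{\rm inst}/\hbar}\right)^{q+q'}e^{-\omega\tau/2}.$$

(b) To evaluate the instanton summation, we can make use of the identity $\delta_{q-q',\,n-m} =
\int_0^{2\pi}d\theta\,e^{i(q-q'-n+m)\theta}/(2\pi)$. As a result, we obtain

$$G(ma, na; \tau) \simeq e^{-\omega\tau/2}\int_0^{2\pi}\frac{d\theta}{2\pi}\,e^{-i(n-m)\theta}\sum_{q=0}^{\infty}\frac{\left(\tau Ke^{i\theta}e^{-S_{\rm inst}/\hbar}\right)^q}{q!}\sum_{q'=0}^{\infty}\frac{\left(\tau Ke^{-i\theta}e^{-S_{\rm inst}/\hbar}\right)^{q'}}{q'!}$$

$$= e^{-\omega\tau/2}\int_0^{2\pi}\frac{d\theta}{2\pi}\,e^{-i(n-m)\theta}\exp\left[\frac{\Delta\epsilon\,\tau}{\hbar}e^{i\theta}\right]\exp\left[\frac{\Delta\epsilon\,\tau}{\hbar}e^{-i\theta}\right],$$

from which can be obtained the required result.

(c) Expanded in terms of the Bloch states of the lowest band of the periodic potential $\alpha = 0$,
one obtains

$$G(ma, na; \tau) = \sum_p\psi_p^*(ma)\psi_p(na)\,e^{-\epsilon_p\tau/\hbar} = \sum_p|u_p(0)|^2\,e^{ip(n-m)a}\,e^{-\epsilon_p\tau/\hbar}.$$

Interpreting $\theta = pa$, and taking $|u_p(0)|^2 = {\rm const}$, independent of $p$, we can draw the
correspondence $\epsilon_p = \hbar\omega/2 - 2\Delta\epsilon\cos(pa)$.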
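The resummation of instantons and anti-instantons in part (b) can be checked numerically: the constrained double sum over $q$ and $q'$ and the angular integral over $\theta$ should agree for any net number of steps $n - m$. In the sketch below the variable `x` stands for the weight $\tau Ke^{-S_{\rm inst}/\hbar} = \Delta\epsilon\,\tau/\hbar$ of a single (anti-)instanton; its value, the truncation `q_max`, and the function names are assumptions for illustration, and the common factor $e^{-\omega\tau/2}$ is omitted.

```python
import math
import numpy as np

def instanton_double_sum(x, net, q_max=60):
    """Sum over q instantons and q' anti-instantons with q - q' = net, each with weight x."""
    total = 0.0
    for q in range(q_max):
        qp = q - net                      # number of anti-instantons fixed by the constraint
        if qp < 0:
            continue
        total += x**(q + qp) / (math.factorial(q) * math.factorial(qp))
    return total

def angular_integral(x, net, n_theta=4096):
    """(1/2pi) int_0^{2pi} dtheta exp(-i net theta) exp(2 x cos(theta)) on a uniform grid."""
    theta = 2.0 * np.pi * np.arange(n_theta) / n_theta
    integrand = np.exp(-1j * net * theta) * np.exp(2.0 * x * np.cos(theta))
    return np.mean(integrand).real

x = 1.3
comparisons = [(instanton_double_sum(x, net), angular_integral(x, net)) for net in (0, 2, -1)]
```

Both expressions are representations of the modified Bessel function $I_{n-m}(2x)$, which makes the hopping amplitude between minima decay with their separation $|n - m|$ at fixed $x$.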
In this chapter, the concept of path integration is generalized to integration over quantum fields.
Specifically we will develop an approach to quantum field theory that takes as its starting point an 
integration over all configurations of a given field, weighted by an appropriate action. To emphasize the 
importance of the formulation that, methodologically, represents the backbone of the remainder of the 
text, we have pruned the discussion to focus only on the essential elements. This being so, conceptual 
aspects stand in the foreground and the discussion of applications is postponed to the following chapters. 


In this chapter, the concept of path integration is extended from quantum mechanics to 
quantum field theory. Our starting point is a situation very much analogous to that out- 
lined at the beginning of the previous chapter. Just as there are two different approaches 
to quantum mechanics, quantum field theory can also be formulated in two different ways: 
the formalism of canonically quantized field operators, and functional integration. As to the 
former, although much of the technology needed to efficiently implement this framework - 
essentially Feynman diagrams - originated in high-energy physics, it was with the develop- 
ment of condensed matter physics through the 1950s, 1960s, and 1970s that this approach 
was driven to unprecedented sophistication. The reason is that, almost as a rule, problems 
in condensed matter investigated at that time necessitated perturbative summations to infi- 
nite order in the non-trivial content of the theory (typically interactions). This requirement 
led to the development of advanced techniques to sum perturbation series in many-body 
interaction operators to infinite order. 

In the 1970s, however, essentially non-perturbative problems began to attract more and 
more attention - a still prevailing trend - and it turned out that the formalism of canonically 
quantized operators was not tailored to this type of physics. By contrast, the alternative 
approach to many-body problems, functional integration, is ideally suited! The situation 
is similar to the one described in the last chapter, where we saw that the Feynman path 
integral provided a spectrum of novel routes to approaching quantum mechanical problems 
(controlled semiclassical limits, analogies to classical mechanics, statistical mechanics, con- 
cepts of topology and geometry, etc.). Similarly, the introduction of field integration into 
many-body physics spawned new theoretical developments, many of which went beyond 
perturbation theory. In fact, the advantage of the path integral approach in many-body 
physics is more pronounced than in single-particle quantum mechanics: higher dimension- 
ality introduces more complex fields, and the concept of field integration is ideally suited 
to explore the ensuing structures. Also, the connections to classical statistical mechanics 
play a more important role than in single-particle quantum mechanics. All of these concepts 
will begin to play a role in subsequent chapters when applications of the field integral are 
discussed. 



[Figure 4.1: panels labeled "Degrees of freedom" and "Path integral", shown for QM (upper row) and QFT (lower row).] 

Figure 4.1 The concept of field integration. Upper panels: path integral of quantum mechanics - 
integration over all time-dependent configurations of a point particle degree of freedom leads to 
integrals over curves. Lower panels: field integral - integration over time-dependent configurations 
of d-dimensional continuum mappings (fields) leads to integrals over generalized (d+1)-dimensional 
surfaces. 

Before embarking on the quantitative construction of the field integral - the subject of 
the following sections - let us anticipate the kind of structures that one should expect. 
In quantum mechanics, we were starting from a single point particle degree of freedom, 
characterized by some coordinate $q$ (or some other quantum numbers for that matter). 
Path integration then meant integration over all time-dependent configurations $q(t)$, i.e. 
a set of curves $t \mapsto q(t)$ (see Fig. 4.1, upper panel). By contrast, the degrees of freedom 
of field theory are continuous objects $\phi(x)$ by themselves, where $x$ parameterizes some 
$d$-dimensional base manifold and $\phi$ takes values in some target manifold (Fig. 4.1, lower panel). 
The natural generalization of a “path” integral then implies integration over a single copy of 
these objects at each instant of time, i.e. we shall have to integrate over generalized surfaces, 
mappings from $(d+1)$-dimensional space-time into the field manifold, $(x,t) \mapsto \phi(x,t)$. While 
this notion may sound worrying, it is important to realize that, conceptually, nothing much 
changes in comparison with the path integral: instead of a one-dimensional manifold - a 
curve - our object of integration will be a $(d+1)$-dimensional manifold. 

We now proceed to formulate these ideas in quantitative terms. 

EXERCISE If necessary, recapitulate the general construction scheme of path integrals (Sec- 
tion 3.2) and the connection between quantum fields and second quantized operators. 
4.1 Construction of the many-body path integral 

The construction of a path integral for field operators follows the general scheme outlined 
at the end of Section 3.2. The basic idea is to segment the time evolution of a quantum 
(many-body) Hamiltonian into infinitesimal time slices and to absorb as much as possible 
of the quantum dynamical phase accumulated during the short-time propagation into a set 
of suitably chosen eigenstates. But how should these eigenstates be chosen? In the context 
of single-particle quantum mechanics, the structure of the Hamiltonian suggested a repre- 
sentation in terms of coordinate and momentum eigenstates. Now, given that many-particle 
Hamiltonians are conveniently expressed in terms of creation/annihilation operators, an 
obvious idea would be to search for eigenstates of these operators. Such states indeed exist 
and are called coherent states. 


Coherent states (bosons) 

Our goal is, therefore, to find eigenstates of the (non-Hermitian) Fock space operators $a^\dagger$ 
and $a$. Although the general form of these states will turn out to be the same for bosons and 
fermions, there are major differences regarding their algebraic structure. The point is that 
the anti-commutation relations of fermions require that the eigenvalues of an annihilation 
operator themselves anti-commute, i.e. they cannot be ordinary numbers. Postponing the 
introduction of the unfamiliar concept of anti-commuting “numbers” to the next section, 
we first concentrate on the bosonic case where problems of this kind do not arise. 

So what form do the eigenstates $|\phi\rangle$ of the bosonic Fock space operators $a$ and $a^\dagger$ take? 
Being a state of the Fock space, an eigenstate $|\phi\rangle$ can be expanded as 

$$ |\phi\rangle = \sum_{n_1,n_2,\ldots} C_{n_1,n_2,\ldots}\, |n_1,n_2,\ldots\rangle, \qquad |n_1,n_2,\ldots\rangle = \frac{(a_1^\dagger)^{n_1}}{\sqrt{n_1!}}\, \frac{(a_2^\dagger)^{n_2}}{\sqrt{n_2!}} \cdots |0\rangle, $$

where $a_i^\dagger$ creates a boson in state $i$, $C_{n_1,n_2,\ldots}$ represents a set of expansion coefficients, 
and $|0\rangle$ represents the vacuum. Here, for reasons of clarity, it is convenient to adopt this 
convention for the vacuum as opposed to the notation $|\Omega\rangle$ used previously. Furthermore, 
the many-body state $|n_1,n_2,\ldots\rangle$ is indexed by a set of occupation numbers: $n_1$ in state $|1\rangle$, 
$n_2$ in state $|2\rangle$, and so on. Importantly, the state $|\phi\rangle$ can, in principle (and will in practice) 
contain a superposition of basis states which have different numbers of particles. Now, if 
the minimum number of particles in state $|\phi\rangle$ is $n_0$, the minimum of $a_i^\dagger|\phi\rangle$ must be $n_0 + 1$. 
Clearly the creation operators $a_i^\dagger$ themselves cannot possess eigenstates. 

However, with annihilation operators this problem does not arise. Indeed, the annihilation 
operators do possess eigenstates, known as boson coherent states, 

$$ |\phi\rangle \equiv \exp\Big[\sum_i \phi_i a_i^\dagger\Big] |0\rangle, \tag{4.1} $$
where the elements of $\phi = \{\phi_i\}$ represent a set of complex numbers. The states $|\phi\rangle$ are 
eigenstates in the sense that, for all $i$, 

$$ a_i |\phi\rangle = \phi_i |\phi\rangle, \tag{4.2} $$

i.e. they simultaneously diagonalize all annihilation operators. Noting that $a_i$ and $a_j^\dagger$, with 
$j \neq i$, commute, Eq. (4.2) can be verified by showing that $a \exp(\phi a^\dagger)|0\rangle = \phi \exp(\phi a^\dagger)|0\rangle$. 1 
Although not crucial to the practice of field integration, in the construction of the path 
integral it will be useful to assimilate some further properties of coherent states: 


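▷ By taking the Hermitian conjugate of Eq. (4.2), we find that the “bra” associated with 
the “ket” $|\phi\rangle$ is a left eigenstate of the set of creation operators, i.e. for all $i$, 

$$ \langle\phi| a_i^\dagger = \langle\phi| \bar\phi_i, \tag{4.3} $$

where $\bar\phi_i$ is the complex conjugate of $\phi_i$, and $\langle\phi| = \langle 0| \exp[\sum_i \bar\phi_i a_i]$. 

▷ It is a straightforward matter - e.g. by a Taylor expansion of Eq. (4.1) - to show that the 
action of a creation operator on a coherent state yields the identity 

$$ a_i^\dagger |\phi\rangle = \partial_{\phi_i} |\phi\rangle. \tag{4.4} $$

Reassuringly, it may be confirmed that Eq. (4.4) and (4.2) are consistent with the commutation 
relations $[a_i, a_j^\dagger] = \delta_{ij}$: $[a_i, a_j^\dagger]|\phi\rangle = (\partial_{\phi_j}\phi_i - \phi_i\partial_{\phi_j})|\phi\rangle = \delta_{ij}|\phi\rangle$. 

▷ Making use of the relation $\langle\theta|\phi\rangle = \langle 0| e^{\sum_i \bar\theta_i a_i} |\phi\rangle = e^{\sum_i \bar\theta_i\phi_i} \langle 0|\phi\rangle$, one finds that the 
overlap between two coherent states is given by 

$$ \langle\theta|\phi\rangle = \exp\Big[\sum_i \bar\theta_i\phi_i\Big]. \tag{4.5} $$

▷ From this result, one can infer that the norm of a coherent state is given by 

$$ \langle\phi|\phi\rangle = \exp\Big[\sum_i \bar\phi_i\phi_i\Big]. \tag{4.6} $$

▷ Most importantly, the coherent states form a complete - in fact an overcomplete - set of 
states in Fock space: 

$$ \int \prod_i \frac{d\bar\phi_i\, d\phi_i}{\pi}\; e^{-\sum_i \bar\phi_i\phi_i}\, |\phi\rangle\langle\phi| = 1_{\mathcal F}, \tag{4.7} $$

where $d\bar\phi_i\, d\phi_i = d\,\mathrm{Re}\,\phi_i\; d\,\mathrm{Im}\,\phi_i$, and $1_{\mathcal F}$ represents the unit operator or identity in the 
Fock space. 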
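The properties (4.2), (4.5), and (4.6) can be checked numerically for a single bosonic mode. The following is a minimal sketch, not part of the original text: it assumes a Fock space truncated to `dim` number states (so the eigenvalue equation holds only up to a truncation error), and the names `coherent_state`, `annihilation_operator`, `dim`, `phi`, and `theta` are illustrative choices.

```python
import math
import numpy as np

def coherent_state(phi, dim):
    """Components <n|phi> = phi^n / sqrt(n!) of the unnormalized state exp(phi a^dagger)|0>."""
    return np.array([phi**n / math.sqrt(math.factorial(n)) for n in range(dim)],
                    dtype=complex)

def annihilation_operator(dim):
    """Matrix of a in the number basis, a|n> = sqrt(n)|n-1>, cut off at dim states."""
    return np.diag(np.sqrt(np.arange(1, dim)), k=1).astype(complex)

dim = 60                                  # truncation (an assumption of this sketch)
phi, theta = 0.7 - 0.3j, -0.4 + 0.5j      # arbitrary complex labels

a = annihilation_operator(dim)
ket_phi = coherent_state(phi, dim)
ket_theta = coherent_state(theta, dim)

# Eq. (4.2): a|phi> = phi|phi>, up to an error caused by the truncation
eigen_residual = np.linalg.norm(a @ ket_phi - phi * ket_phi)

# Eq. (4.5): <theta|phi> = exp(conj(theta) * phi); np.vdot conjugates its first argument
overlap = np.vdot(ket_theta, ket_phi)
overlap_exact = np.exp(np.conj(theta) * phi)

# Eq. (4.6): the squared norm is the special case theta = phi
norm_sq = np.vdot(ket_phi, ket_phi).real
```

For moderate $|\phi|$ the residual is negligible because the components $\phi^n/\sqrt{n!}$ decay rapidly with $n$; for large $|\phi|$ the cutoff `dim` would have to be increased.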


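1 Using the result $[a, (a^\dagger)^n] = n (a^\dagger)^{n-1}$ (cf. Eq. (2.38)) a Taylor expansion shows $a \exp(\phi a^\dagger)|0\rangle = 
[a, \exp(\phi a^\dagger)]|0\rangle = \sum_{n=0}^\infty \frac{\phi^n}{n!} [a, (a^\dagger)^n]|0\rangle = \sum_{n=1}^\infty \frac{n\phi^n}{n!} (a^\dagger)^{n-1}|0\rangle = \phi \sum_{n=1}^\infty \frac{\phi^{n-1}}{(n-1)!} (a^\dagger)^{n-1}|0\rangle = 
\phi \exp(\phi a^\dagger)|0\rangle$. 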
INFO The proof of Eq. (4.7) proceeds by application of Schur’s lemma (cf. our discussion of 
the completeness of the spin coherent states in the previous chapter). The operator family $\{a_i\}$, 
$\{a_i^\dagger\}$ acts irreducibly in Fock space. According to Schur’s lemma, the proportionality of the left- 
hand side of Eq. (4.7) to the unit operator is, therefore, equivalent to its commutativity with all 
creation and annihilation operators. Indeed, this property is easily confirmed: 

$$ \begin{aligned}
a_i \int d(\bar\phi,\phi)\, e^{-\sum_j \bar\phi_j\phi_j}\, |\phi\rangle\langle\phi| 
&= \int d(\bar\phi,\phi)\, e^{-\sum_j \bar\phi_j\phi_j}\, \phi_i\, |\phi\rangle\langle\phi| \\
&= -\int d(\bar\phi,\phi)\, \big(\partial_{\bar\phi_i} e^{-\sum_j \bar\phi_j\phi_j}\big)\, |\phi\rangle\langle\phi| \\
&\overset{\text{by parts}}{=} \int d(\bar\phi,\phi)\, e^{-\sum_j \bar\phi_j\phi_j}\, |\phi\rangle \big(\partial_{\bar\phi_i}\langle\phi|\big) \\
&= \int d(\bar\phi,\phi)\, e^{-\sum_j \bar\phi_j\phi_j}\, |\phi\rangle\langle\phi|\, a_i,
\end{aligned} \tag{4.8} $$

where, for brevity, we have set $d(\bar\phi,\phi) \equiv \prod_i d\bar\phi_i\, d\phi_i/\pi$. Taking the adjoint of Eq. (4.8), one may 
further check that the left-hand side of Eq. (4.7) commutes with the set of creation operators, 
i.e. it must be proportional to the unit operator. To fix the constant of proportionality, one may 
simply take the overlap with the vacuum: 

$$ \int d(\bar\phi,\phi)\, e^{-\sum_i \bar\phi_i\phi_i}\, \langle 0|\phi\rangle\langle\phi|0\rangle = \int d(\bar\phi,\phi)\, e^{-\sum_i \bar\phi_i\phi_i} = 1. \tag{4.9} $$

Taken together, Eq. (4.8) and (4.9) prove (4.7). Note that the coherent states are overcomplete 
in the sense that they are not pairwise orthogonal (see Eq. (4.5)). The exponential weight 
$e^{-\sum_i \bar\phi_i\phi_i}$ appearing in the resolution of the identity compensates for the overcounting achieved 
by integrating over the whole set of coherent states. 
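The resolution of the identity (4.7) can also be illustrated by direct numerical integration for a single mode. This is a sketch under stated assumptions rather than a proof: the Fock space is restricted to the lowest `dim` number states, the radial integral is done with Gauss-Laguerre quadrature after substituting $u = |\phi|^2$, and the phase integral uses a uniform grid. All variable names are illustrative.

```python
import math
import numpy as np
from numpy.polynomial.laguerre import laggauss

dim = 6                                    # retain |0>, ..., |5> (assumption)
u, w = laggauss(20)                        # nodes/weights for int_0^inf e^{-u} f(u) du
angles = 2.0 * np.pi * np.arange(32) / 32  # uniform grid for the phase of phi

n = np.arange(dim)
norms = np.array([math.sqrt(math.factorial(k)) for k in range(dim)])

# With phi = sqrt(u) e^{i theta}:
#   d(Re phi) d(Im phi) / pi * e^{-|phi|^2} = e^{-u} du dtheta / (2 pi)
resolution = np.zeros((dim, dim), dtype=complex)
for u_k, w_k in zip(u, w):
    for theta in angles:
        phi = math.sqrt(u_k) * np.exp(1j * theta)
        ket = phi**n / norms               # components <n|phi>
        resolution += (w_k / len(angles)) * np.outer(ket, ket.conj())

# Largest deviation of the integrated projector from the unit matrix
deviation = np.max(np.abs(resolution - np.eye(dim)))
```

The phase average removes all off-diagonal elements, while the radial integral $\int_0^\infty du\, e^{-u} u^n = n!$ cancels the normalization factors on the diagonal; this is the single-mode content of Eq. (4.9) extended to all matrix elements.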


With these definitions we have all that we need to construct the path integral for the 
bosonic system. However, before doing so, we will first introduce the fermionic version of 
the coherent state. This will allow us to construct the path integrals for bosons and fermions 
simultaneously, thereby emphasizing the similarity of their structure. 


Coherent states (fermions) 


Much of the formalism above generalizes to the fermionic case: as before, it is evident that 
creation operators cannot possess eigenstates. Following the bosonic system, let us suppose 
that the annihilation operators are characterized by a set of coherent states such that, for 
all i, 


$$ a_i |\eta\rangle = \eta_i |\eta\rangle, \tag{4.10} $$

where $\eta_i$ is the eigenvalue. Although the structure of this equation appears to be equivalent 
to its bosonic counterpart Eq. (4.2) it has one frustrating feature: anti-commutativity of 
the fermionic operators, $[a_i, a_j]_+ = 0$, where $i \neq j$, implies that the eigenvalues $\eta_i$ also have 
to anti-commute, 

$$ \eta_i \eta_j = -\eta_j \eta_i. \tag{4.11} $$
Clearly, these objects cannot be ordinary numbers. In order to define a fermionic version 
of coherent states, we now have two choices: we may (a) accept Eq. (4.11) as a working 
definition and pragmatically explore its consequences, or (b) first try to remove any mystery 
from the definitions (4.10) and (4.11). This latter task is tackled in the Info block below where 
objects $\{\eta_i\}$ with the desired properties are defined in a mathematically clean way. Readers 
wishing to proceed in a maximally streamlined manner may skip this exposition and directly 
turn to the more praxis-oriented discussion below. 

Hermann Günther Grassmann 1809-77 
Credited with inventing what is now called exterior algebra. (Figure reproduced from Hermann 
Grassmann, Gesammelte Mathematische und Physikalische Werke (Druck und Verlag von B. G. Teubner, 
1894).) 

INFO There is a mathematical structure ideally suited to generalize the concept of ordinary 
number (fields), namely algebras. An algebra $\mathcal{A}$ is a vector space endowed with a multiplication 
rule $\mathcal{A} \times \mathcal{A} \to \mathcal{A}$. So let us construct an algebra $\mathcal{A}$ by starting out from a set of elements, or 
generators, $\eta_i \in \mathcal{A}$, $i = 1, \ldots, N$, and imposing the rules: 

(i) The elements $\eta_i$ can be added and multiplied by complex numbers, i.e. 

$$ c_0 + c_i \eta_i + c_j \eta_j \in \mathcal{A}, \qquad c_0, c_i, c_j \in \mathbb{C}, \tag{4.12} $$

i.e. $\mathcal{A}$ is a complex vector space. 

(ii) The product, $\mathcal{A} \times \mathcal{A} \to \mathcal{A}$, $(\eta_i, \eta_j) \mapsto \eta_i\eta_j$, is associative and anti-commutative, i.e. it obeys 
the anti-commutation relation (4.11). Because of the associativity of this operation, there is 
no ambiguity when it comes to forming products of higher order, i.e. $(\eta_i\eta_j)\eta_k = \eta_i(\eta_j\eta_k) \equiv 
\eta_i\eta_j\eta_k$. The definition requires that products of odd order in the number of generators anti- 
commute, while (even, even) and (even, odd) combinations commute (exercise). 

By virtue of (i) and (ii), the set $\mathcal{A}$ of all linear combinations 

$$ c_0 + \sum_{n=1}^{\infty} \sum_{i_1,\ldots,i_n=1}^{N} c_{i_1,\ldots,i_n}\, \eta_{i_1} \cdots \eta_{i_n}, \qquad c_0, c_{i_1,\ldots,i_n} \in \mathbb{C}, $$

spans a finite-dimensional associative algebra $\mathcal{A}$, 2 known as the Grassmann algebra (and 
sometimes also the exterior algebra). For completeness we mention that Grassmann algebras 
find a number of realizations in mathematics, the most basic being exterior multiplication in 
linear algebra. Given an $N$-dimensional vector space $V$, let $V^*$ be the dual space, i.e. the space 
of all linear mappings, or “forms” $\Lambda : V \to \mathbb{C}$, $v \mapsto \Lambda(v)$, where $v \in V$. (Like $V$, $V^*$ is a vector 
space of dimension $N$.) Next, define exterior multiplication through $(\Lambda, \Lambda') \to \Lambda \wedge \Lambda'$, where 
$\Lambda \wedge \Lambda'$ is the mapping 

$$ \Lambda \wedge \Lambda' : V \times V \to \mathbb{C}, \qquad (v, v') \mapsto \Lambda(v)\Lambda'(v') - \Lambda(v')\Lambda'(v). $$

This operation is manifestly anti-commutative: $\Lambda \wedge \Lambda' = -\Lambda' \wedge \Lambda$. Identifying the $N$ linear basis 
forms $\Lambda_i \leftrightarrow \eta_i$ with generators and $\wedge$ with the product, we see that the space of exterior forms 
defines a Grassmann algebra. 

2 Whose dimension can be shown to be $2^N$ (exercise). 

Apart from their anomalous commutation properties, the generators $\{\eta_i\}$, and their product 
generalizations $\{\eta_i\eta_j, \eta_i\eta_j\eta_k, \ldots\}$, resemble ordinary, albeit anti-commutative, numbers. (In 
practice, the algebraic structure underlying their definition can safely be ignored. All we will 
need to work with these objects is the basic rule Eq. (4.11) and the property Eq. (4.12).) We 
emphasize that $\mathcal{A}$ contains not only anti-commuting but also commuting elements, i.e. linear 
combinations of products of an even number of Grassmann numbers $\eta_i$ are overall commutative. (This 
mimics the behavior of the Fock space algebra: products of an even number of annihilation 
operators $a_i a_j \ldots$ commute with all other linear combinations of operators $a_i$. In spite of 
this similarity, the “numbers” $\eta_i$ must not be confused with the Fock space operators; there 
is nothing on which they act.) 

To make practical use of the new concept, we need to go beyond the level of pure arith- 
metic. Specifically, we need to introduce functions of anti-commuting numbers, and elements 
of calculus. Remarkably, not only do most of the concepts of calculus - differentiation, inte- 
gration, etc. - naturally generalize to anti-commuting number fields, but, contrary to what 
one might expect, they turn out to be simpler than in ordinary calculus. 


▷ Functions of Grassmann numbers are defined via their Taylor expansion: 

$$ f(\xi_1, \ldots, \xi_k) = \sum_{n=0}^{\infty} \sum_{i_1,\ldots,i_n=1}^{k} \frac{1}{n!} \left. \frac{\partial^n f}{\partial\xi_{i_1} \cdots \partial\xi_{i_n}} \right|_{\xi=0} \xi_{i_n} \cdots \xi_{i_1}, \qquad \xi_1, \ldots, \xi_k \in \mathcal{A}, \tag{4.13} $$

where $f$ is an analytic function. Note that the anti-commutation properties of the algebra 
imply that the series terminates after a finite number of terms. For example, in the simple 
case where $\eta$ is first-order in the generators of the algebra, $N = 1$, and $f(\eta) = f(0) + f'(0)\eta$ 
(since $\eta^2 = 0$) - functions of Grassmann variables are fully characterized by a finite 
number of Taylor coefficients! 

▷ Differentiation with respect to Grassmann numbers is defined by 

$$ \partial_{\eta_i} \eta_j = \delta_{ij}. \tag{4.14} $$

Note that, in order to be consistent with the commutation relations, the differential 
operator $\partial_{\eta_i}$ must itself be anti-commutative. In particular, $\partial_{\eta_i} \eta_j \eta_i = -\eta_j$ for $i \neq j$. 

▷ Integration over Grassmann variables is defined by 

$$ \int d\eta_i = 0, \qquad \int d\eta_i\, \eta_i = 1. \tag{4.15} $$

Note that the definitions (4.13), (4.14), and (4.15) imply that the actions of Grassmann 
differentiation and integration are effectively identical, that is 

$$ \int d\eta\, f(\eta) = \int d\eta\, \big(f(0) + f'(0)\eta\big) = f'(0) = \partial_\eta f(\eta). $$
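As a small worked illustration of the rules (4.13) to (4.15), consider the most general function of two generators. This is a sketch added for orientation; the coefficient names $f_0, f_1, f_2, f_{12}$ are introduced here only for the example.

```latex
% Most general function of two Grassmann generators:
% the Taylor series (4.13) stops at second order.
f(\eta_1,\eta_2) = f_0 + f_1\,\eta_1 + f_2\,\eta_2 + f_{12}\,\eta_1\eta_2 ,
\qquad f_0, f_1, f_2, f_{12} \in \mathbb{C}.

% Differentiation, Eq. (4.14): \partial_{\eta_2} has to be anti-commuted past \eta_1.
\partial_{\eta_1} f = f_1 + f_{12}\,\eta_2 , \qquad
\partial_{\eta_2} f = f_2 - f_{12}\,\eta_1 .

% Integration, Eq. (4.15), reproduces the same expressions.
\int d\eta_1\, f = f_1 + f_{12}\,\eta_2 , \qquad
\int d\eta_2\, f = f_2 - f_{12}\,\eta_1 .

% The ordering of the measure matters: the innermost differential acts first.
\int d\eta_2\, d\eta_1\, f = f_{12} , \qquad
\int d\eta_1\, d\eta_2\, f = -f_{12} .
```

The last line shows why the ordering of differentials in a multi-dimensional Grassmann measure has to be fixed by convention.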
With this background, let us now proceed to apply the Grassmann algebra to the construction 
of fermion coherent states. To this end we need to enlarge the algebra so as to 
allow for a multiplication of Grassmann numbers by fermion operators. In order to be consistent 
with the anti-commutation relations, we need to require that fermion operators and 
Grassmann generators anti-commute, 

$$ [\eta_i, a_j]_+ = 0. \tag{4.16} $$

It then becomes a straightforward matter to demonstrate that fermionic coherent states 
are defined by 

$$ |\eta\rangle = \exp\Big[-\sum_i \eta_i a_i^\dagger\Big] |0\rangle, \tag{4.17} $$

i.e. by a structure perfectly analogous to the bosonic states (4.1). 3 It is a straightforward 
matter - and also a good exercise - to demonstrate that the properties (4.3), (4.4), (4.5), 
(4.6), and, most importantly, (4.7) carry over to the fermionic case. One merely has to 
identify $a_i$ with a fermionic operator and replace the complex variables $\phi_i$ by $\eta_i \in \mathcal{A}$. Apart 
from a few sign changes and the $\mathcal{A}$-valued arguments, the fermionic coherent states differ 
only in two respects from their bosonic counterpart: firstly, the Grassmann variables $\bar\eta_i$ 
appearing in the adjoint of a fermion coherent state, 

$$ \langle\eta| = \langle 0| \exp\Big[-\sum_i a_i \bar\eta_i\Big] = \langle 0| \exp\Big[\sum_i \bar\eta_i a_i\Big], $$

are not related to the $\eta_i$ of the state $|\eta\rangle$ via some kind of complex conjugation. Rather $\eta_i$ 
and $\bar\eta_i$ are strictly independent variables. 4 Secondly, the Grassmann version of a Gaussian 
integral (exercise), $\int d\bar\eta\, d\eta\, e^{-\bar\eta\eta} = 1$ does not contain the factors of $\pi$ characteristic of 
standard Gaussian integrals. Thus, the measure of the fermionic analog of Eq. (4.7) does 
not contain a $\pi$ in the denominator. 

For the sake of future reference, the most important properties of Fock space coherent 
states are summarized in Table 4.1. 


3 To prove that the states (4.17) indeed fulfill the defining relation (4.10), we note that $a_i \exp(-\eta_i a_i^\dagger)|0\rangle \overset{(4.13)}{=} 
a_i (1 - \eta_i a_i^\dagger)|0\rangle \overset{(4.16)}{=} \eta_i a_i a_i^\dagger |0\rangle = \eta_i |0\rangle = \eta_i (1 - \eta_i a_i^\dagger)|0\rangle = \eta_i \exp(-\eta_i a_i^\dagger)|0\rangle$. This, in combination with the 
fact that $a_i$ and $\eta_j a_j^\dagger$ ($i \neq j$) commute, proves Eq. (4.10). Note that the proof has actually been simpler 
than in the bosonic case. The fermionic Taylor series terminates after the first contribution. This observation 
is representative of a general rule: Grassmann calculus is simpler than standard calculus - all series are finite, 
integrals always converge, etc. 

4 In the literature, complex conjugation of Grassmann variables is sometimes defined. Although appealing from an 
aesthetic point of view - symmetry between bosons and fermions - this concept is problematic. The difficulties 
become apparent when supersymmetric theories are considered, i.e. theories where operator algebras contain 
both bosons and fermions (the so-called super-algebras). It is not possible to introduce a complex conjugation 
that leads to compatibility with the commutation relations of a super-algebra. It therefore seems to be better 
to abandon the concept of Grassmann complex conjugation altogether. (Unlike with the bosonic case where 
complex conjugation is inevitable in order to define convergent Gaussian integrals, no such need arises in the 
fermionic case.) 
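Table 4.1 Basic properties of coherent states for bosons ($\zeta = 1$, $\psi_i \in \mathbb{C}$) and fermions 
($\zeta = -1$, $\psi_i \in \mathcal{A}$). In the last line, the integration measure is defined as 
$d(\bar\psi, \psi) \equiv \prod_i d\bar\psi_i\, d\psi_i / \pi^{(1+\zeta)/2}$. 

Definition                 $|\psi\rangle = \exp\big[\zeta \sum_i \psi_i a_i^\dagger\big] |0\rangle$ 

Action of $a_i$            $a_i |\psi\rangle = \psi_i |\psi\rangle$, $\quad \langle\psi| a_i = \partial_{\bar\psi_i} \langle\psi|$ 

Action of $a_i^\dagger$    $a_i^\dagger |\psi\rangle = \zeta\, \partial_{\psi_i} |\psi\rangle$, $\quad \langle\psi| a_i^\dagger = \langle\psi| \bar\psi_i$ 

Overlap                    $\langle\psi'|\psi\rangle = \exp\big[\sum_i \bar\psi'_i \psi_i\big]$ 

Completeness               $\int d(\bar\psi, \psi)\, e^{-\sum_i \bar\psi_i \psi_i}\, |\psi\rangle\langle\psi| = 1_{\mathcal F}$ 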


INFO Grassmann Gaussian integration: Finally, before turning to the development of the 
field integral, it is useful to digress on the generalization of Gaussian integrals for Grassmann 
variables. The prototype of all Grassmann Gaussian integration formulae is the identity 

$$ \int d\bar\eta\, d\eta\, e^{-\bar\eta a \eta} = a, \tag{4.18} $$

where $a \in \mathbb{C}$ takes arbitrary values. Equation (4.18) is derived by a first-order Taylor expansion of 
the exponential and application of Eq. (4.15). The multi-dimensional generalization of Eq. (4.18) 
is given by 

$$ \int d(\bar\phi, \phi)\, e^{-\bar\phi^T A \phi} = \det A, \tag{4.19} $$

where $\bar\phi$ and $\phi$ are $N$-component vectors of Grassmann variables, the measure $d(\bar\phi, \phi) \equiv 
\prod_{i=1}^{N} d\bar\phi_i\, d\phi_i$, and $A$ may be an arbitrary complex matrix. For matrices that are unitarily 
diagonalizable, $A = U^\dagger D U$, with $U$ unitary, and $D$ diagonal, Eq. (4.19) is proven in the 
same way as its complex counterpart (3.17): one changes variables $\phi \to U^\dagger \phi$, $\bar\phi \to U^T \bar\phi$. 
Since $\det U^\dagger \det U^T = |\det U|^2 = 1$, the transform leaves the measure invariant (see below) and leaves us with $N$ 
decoupled integrals of the type Eq. (4.18). The resulting product of $N$ eigenvalues is just the 
determinant of $A$ (cf. the later discussion of the partition function of the non-interacting gas). 
For general (non-diagonalizable) $A$, the identity is established by a straightforward expansion of 
the exponent. The expansion terminates at $N$th order and, by commuting through integration 
variables, it may be shown that the resulting $N$th-order polynomial of matrix elements of $A$ is 
the determinant. 5 
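The statement that the brute-force expansion of the exponent produces the determinant can be tested on a computer. The sketch below is an illustration only: it implements a deliberately minimal Grassmann algebra (the class `Grassmann`, the helpers `grassmann_exp` and `gaussian_integral`, and the convention that generator `i` stands for $\bar\phi_i$ and generator `N + i` for $\phi_i$ are all assumptions of this example), applies the integration rule (4.15), and compares the result with `numpy.linalg.det` for a random complex matrix.

```python
import numpy as np

class Grassmann:
    """Element of a Grassmann algebra: {sorted tuple of generator indices: coefficient}."""

    def __init__(self, terms=None):
        self.terms = {k: v for k, v in (terms or {}).items() if v != 0}

    @classmethod
    def generator(cls, i):
        return cls({(i,): 1.0})

    def __add__(self, other):
        out = dict(self.terms)
        for key, coeff in other.terms.items():
            out[key] = out.get(key, 0) + coeff
        return Grassmann(out)

    def scale(self, c):
        return Grassmann({k: c * v for k, v in self.terms.items()})

    def __mul__(self, other):
        out = {}
        for k1, v1 in self.terms.items():
            for k2, v2 in other.terms.items():
                if set(k1) & set(k2):
                    continue                     # eta_i eta_i = 0 by Eq. (4.11)
                merged = k1 + k2
                # every inversion costs one transposition of generators, i.e. one sign
                inversions = sum(merged[p] > merged[q]
                                 for p in range(len(merged))
                                 for q in range(p + 1, len(merged)))
                key = tuple(sorted(merged))
                out[key] = out.get(key, 0) + (-1) ** inversions * v1 * v2
        return Grassmann(out)

    def integrate(self, i):
        """Integral over generator i, Eq. (4.15): move eta_i to the front, then drop it."""
        out = {}
        for key, coeff in self.terms.items():
            if i in key:
                p = key.index(i)
                rest = key[:p] + key[p + 1:]
                out[rest] = out.get(rest, 0) + (-1) ** p * coeff
        return Grassmann(out)


def grassmann_exp(x, order):
    """exp(x) for x without constant part; the series is cut after `order` terms."""
    result = Grassmann({(): 1.0})
    power = Grassmann({(): 1.0})
    for n in range(1, order + 1):
        power = (power * x).scale(1.0 / n)       # x^n / n!
        result = result + power
    return result


def gaussian_integral(A):
    """Left-hand side of Eq. (4.19); generator i is phibar_i, generator N + i is phi_i."""
    N = A.shape[0]
    exponent = Grassmann()
    for i in range(N):
        for j in range(N):
            term = Grassmann.generator(i) * Grassmann.generator(N + j)
            exponent = exponent + term.scale(-A[i, j])
    integrand = grassmann_exp(exponent, N)       # powers beyond N vanish identically
    for i in range(N):
        # in d(phibar_i) d(phi_i) the rightmost differential acts first
        integrand = integrand.integrate(N + i).integrate(i)
    return integrand.terms.get((), 0.0)


rng = np.random.default_rng(0)
A = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
lhs = gaussian_integral(A)
rhs = np.linalg.det(A)
```

Because the matrix is a generic complex one (neither Hermitian nor normal), the agreement of `lhs` and `rhs` illustrates that Eq. (4.19) does not rely on diagonalizability. The dictionary representation is chosen for transparency, not efficiency; its cost grows like $2^{2N}$.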


5 As with ordinary integrals, Grassmann integrals can also be subjected to variable transforms. Suppose we are 
given an integral $\int d(\bar\phi, \phi)\, f(\bar\phi, \phi)$ and wish to change variables according to 

$$ \bar\nu = M \bar\phi, \qquad \nu = M' \phi, \tag{4.20} $$

where, for simplicity, $M$ and $M'$ are complex matrices (i.e. we here restrict ourselves to linear transforms). One 
can show that 

$$ \bar\nu_1 \cdots \bar\nu_N = (\det M)\, \bar\phi_1 \cdots \bar\phi_N, \qquad \nu_1 \cdots \nu_N = (\det M')\, \phi_1 \cdots \phi_N. \tag{4.21} $$

(There are different ways to prove this identity. The most straightforward is by explicitly expanding Eq. (4.20) 
in components and commuting all Grassmann variables to the right. A more elegant way is to argue that the 
coefficient relating the right- and left-hand sides of Eq. (4.21) must be an $N$th-order polynomial of matrix 
Keeping the analogy with ordinary commuting variables, the Grassmann version of Eq. (3.18) 
reads 

$$ \int d(\bar\phi, \phi)\, e^{-\bar\phi^T A \phi + \bar\nu^T \cdot \phi + \bar\phi^T \cdot \nu} = \det A\; e^{\bar\nu^T A^{-1} \nu}. \tag{4.22} $$

To prove the latter, we note that $\int d\eta\, f(\eta) = \int d\eta\, f(\eta + \nu)$, i.e. in Grassmann integration one can 
shift variables as in the ordinary case. The proof of the Gaussian relation above thus proceeds 
in complete analogy to the complex case. As with Eq. (3.18), Eq. (4.22) can also be employed 
to generate further integration formulae. Defining $\langle \cdots \rangle \equiv \det A^{-1} \int d(\bar\phi, \phi)\, e^{-\bar\phi^T A \phi} (\cdots)$, and 
expanding both the left- and the right-hand side of Eq. (4.22) to leading order in the “monomial” 
$\bar\nu_j \nu_i$, one obtains $\langle \phi_j \bar\phi_i \rangle = A^{-1}_{ji}$. The $n$-fold iteration of this procedure gives 

$$ \langle \phi_{j_1} \phi_{j_2} \cdots \phi_{j_n} \bar\phi_{i_n} \cdots \bar\phi_{i_2} \bar\phi_{i_1} \rangle = \sum_P (\mathrm{sgn}\, P)\, A^{-1}_{j_1 i_{P1}} \cdots A^{-1}_{j_n i_{Pn}}, $$

where the sign of the permutation accounts for the sign changes accompanying the interchange of 
Grassmann variables. Finally, as with Gaussian integration over commuting variables, by taking 
$N \to \infty$ the Grassmann integration can be translated to a Gaussian functional integral. 
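For a single pair of Grassmann variables, the Gaussian formula with source terms can be verified by hand. The following worked example is an added illustration (the matrix $A$ is replaced by a number $a \in \mathbb{C}$, and $\bar\nu, \nu$ are two further, independent Grassmann generators); it uses only Eq. (4.11) and Eq. (4.15).

```latex
% Expand the exponential; every term containing a generator twice vanishes.
e^{-\bar\eta a \eta + \bar\nu\eta + \bar\eta\nu}
  = 1 + \bar\nu\eta + \bar\eta\nu - a\,\bar\eta\eta + \bar\nu\,\eta\bar\eta\,\nu .

% Only terms containing both \eta and \bar\eta survive the integration,
% with \int d\bar\eta\, d\eta\; \eta\bar\eta = 1 and \int d\bar\eta\, d\eta\; \bar\eta\eta = -1 :
\int d\bar\eta\, d\eta\; e^{-\bar\eta a \eta + \bar\nu\eta + \bar\eta\nu}
  = a + \bar\nu\nu
  = a \left( 1 + \bar\nu a^{-1} \nu \right)
  = a\, e^{\bar\nu a^{-1} \nu} .

% Reading off the coefficient of \bar\nu\nu gives the elementary contraction
\langle \eta \bar\eta \rangle
  = a^{-1} \int d\bar\eta\, d\eta\; e^{-\bar\eta a \eta}\, \eta\bar\eta
  = a^{-1} .
```

The result has the structure $\det A\; e^{\bar\nu^T A^{-1} \nu}$ with $\det A \to a$, and the contraction is the one-dimensional case of $\langle \phi_j \bar\phi_i \rangle = A^{-1}_{ji}$.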


4.2 Field integral for the quantum partition function 

Josiah Willard Gibbs 1839-1903 
Credited with the development of chemical thermodynamics, he introduced concepts of free 
energy and chemical potential. (Figure reproduced from The Collected Works of J. Willard 
Gibbs, vol. I (Longmans, Green and Co., 1928).) 

Having introduced the coherent states, the construction of path integrals for many-body 
systems no longer presents substantial difficulties. However, before proceeding, we should 
address the question: what does the phrase “path integral for many-body systems” actually 
mean? In the next chapter we will see that much of the information on quantum 
many-particle systems is encoded in expectation values of products of creation and 
annihilation operators, i.e. expressions of the structure $\langle a^\dagger a \cdots \rangle$. By an analogy to be 
explained then, objects of this type are generally called correlation functions. More 
important for our present discussion, at any finite temperature, the average $\langle \cdots \rangle$ entering 
the definition of the 

correlation function runs over the quantum Gibbs distribution $\hat\rho \equiv e^{-\beta(\hat H - \mu \hat N)}/Z$, where, 
as usual, 

$$ Z = \mathrm{tr}\, e^{-\beta(\hat H - \mu \hat N)} = \sum_n \langle n| e^{-\beta(\hat H - \mu \hat N)} |n\rangle, \tag{4.23} $$

is the quantum partition function, $\beta \equiv 1/T$, $\mu$ denotes the chemical potential, and the sum 
extends over a complete set of Fock space states $\{|n\rangle\}$. (For the time being we specify neither 
the statistics of the system - bosonic or fermionic - nor the structure of the Hamiltonian.) 

elements of $M$. In order to be consistent with the anti-commutation behavior of Grassmann variables, the 
polynomial must obey commutation relations which uniquely characterize a determinant. Exercise: Check the 
relation for $N = 2$.) On the other hand, the integral of the new variables must obey the defining relation, 
$\int d\bar\nu\, \bar\nu_1 \cdots \bar\nu_N = \int d\bar\phi\, \bar\phi_1 \cdots \bar\phi_N = (-)^{N(N-1)/2}$, where $d\bar\nu = \prod_{i=1}^{N} d\bar\nu_i$ and the sign on the right-hand side is 
attributed to ordering of the integrand, i.e. $\int d\bar\nu_1\, d\bar\nu_2\, \bar\nu_1 \bar\nu_2 = -\int d\bar\nu_1\, \bar\nu_1 \int d\bar\nu_2\, \bar\nu_2 = -1$. Together Eq. (4.21) and 
(4.20) enforce the identities $d\bar\nu = (\det M)^{-1} d\bar\phi$, $d\nu = (\det M')^{-1} d\phi$, which combine to give 

$$ \int d(\bar\phi, \phi)\, f(\bar\phi, \phi) = \det(M M') \int d(\bar\nu, \nu)\, f(\bar\phi(\bar\nu), \phi(\nu)). $$

Ultimately, we will want to construct the path integral representations of correlation 
functions. Later we will see that all of these representations can be derived by a few straight- 
forward manipulations from a prototypical path integral, namely that for Z. Further, the 
(path integral of the) partition function is of importance in its own right, as it contains 
much of the information needed to characterize the thermodynamic properties of a many- 
body quantum system. 6 We thus begin our journey into many-body field theory with a 
construction of the path integral for Z. 

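To prepare the representation of the partition function (4.23) in terms of coherent states, 
one must insert the resolution of identity 

$$ Z = \int d(\bar\psi, \psi)\, e^{-\sum_i \bar\psi_i \psi_i} \sum_n \langle n|\psi\rangle \langle\psi| e^{-\beta(\hat H - \mu \hat N)} |n\rangle. \tag{4.24} $$

We now wish to get rid of the - now redundant - Fock space summation over $|n\rangle$ (another 
resolution of identity). To bring the summation to the form $\sum_n |n\rangle\langle n| = 1_{\mathcal F}$, one must 
commute the factor $\langle n|\psi\rangle$ to the right-hand side. However, in performing this operation, we 
must be careful not to miss a sign change whose presence will have important consequences 
for the structure of the fermionic path integral. Indeed, it may be checked that, whilst for 
bosons $\langle n|\psi\rangle\langle\psi|n\rangle = \langle\psi|n\rangle\langle n|\psi\rangle$, the fermionic coherent states change sign upon permutation, 
$\langle n|\psi\rangle\langle\psi|n\rangle = \langle -\psi|n\rangle\langle n|\psi\rangle$ (i.e. $\langle -\psi| \equiv \langle 0| \exp[-\sum_i \bar\psi_i a_i]$). The presence of the sign 
is a direct consequence of the anti-commutation relations between Grassmann variables and 
Fock space operators (exercise). Note that, as both $\hat H$ and $\hat N$ contain elements even in the 
creation/annihilation operators, the sign is insensitive to the presence of the Boltzmann 
factor in Eq. (4.24). Making use of the sign factor $\zeta$, the result of the interchange can be 
formulated as the general expression 

$$ Z = \int d(\bar\psi, \psi)\, e^{-\sum_i \bar\psi_i \psi_i} \sum_n \langle \zeta\psi| e^{-\beta(\hat H - \mu \hat N)} |n\rangle \langle n|\psi\rangle = \int d(\bar\psi, \psi)\, e^{-\sum_i \bar\psi_i \psi_i}\, \langle \zeta\psi| e^{-\beta(\hat H - \mu \hat N)} |\psi\rangle. \tag{4.25} $$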


6 In fact, the statement above is not entirely correct. Strictly speaking, thermodynamic properties involve the 
thermodynamic potential $\Omega = -T \ln Z$ rather than the partition function itself. At first sight it seems that 
the difference between the two is artificial - one might first calculate $Z$ and then take the logarithm. However, 
typically, one is unable to determine $Z$ in closed form, but rather one has to perform a perturbative expansion, 
i.e. the result of a calculation of $Z$ will take the form of a series in some small parameter $\epsilon$. Now a problem 
arises when the logarithm of the series is taken. In particular, the Taylor series expansion of $Z$ to a given order 
in $\epsilon$ does not automatically determine the expansion of $\Omega$ to the same order. Fortunately, the situation is not all 
that bad. As we will see in the next chapter, the logarithm essentially rearranges the perturbation series in an 
order known as a cumulant expansion. 
Equation (4.25) can now be directly subjected to the general construction scheme of the 
path integral. To be concrete, let us assume that the Hamiltonian is limited to a maximum 
of two-body interactions (see Eq. (2.11) and (2.16)), 

$$ \hat H(a^\dagger, a) = \sum_{ij} h_{ij}\, a_i^\dagger a_j + \sum_{ijkl} V_{ijkl}\, a_i^\dagger a_j^\dagger a_k a_l. \tag{4.26} $$

Note that we have arranged for all of the annihilation operators to stand to the right 
of the creation operators. Fock space operators of this structure are said to be normal 
ordered. 7 The reason for emphasizing normal ordering is that such an operator can be 
readily diagonalized by means of coherent states. Dividing the “time interval” $\beta$ into $N$ 
segments and inserting coherent state resolutions of identity (steps 1, 2, and 3 of the general 
scheme), Eq. (4.25) assumes the form 

$$ Z = \int_{\bar\psi^0 = \zeta\bar\psi^N,\; \psi^0 = \zeta\psi^N} \prod_{n=1}^{N} d(\bar\psi^n, \psi^n)\, \exp\Big\{ -\delta \sum_{n=0}^{N-1} \Big[ \delta^{-1} (\bar\psi^n - \bar\psi^{n+1}) \cdot \psi^n + H(\bar\psi^{n+1}, \psi^n) - \mu N(\bar\psi^{n+1}, \psi^n) \Big] \Big\}, \tag{4.27} $$

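where $\delta = \beta/N$ and $\langle\psi| \hat H(a^\dagger, a) |\psi'\rangle / \langle\psi|\psi'\rangle = \sum_{ij} h_{ij} \bar\psi_i \psi'_j + \sum_{ijkl} V_{ijkl} \bar\psi_i \bar\psi_j \psi'_k \psi'_l \equiv H(\bar\psi, \psi')$ 
(similarly $N(\bar\psi, \psi')$) and we have adopted the shorthand $\psi^n = \{\psi_i^n\}$, etc. Finally, sending $N \to \infty$ 
and taking limits analogous to those leading from Eq. (3.5) to (3.6) we obtain the continuum 
version of the path integral, 8 

$$ Z = \int D(\bar\psi, \psi)\, e^{-S[\bar\psi, \psi]}, \qquad S[\bar\psi, \psi] = \int_0^\beta d\tau\, \big[ \bar\psi \partial_\tau \psi + H(\bar\psi, \psi) - \mu N(\bar\psi, \psi) \big], \tag{4.28} $$

where $D(\bar\psi, \psi) = \lim_{N\to\infty} \prod_{n=1}^{N} d(\bar\psi^n, \psi^n)$, and the fields satisfy the boundary condition 

$$ \bar\psi(0) = \zeta \bar\psi(\beta), \qquad \psi(0) = \zeta \psi(\beta). \tag{4.29} $$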
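To see the time-slicing construction at work, one can evaluate the discretized integral for the simplest case of a single level with $\hat H - \mu \hat N = \xi\, a^\dagger a$, where the exponent is quadratic and the integral reduces to a determinant. The sketch below is an illustration under stated assumptions: the symbol `xi`, the function names, and the explicit matrix `M` of the quadratic form are introduced here, and the bosonic branch assumes $\xi > 0$ so that the complex Gaussian integral (equal to $1/\det M$) converges.

```python
import numpy as np

def discretized_partition_function(xi, beta, n_slices, zeta):
    """Time-sliced Z for H - mu N = xi * a^dagger a, from the quadratic form of Eq. (4.27).

    zeta = +1 (bosons, requires xi > 0) or -1 (fermions); n_slices >= 2.
    """
    delta = beta / n_slices
    hop = 1.0 - delta * xi
    # exponent = -sum_{m,n} psibar^m M[m, n] psi^n, time slices n = 0, ..., N-1
    M = np.eye(n_slices)
    for n in range(n_slices - 1):
        M[n + 1, n] = -hop                  # couples psibar^{n+1} to psi^n
    M[0, n_slices - 1] = -zeta * hop        # boundary condition psibar^N = zeta psibar^0
    # Gaussian integration: det M for Grassmann fields, 1/det M for complex fields
    return np.linalg.det(M) ** (-zeta)

def exact_partition_function(xi, beta, zeta):
    """Single-level result: 1 + e^{-beta xi} (fermions), 1/(1 - e^{-beta xi}) (bosons)."""
    return (1.0 - zeta * np.exp(-beta * xi)) ** (-zeta)

xi, beta = 0.8, 1.5
for zeta in (+1, -1):
    for n_slices in (10, 100, 1000):
        z_n = discretized_partition_function(xi, beta, n_slices, zeta)
        print(zeta, n_slices, z_n, exact_partition_function(xi, beta, zeta))
```

For this cyclic matrix the determinant can be worked out in closed form, $\det M = 1 - \zeta (1 - \delta\xi)^N$, which tends to $1 - \zeta e^{-\beta\xi}$ as $N \to \infty$. The only place where the statistics enters is the corner element fixed by the boundary condition.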

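Written in a more explicit form, the action associated with the general pair-interaction 
Hamiltonian (4.26) can be cast in the form 

$$ S = \int_0^\beta d\tau\, \Big[ \sum_{ij} \bar\psi_i(\tau) \big[ (\partial_\tau - \mu) \delta_{ij} + h_{ij} \big] \psi_j(\tau) + \sum_{ijkl} V_{ijkl}\, \bar\psi_i(\tau) \bar\psi_j(\tau) \psi_k(\tau) \psi_l(\tau) \Big]. \tag{4.30} $$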

Notice that the structure of the action fits nicely into the general scheme discussed in 
the previous chapter. By analogy, one would expect that the exponent of the many-body 


7 More generally, an operator is defined to be “normal ordered” with respect to a given vacuum state $|0\rangle$ if, and 
only if, it annihilates $|0\rangle$. Note that the vacuum need not necessarily be defined as a zero-particle state. If the 
vacuum contains particles, normal ordering need not lead to a representation where all annihilators stand to the 
right. If, for whatever reason, one is given a Hamiltonian whose structure differs from Eq. (4.26), one can always 
effect a normal ordered form at the expense of introducing commutator terms. For example, normal ordering the 
quartic term leads to the appearance of a quadratic contribution that can be absorbed into $h_{\alpha\beta}$. 

8 Whereas the bosonic continuum limit is indeed perfectly equivalent to the one taken in constructing the quantum 
mechanical path integral ($\lim_{\delta\to 0} \delta^{-1} (\psi^{n+1} - \psi^n) = \partial_\tau \psi|_{\tau = n\delta}$ gives an ordinary derivative, etc.), a novelty 
arises in the fermionic case. The notion of replacing differences by derivatives is purely symbolic for Grassmann 
variables. There is no sense in which $\psi^{n+1} - \psi^n$ is small. The symbol $\partial_\tau \psi$ rather denotes the formal (and 
well-defined) expression $\lim_{\delta\to 0} \delta^{-1} (\psi^{n+1} - \psi^n)$. 
path integral carries the significance of the Hamiltonian action, $S \sim \int dt\, (p\dot q - H)$, where 
$(q, p)$ symbolically stands for a set of generalized coordinates and momenta. In the present 
case, the natural pair of canonically conjugate operators is $(a, a^\dagger)$. One would then interpret 
the eigenvalues $(\psi, \bar\psi)$ as “coordinates” (much as $(q, p)$ are the eigenvalues of the operators 
$(\hat q, \hat p)$). Adopting this interpretation, we see that the exponent of the path integral indeed 
has the canonical form of a Hamiltonian action and, therefore, is easy to memorize. 

Equations (4.28) and (4.30) define the functional integral in the time representation 
(in the sense that the fields are functions of a time variable). In practice we shall mostly 
find it useful to represent the action in an alternative, Fourier conjugate representation. 
To this end, note that, due to the boundary conditions (4.29), the functions $\psi(\tau)$ can be
interpreted as functions on the entire Euclidean time axis that are periodic/antiperiodic on
the interval $[0, \beta]$. As such, they can be represented in terms of a Fourier series,



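$$\psi(\tau) = \frac{1}{\sqrt{\beta}}\sum_{\omega_n}\psi_n\, e^{-i\omega_n\tau}, \qquad \psi_n = \frac{1}{\sqrt{\beta}}\int_0^\beta d\tau\,\psi(\tau)\, e^{i\omega_n\tau},$$

where

$$\omega_n = \begin{cases} 2n\pi T, & \text{bosons},\\ (2n+1)\pi T, & \text{fermions},\end{cases} \qquad n \in \mathbb{Z}, \tag{4.31}$$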


are known as Matsubara frequencies. Substituting this representation into Eqs. (4.28)
and (4.30), we obtain $Z = \int D(\bar\psi,\psi)\, e^{-S[\bar\psi,\psi]}$, where $D(\bar\psi,\psi) = \prod_n d(\bar\psi_n,\psi_n)$ defines the
measure (for each Matsubara index $n$ we have an integration over a coherent state basis
$\{|\psi_n\rangle\}$),$^{9}$ and the action takes the form

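$$S[\bar\psi,\psi] = \sum_{ij,n}\bar\psi_{in}\left[(-i\omega_n - \mu)\delta_{ij} + h_{ij}\right]\psi_{jn} + \frac{1}{\beta}\sum_{ijkl,\{n_i\}} V_{ijkl}\,\bar\psi_{in_1}\bar\psi_{jn_2}\psi_{kn_3}\psi_{ln_4}\,\delta_{n_1+n_2,\,n_3+n_4}. \tag{4.32}$$

Here we have used the identity $\int_0^\beta d\tau\, e^{-i\omega_n\tau} = \beta\,\delta_{\omega_n 0}$. Equation (4.32) defines the
frequency representation of the action.$^{10}$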
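As a quick numerical illustration of Eq. (4.31), the following minimal sketch evaluates a few Matsubara frequencies and checks that each Fourier mode $e^{-i\omega_n\tau}$ is periodic (bosons) or antiperiodic (fermions) under $\tau \to \tau + \beta$. The function names and the temperature value are illustrative assumptions of this sketch, not part of the text.

```python
import cmath
import math


def matsubara_frequency(n, temperature, statistics):
    """Return omega_n of Eq. (4.31): 2*n*pi*T (bosons) or (2*n+1)*pi*T (fermions)."""
    if statistics == "boson":
        return 2 * n * math.pi * temperature
    if statistics == "fermion":
        return (2 * n + 1) * math.pi * temperature
    raise ValueError("statistics must be 'boson' or 'fermion'")


def boundary_phase(n, temperature, statistics):
    """Phase exp(-i*omega_n*beta) picked up by a Fourier mode when tau -> tau + beta."""
    beta = 1.0 / temperature
    omega = matsubara_frequency(n, temperature, statistics)
    return cmath.exp(-1j * omega * beta)


T = 0.37  # arbitrary illustrative temperature (units with k_B = 1)
bose_phases = [boundary_phase(n, T, "boson") for n in range(-3, 4)]
fermi_phases = [boundary_phase(n, T, "fermion") for n in range(-3, 4)]

# Bosonic modes return to themselves, fermionic modes change sign.
print(all(abs(p - 1) < 1e-9 for p in bose_phases))
print(all(abs(p + 1) < 1e-9 for p in fermi_phases))
```

The two checks mirror the boundary conditions (4.29): the phase is $+1$ for every bosonic mode and $-1$ for every fermionic mode, independently of the index $n$.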


INFO In performing calculations in the Matsubara representation, one sometimes runs into 
convergence problems (which will manifest themselves in the form of ill-convergent Matsubara 
frequency summations). In such cases it will be important to remember that Eq. (4.32) does not 
actually represent the precise form of the action. What is missing is a convergence generating 
factor whose presence follows from the way in which the integral was constructed, and which 
will save us in cases of non-convergent sums (except, of course, in cases where divergences have 
a physical origin). More precisely, since the fields $\bar\psi$ are evaluated infinitesimally later than the
fields $\psi$ (cf. Eq. (4.27)), the $h$- and $\mu$-dependent contributions to the action acquire a factor


9 Notice, however, that the fields $\psi_n$ carry dimension [energy]$^{-1/2}$, i.e. the frequency coherent state integral is
normalized as $\int d(\bar\psi_n,\psi_n)\, e^{-\bar\psi_n\epsilon\psi_n} = (\beta\epsilon)^{-\zeta}$.

10 As to the signs of the Matsubara indices appearing in Eq. (4.32), note that the Fourier representation of $\bar\psi$ is
defined as $\bar\psi(\tau) = \frac{1}{\sqrt{\beta}}\sum_n \bar\psi_n\, e^{+i\omega_n\tau}$, $\bar\psi_n = \frac{1}{\sqrt{\beta}}\int_0^\beta d\tau\,\bar\psi(\tau)\, e^{-i\omega_n\tau}$. In the bosonic case, this sign convention is
motivated by $\bar\psi$ being the complex conjugate of $\psi$. For reasons of notational symmetry, this convention is also
adopted in the fermionic case.

$\exp(-i\omega_n\delta)$, $\delta$ infinitesimal. Similarly, the $V$ contribution acquires a factor $\exp(-i(\omega_{n_1} + \omega_{n_2})\delta)$.
In cases where the convergence is not critical, we will omit these contributions. However, once 
in a while it is necessary to remember their presence. 


Partition function of non-interacting gas 

As a first exercise, let us consider the quantum partition function of the non-interacting gas. 
(Later, this object will prove to be a “reference” in the development of weakly interacting 
theories.) In some sense, the field integral formulation of the non-interacting partition func- 
tion has a status similar to that of the path integral for the quantum harmonic oscillator: 
the direct quantum mechanical solution of the problem is straightforward and application 
of the full artillery of the field integral seems somewhat ludicrous. From a pedagogical point 
of view, however, the free partition function is a good problem; it provides us with the 
welcome opportunity to introduce a number of practical concepts of field integration within 
a comparatively simple setting. Also, the field integral representation of the free partition 
function will be an important operational building block for our subsequent analysis of 
interacting problems. 

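Consider, then, the partition function (4.28) with $H_0(\bar\psi,\psi) = \sum_{ij}\bar\psi_i H_{0,ij}\psi_j$. Diagonalizing
$H_0$ by a unitary transformation $U$, $H_0 = U D U^\dagger$, and transforming integration variables,
$U^\dagger\psi \equiv \phi$, the action assumes the form $S = \sum_a\sum_{\omega_n}\bar\phi_{an}(-i\omega_n + \xi_a)\phi_{an}$, where $\xi_a \equiv \epsilon_a - \mu$
and $\epsilon_a$ are the single-particle eigenvalues. Remembering that the fields $\phi_a(\tau)$ are independent
integration variables (Exercise: Why does the transformation $\psi \to \phi$ have a Jacobian
of unity?), we find that the partition function decouples, $Z = \prod_a Z_a$, where

$$Z_a = \int D(\bar\phi_a,\phi_a)\, e^{-\sum_n \bar\phi_{an}(-i\omega_n + \xi_a)\phi_{an}} = \prod_n\left[\beta(-i\omega_n + \xi_a)\right]^{-\zeta}, \tag{4.33}$$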


and the last equality follows from the fact that the integrals over $\phi_{an}$ are one-dimensional
complex or Grassmann Gaussian integrals. Here, let us recall our convention defining $\zeta =
1\,(-1)$ for bosonic (fermionic) fields. At this stage, we have left all aspects of field integration
behind us and reduced the problem to one of computing an infinite product over factors
$i\omega_n - \xi_a$. Since products are usually more difficult to get under control than sums, we take
the logarithm of $Z$ to obtain the free energy

$$F = -T\ln Z = T\zeta\sum_{an}\ln\left[\beta(-i\omega_n + \xi_a)\right]. \tag{4.34}$$

INFO Before proceeding with this expression let us take a second look at the intermediate 
identity (4.33). Our calculation showed the partition function to be the product over all
eigenvalues of the operator $-i\hat\omega + \hat{H} - \mu\hat{N}$ defining the action of the non-interacting system (here,
$\hat\omega = \{\omega_n\delta_{nn'}\}$). As such, it can be written compactly as:

$$Z = \det\left[\beta(-i\hat\omega + \hat{H} - \mu\hat{N})\right]^{-\zeta}. \tag{4.35}$$


This result was derived by first converting to an eigenvalue integration and then performing
the one-dimensional integrals over “eigencomponents” $\phi_{an}$. While technically straightforward,
that - explicitly representation-dependent - procedure is not well suited to generalization to
more complex problems. (Keep in mind that later on we will want to embed the free action of
the non-interacting problem into the more general framework of an interacting theory.)

Indeed, it is not necessary to refer to an eigenbasis at all. In the bosonic case, Eq. (3.17) 
tells us that Gaussian integration over a bilinear $\sim \bar\phi X\phi$ generates the inverse determinant of
X. Similarly, as we have seen, Gaussian integration extends to the Grassmann case with the 
determinants appearing in the numerator rather than in the denominator (as exemplified by 
Eq. (4.35)). (As a matter of fact, Eq. (4.33) is already a proof of this relation.) 


We now have to face up to a technical problem: how do we compute Matsubara sums
of the form $\sum_n \ln(i\omega_n - x)$? Indeed, it takes little imagination to foresee that sums of
the type $\sum_{n_1,n_2,\ldots} X(\omega_{n_1},\omega_{n_2},\ldots)$, where $X$ symbolically stands for some function, will
be a recurrent structure in the analysis of functional integrals. A good ansatz would be
to argue that, for sufficiently low temperatures (i.e. temperatures smaller than any other
characteristic energy scale in the problem), the sum can be traded for an integral, specifically
$T\sum_n \to \int d\omega/(2\pi)$. However, this approximation is too crude to capture much of the
characteristic temperature dependence in which one is usually interested. Yet there exists an 
alternative, and much more accurate, way of computing sums over Matsubara frequencies 
(see Info below). 

INFO Consider a single Matsubara frequency summation,

$$S \equiv \sum_n h(\omega_n), \tag{4.36}$$

where $h$ is some function and $\omega_n$ may be either bosonic or fermionic (cf. Eq. (4.31)). The basic
idea behind the standard scheme of evaluating sums of this type is to introduce a complex
auxiliary function $g(z)$ that has simple poles at $z = i\omega_n$. The sum $S$ then emerges as the sum
of residues obtained by integrating the product $gh$ along a suitably chosen path in the complex
plane. Typical choices of $g$ include


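$$g(z) = \begin{cases}\dfrac{\beta}{\exp(\beta z) - 1}, & \text{bosons},\\[1ex] \dfrac{\beta}{\exp(\beta z) + 1}, & \text{fermions},\end{cases} \qquad\text{or}\qquad g(z) = \begin{cases}\dfrac{\beta}{2}\coth(\beta z/2), & \text{bosons},\\[1ex] \dfrac{\beta}{2}\tanh(\beta z/2), & \text{fermions},\end{cases} \tag{4.37}$$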


where, in much of this section, we will employ the functions of the first column. (Notice the 
similarity between these functions and the familiar Fermi and Bose distribution functions.) In 
practice, the choice of the counting function is mostly a matter of taste, save for some cases 
where one of the two options is dictated by convergence criteria. 

Integration over the path shown in the left part of Fig. 4.2 then produces 

$$\frac{\zeta}{2\pi i}\oint dz\, g(z)h(-iz) = \zeta\sum_n \mathrm{Res}\,\big(g(z)h(-iz)\big)\big|_{z=i\omega_n} = \sum_n h(\omega_n) = S,$$

where, in the third identity, we have used the fact that the “counting functions” $g$ are chosen
so as to have residue $\zeta$ and it is assumed that the integration contour closes at $z \to \pm i\infty$.
(The difference between using the first and the second column of Eq. (4.37) lies in the value
of the residue. In the latter case, it is equal to unity rather than $\zeta$.) Now, the integral along a
contour in the immediate vicinity of the poles of $g$ is usually intractable. However, as long as
we are careful not to cross any singularities of $g$ or the function $h(-iz)$ (symbolically indicated

Figure 4.2 (a) The integration contour employed in calculating the sum (4.36). (b) The deformed
integration contour.


by isolated crosses in Fig. 4.2$^{11}$) we are free to distort the integration path, ideally to a contour
along which the integral can be done. Finding a suitable contour is not always straightforward.
If the product $hg$ decays sufficiently fast as $|z| \to \infty$ (i.e. faster than $z^{-1}$), one will usually try
to “inflate” the original contour to an infinitely large circle (Fig. 4.2(b)).$^{12}$ The integral along
the outer perimeter of the contour then vanishes and one is left with the integral around the
singularities of the function $h$. In the simple case where $h(-iz)$ possesses a number of isolated
singularities at $\{z_k\}$ (i.e. the situation indicated in the figure) we thus obtain

$$S = \frac{\zeta}{2\pi i}\oint dz\, h(-iz)g(z) = -\zeta\sum_k \mathrm{Res}\, h(-iz)g(z)\big|_{z=z_k}, \tag{4.38}$$

where the contour integral encircles the singularities of $h(-iz)$ in clockwise direction. The
computation of the infinite sum $S$ has now been reduced to the evaluation of a finite number
of residues - a task that is always possible!
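To see Eq. (4.38) at work numerically, the sketch below compares a brute-force Matsubara sum with the residue formula. It uses the rapidly convergent test function $h(\omega_n) = 1/(\omega_n^2 + \xi^2)$, which is our own choice for illustration and not the example treated in the text; the truncation `n_max` and all function names are likewise assumptions of this sketch.

```python
import math


def g(z, beta, zeta):
    """Counting function from the first column of Eq. (4.37): beta/(exp(beta*z) - zeta)."""
    return beta / (math.exp(beta * z) - zeta)


def direct_sum(xi, T, zeta, n_max=200_000):
    """Truncated sum of h(omega_n) = 1/(omega_n**2 + xi**2) over n = -n_max..n_max."""
    total = 0.0
    for n in range(-n_max, n_max + 1):
        if zeta == 1:
            omega = 2 * n * math.pi * T          # bosonic frequency
        else:
            omega = (2 * n + 1) * math.pi * T    # fermionic frequency
        total += 1.0 / (omega * omega + xi * xi)
    return total


def residue_sum(xi, T, zeta):
    """Right-hand side of Eq. (4.38) for this particular h.

    h(-iz) = 1/(xi**2 - z**2) has simple poles at z = +xi and z = -xi
    with residues -1/(2*xi) and +1/(2*xi), respectively.
    """
    beta = 1.0 / T
    residue_at_plus_xi = -g(xi, beta, zeta) / (2 * xi)
    residue_at_minus_xi = g(-xi, beta, zeta) / (2 * xi)
    return -zeta * (residue_at_plus_xi + residue_at_minus_xi)


xi, T = 1.3, 0.5  # illustrative values
for zeta in (1, -1):
    print(zeta, direct_sum(xi, T, zeta), residue_sum(xi, T, zeta))
```

For this $h$ the two residues combine into the closed forms $(\beta/2\xi)\coth(\beta\xi/2)$ for bosons and $(\beta/2\xi)\tanh(\beta\xi/2)$ for fermions, which the truncated sum approaches as `n_max` grows.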

To illustrate the procedure on a simple example, let us consider the function 


$$h(\omega_n) = -\frac{\zeta T}{i\omega_n e^{-i\omega_n\delta} - \xi},$$

where $\delta$ is a positive infinitesimal.$^{13}$ To evaluate the sum $S = \sum_n h(\omega_n)$, we first observe that
the product $h(-iz)g(z)$ has benign convergence properties. Further, the function $h(-iz)$ has a

11 Remember that a function that is bounded and analytic in the entire complex plane is constant, i.e. every 
“interesting” function will have singularities. 

12 Notice that the condition $\lim_{|z|\to\infty}|hg| < z^{-1}$ is not as restrictive as it may seem. The reason is that the
function h will be mostly related to physical observables that approach some limit (or vanish) for large excitation 
energies. This implies vanishing in at least portions of the complex plane. The convergence properties of g 
depend on the concrete choice of the counting function. (Exercise: Explore the convergence properties of the 
functions shown in Eq. (4.37).) 

13 In fact, this choice of $h$ is actually not as artificial as it may seem. The expectation value of the number of
particles in the grand canonical ensemble is defined through the identity $N = -\partial F/\partial\mu$, where $F$ is the free
energy. In the non-interacting case, $F$ is given by Eq. (4.34) and, remembering that $\xi_a = \epsilon_a - \mu$, one obtains
$N \approx \zeta T\sum_{an}\frac{1}{-i\omega_n + \xi_a}$. Now, why did we write “$\approx$” instead of “$=$”? The reason is that the right-hand side,
obtained by naive differentiation of Eq. (4.34), is ill-convergent. (The sum $\sum_{n=-\infty}^{\infty}\frac{1}{n+x}$, $x$ arbitrary, does
not exist!) At this point we have to remember the remark made on page 168, i.e., had we carefully treated
the discretization of the field integral, both the logarithm of the free energy and $\partial_\mu F$ would have acquired
infinitesimal phases $\exp(-i\omega_n\delta)$. As an exercise, try to keep track of the discretization of the field integral from

simple pole that, in the limit $\delta \to 0$, lies on the real axis at $z = \xi$. This leads to the result

$$\sum_n h(\omega_n) = -\zeta\,\mathrm{Res}\, g(z)h(-iz)\big|_{z=\xi} = \frac{1}{e^{\beta\xi} - \zeta}.$$

We have thus arrived at the important identity

$$-\zeta T\sum_n \frac{1}{i\omega_n - \xi_a} = \begin{cases} n_B(\epsilon_a), & \text{bosons},\\ n_F(\epsilon_a), & \text{fermions},\end{cases} \tag{4.39}$$

where

$$n_F(\epsilon) = \frac{1}{\exp(\beta(\epsilon - \mu)) + 1}, \qquad n_B(\epsilon) = \frac{1}{\exp(\beta(\epsilon - \mu)) - 1}, \tag{4.40}$$

are the Fermi/Bose distribution functions. As a corollary we note that the expectation value
for the number of particles in a non-interacting quantum gas assumes the familiar form $N =
\sum_a n_{F/B}(\epsilon_a)$.

Before returning to our discussion of the partition function, let us note that life is not always 
as simple as the example above. More often than not, the function h contains not only isolated 
singularities but also cuts or worse singularities. In such circumstances, finding a good choice of 
the integration contour can be far from straightforward! 
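The role of the convergence generating factor in Eq. (4.39) can be made tangible with a small numerical experiment. The sketch below drops the factor $e^{-i\omega_n\delta}$ and instead truncates the sum symmetrically in $\omega_n$; this symmetric cutoff is an assumption of the sketch, not the regularization prescribed by the discretized field integral. With that prescription the sum converges to $n_{B/F} + \zeta/2$ rather than to $n_{B/F}$, which illustrates that the naive sum is regularization dependent and that the infinitesimal phase is what fixes the physical answer.

```python
import math


def frequencies(T, zeta, n_max):
    """Matsubara frequencies arranged symmetrically around zero."""
    if zeta == 1:
        return [2 * n * math.pi * T for n in range(-n_max, n_max + 1)]
    return [(2 * n + 1) * math.pi * T for n in range(-n_max, n_max)]


def symmetric_sum(xi, T, zeta, n_max=100_000):
    """-zeta*T*sum_n 1/(i*omega_n - xi) with a symmetric cutoff and no phase factor."""
    total = sum(1.0 / (1j * w - xi) for w in frequencies(T, zeta, n_max))
    # The imaginary parts of the +omega and -omega terms cancel pairwise.
    return (-zeta * T * total).real


def distribution(xi, T, zeta):
    """Bose (zeta = 1) or Fermi (zeta = -1) function 1/(exp(xi/T) - zeta)."""
    return 1.0 / (math.exp(xi / T) - zeta)


xi, T = 1.3, 0.5  # illustrative values, xi = epsilon - mu
for zeta in (1, -1):
    print(zeta, symmetric_sum(xi, T, zeta), distribution(xi, T, zeta) + zeta / 2)
```

The offset $\zeta/2$ is independent of $\xi$ and $T$, so it is easy to overlook in derivatives with respect to these parameters but matters for quantities such as the particle number itself.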


Returning to the problem of computing the sum (4.34), consider for a moment a fixed
eigenvalue $\xi_a \equiv \epsilon_a - \mu$. In this case, we need to evaluate
the sum $S \equiv \sum_n h(\omega_n)$, where $h(\omega_n) \equiv \zeta T\ln[\beta(-i\omega_n + \xi)] = \zeta T\ln[\beta(i\omega_n - \xi)] + C$
and $C$ is an inessential constant. As discussed before, the sum can be represented
as $S = \frac{\zeta}{2\pi i}\oint dz\, g(z)h(-iz)$, where $g(z) = \beta(e^{\beta z} - \zeta)^{-1}$
is ($\beta$ times) the distribution function and the contour
encircles the poles of $g$ as in Fig. 4.2(a). Now, there is
an essential difference from the example discussed previously,
i.e. the function $h(-iz) = \zeta T\ln(z - \xi) + C$ has
a branch cut along the real axis, $z \in (-\infty, \xi)$ (see the
figure). To avoid contact with this singularity one must
distort the integration contour as shown in the figure.
Noticing that the (suitably regularized, cf. our previous discussion of the particle number
$N$) integral along the perimeter vanishes, we conclude that

$$S = \frac{T}{2\pi i}\int_{-\infty}^{\infty} d\epsilon\, g(\epsilon)\left(\ln(\epsilon^+ - \xi) - \ln(\epsilon^- - \xi)\right),$$



its definition to Eq. (4.34) to show that the accurate expression for $N$ reads

$$N = \sum_{an} h(\omega_n)\big|_{\xi=\xi_a},$$

where $h$ is the function introduced above. (Note that the necessity to keep track of the lifebuoy $e^{-i\delta\omega_n}$ does
not arise too often. Most Matsubara sums of physical interest relate to functions $f$ that decay faster than $z^{-1}$.)

where $\epsilon^\pm = \epsilon \pm i\eta$, $\eta$ is a positive infinitesimal, and we have used the fact that $g(\epsilon^\pm) \simeq g(\epsilon)$
is continuous across the cut. (Also, without changing the value of the integral (exercise:
why?), we have enlarged the integration interval from $(-\infty, \xi]$ to $(-\infty, \infty)$.) To evaluate
the integral, we observe that $g(\epsilon) = \zeta\,\partial_\epsilon\ln(1 - \zeta e^{-\beta\epsilon})$ and integrate by parts:

$$S = -\frac{\zeta T}{2\pi i}\int d\epsilon\,\ln\left(1 - \zeta e^{-\beta\epsilon}\right)\left(\frac{1}{\epsilon^+ - \xi} - \frac{1}{\epsilon^- - \xi}\right) = \zeta T\ln\left(1 - \zeta e^{-\beta\xi}\right).$$

Insertion of this result into Eq. (4.34) finally gives the familiar expression

$$F = \zeta T\sum_a \ln\left(1 - \zeta e^{-\beta(\epsilon_a - \mu)}\right), \tag{4.41}$$

for the free energy of the non-interacting Fermi/Bose gas. While this result could have been
obtained much more straightforwardly by the methods of quantum statistical mechanics,
we will shortly see how powerful a tool the methods discussed in this section are when it
comes to the analysis of less elementary problems!
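Equation (4.41) is easily cross-checked against the elementary route mentioned above. The following sketch sums the grand canonical partition function explicitly over occupation-number states of a tiny system and compares $-T\ln Z$ with the closed formula. The two-level spectrum, the chemical potential, and the bosonic occupation cutoff are arbitrary assumptions made for this illustration.

```python
import itertools
import math


def free_energy_formula(levels, mu, T, zeta):
    """Eq. (4.41): F = zeta*T*sum_a ln(1 - zeta*exp(-(eps_a - mu)/T))."""
    return zeta * T * sum(
        math.log(1.0 - zeta * math.exp(-(eps - mu) / T)) for eps in levels
    )


def free_energy_brute_force(levels, mu, T, zeta, n_boson_max=60):
    """-T ln Z with Z summed explicitly over occupation-number states.

    Fermionic levels hold 0 or 1 particle; bosonic occupations are cut off
    at n_boson_max, which is harmless as long as eps_a - mu > 0.
    """
    max_occupation = 1 if zeta == -1 else n_boson_max
    Z = 0.0
    for occupations in itertools.product(range(max_occupation + 1), repeat=len(levels)):
        energy = sum(n * (eps - mu) for n, eps in zip(occupations, levels))
        Z += math.exp(-energy / T)
    return -T * math.log(Z)


levels, mu, T = [0.2, 0.9], -0.3, 0.5  # illustrative single-particle spectrum
for zeta in (1, -1):
    print(zeta, free_energy_formula(levels, mu, T, zeta),
          free_energy_brute_force(levels, mu, T, zeta))
```

The agreement for both values of $\zeta$ reflects the factorization $Z = \prod_a Z_a$ used in the derivation of Eq. (4.33).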


4.3 Field theoretical bosonization: a case study 

The field integral (4.28) provides an exact representation of the quantum partition function; 
it contains the full information on the microscopic Hamiltonian operator. However, what 
we are actually interested in is the universal large-scale behavior of a quantum system. To 
extract this information from the field integral we will need to identify the relevant long- 
range degrees of freedom and to dispense with the abundance of microscopic data controlling 
the short-range behavior. In other words, we will have to pass from the microscopic field 
theory to some effective long-range theory. 

In Chapter 1 we saw that there are usually two principal strategies to obtain effective 
long-range theories of microscopic systems: explicit construction - the subject of the next
two chapters - and more phenomenological approaches based on consistency considerations 
and symmetry arguments. Besides a certain lack of rigor, the principal disadvantage of the 
second route is the lack of quantitative control of the results (which implies susceptibility 
to mistakes). On the other hand, the phenomenological approach is far less laborious and 
involves a minimal amount of technical preparation. Often, the phenomenological deduction 
of a low-energy field theory precedes its rigorous construction (sometimes by years). In fact, 
there are cases where phenomenology is the only viable route. 

Below we will illustrate the power of the phenomenological approach on the example of 
the interacting one-dimensional electron gas. We will map the microscopic partition function 
of the system onto a free (and thus exactly solvable) bosonic theory.$^{14}$ In this section the
emphasis is placed on purely methodological aspects, i.e. we will derive an effective theory, 
but will not do anything with it. (Nonetheless, the derivation is instructive and helps us to 
understand the essential physics of the system!) In later chapters, the field theory derived 


14 A preliminary account of the ideas underlying this mapping has already been given in Section 2.2.

below will then serve as the starting point for the discussion of a number of interesting
applications.


One-dimensional electron gas (fermionic theory)

Non-interacting system 

Let us begin by considering the action of a non-interacting one-dimensional electron gas 

$$S_0[\psi^\dagger,\psi] = \sum_{s=\pm 1}\int dx\,d\tau\;\psi_s^\dagger\left(-isv_F\partial_x + \partial_\tau\right)\psi_s,$$

where $\psi_{+/-}$ are the right-/left-moving fermions and we have denoted the Grassmann field
conjugate to $\psi$ by $\psi^\dagger$.$^{15}$ For later reference we recall that the left-/right-moving fermion
operators are projections of the global momentum-dependent fermion operator to the vicinity
of the left/right Fermi point, i.e. $\psi^\dagger_+(q) = \psi^\dagger_{k_F+q}$, $\psi^\dagger_-(q) = \psi^\dagger_{-k_F+q}$, where $|q| \ll k_F$.

Fourier transforming this expression we obtain the approximate decomposition

$$\psi(x) = e^{ik_Fx}\psi_+(x) + e^{-ik_Fx}\psi_-(x). \tag{4.42}$$

Before proceeding, let us rewrite the action in a form that emphasizes the symmetries of
the problem:

$$S_0[\psi^\dagger,\psi] = \int d^2x\,\psi^\dagger\left(\sigma_0\partial_{x_0} + i\sigma_3\partial_{x_1}\right)\psi = \int d^2x\,\bar\psi\left(\sigma_1\partial_{x_0} + \sigma_2\partial_{x_1}\right)\psi, \tag{4.43}$$

where we have set $v_F = 1$ for notational simplicity. Here, $\psi = (\psi_+, \psi_-)^T$ is a two-component
field comprising left- and right-moving fermions, $x = (x_0, x_1) = (\tau, x)$ parameterizes $(1+1)$-dimensional
space-time, and $\bar\psi \equiv \psi^\dagger\sigma_1$. The second equality identifies the action of the free
one-dimensional fermion with that of a $(1+1)$-dimensional Dirac particle.$^{16}$
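The second equality in Eq. (4.43) rests on two Pauli-matrix products, and the Dirac interpretation on the two-dimensional Clifford algebra. Both can be confirmed with a few lines of plain matrix arithmetic. In the sketch below, the helper functions and variable names are our own, and the identification $\gamma_0 = \sigma_1$, $\gamma_1 = \sigma_2$, $\gamma_5 = \sigma_3$ follows the standard two-dimensional Euclidean convention, with $\{\gamma_\mu, \gamma_\nu\} = 2\delta_{\mu\nu}$.

```python
def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def scaled(c, a):
    """Multiply a 2x2 matrix by the scalar c."""
    return [[c * a[i][j] for j in range(2)] for i in range(2)]


def added(a, b):
    """Sum of two 2x2 matrices."""
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]


def close(a, b, tol=1e-12):
    """Entry-wise comparison of two 2x2 matrices."""
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))


identity = [[1, 0], [0, 1]]
sigma_1 = [[0, 1], [1, 0]]
sigma_2 = [[0, -1j], [1j, 0]]
sigma_3 = [[1, 0], [0, -1]]

# With psi_bar = psi^dagger sigma_1, the Dirac form of the action reads
# psi^dagger (sigma_1 sigma_1 d_0 + sigma_1 sigma_2 d_1) psi.
second_equality_holds = (
    close(matmul(sigma_1, sigma_1), identity)
    and close(matmul(sigma_1, sigma_2), scaled(1j, sigma_3))
)

# Euclidean Clifford algebra {gamma_mu, gamma_nu} = 2 delta_{mu nu} for mu, nu = 0, 1, 5.
gammas = {0: sigma_1, 1: sigma_2, 5: sigma_3}
clifford_holds = all(
    close(
        added(matmul(gammas[m], gammas[n]), matmul(gammas[n], gammas[m])),
        scaled(2 if m == n else 0, identity),
    )
    for m in gammas
    for n in gammas
)

gamma_5_from_product = scaled(-1j, matmul(sigma_1, sigma_2))  # -i gamma_0 gamma_1

print(second_equality_holds, clifford_holds, close(gamma_5_from_product, sigma_3))
```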

We next turn to the discussion of the symmetries of the problem. For one thing,
the action is clearly invariant under the transformation $\psi \to e^{i\phi_v}\psi$, where $\phi_v = \mathrm{const.}$
What is the resulting conserved current? The infinitesimal variant of this transformation is
described by $\psi \to \psi + (i\delta\phi_v)\psi$ or, in a notation adapted to Eq. (1.42), $\phi^i \leftrightarrow \psi$, $\omega^a \leftrightarrow i\phi_v$,
$F^i_a = 1$. Equation (1.43) then gives the conserved current $j_{v,\mu} = \frac{\partial\mathcal{L}}{\partial(\partial_{x_\mu}\psi)}\psi = \bar\psi\gamma_\mu\psi$. For later
reference, we mention that, under a rotation of space-time, $x_\mu \to (R\cdot x)_\mu$, the components
of $j_v$ transform like a vector, $j_\mu \to (R\cdot j)_\mu$. In relativistic field theory, $j_v$ is therefore usually
called a vector current.

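Notice that the two components of the vector, $j_0 = \psi^\dagger\psi = \psi_+^\dagger\psi_+ + \psi_-^\dagger\psi_- \equiv \rho$ and $j_1 =
i\psi^\dagger\sigma_3\psi = i\left(\psi_+^\dagger\psi_+ - \psi_-^\dagger\psi_-\right) \equiv ij$, are the charge density, $\rho$, of the system and ($i$ times)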


15 Following the remarks earlier, this represents an abuse of notation; there is no Grassmann complex conjugation!
However, within the context of relativistic fermions our standard symbol $\bar\psi$ is reserved for another object (see
below).

16 For a review of the theory of the standard four-dimensional massless Dirac equation $\gamma_\mu\partial_\mu\psi = 0$, see, e.g., L.
H. Ryder, Quantum Field Theory (Cambridge University Press, 1996). The Dirac equation affords a natural
generalization to any even-dimensional space. Specifically, for $d = 2$, the algebra of Dirac $\gamma$-matrices, $\{\gamma_\mu, \gamma_\nu\} =
2\delta_{\mu\nu}$, $\mu, \nu = 0, 1, 5$, is satisfied by $\gamma_0 \equiv \sigma_1$, $\gamma_1 \equiv \sigma_2$, $\gamma_5 \equiv -i\gamma_0\gamma_1 = \sigma_3$.

the current density, $j$, respectively.$^{17}$ Thus, the equation $-i\partial_\mu j_\mu = -i\partial_\tau\rho + \partial_x j = 0$ simply
expresses the conservation of particle current.

INFO This is a particular example of the general fact that the U(1)-symmetry of quantum mechanics
(the freedom to multiply wave functions or operators of a second quantized approach by a
constant phase $e^{i\phi}$) implies the conservation of particle current.

EXERCISE Subject the action of the general field integral (4.28) to the transformation $\psi \to
e^{i\phi}\psi$, $\bar\psi \to e^{-i\phi}\bar\psi$ and compute the resulting Noether current. Convince yourself that the
components of the current are the coherent state representation of the standard density/current
operator of quantum mechanics.

Now, the action (4.43) possesses a somewhat less obvious second symmetry: it remains
invariant under the transformation $\psi \to e^{i\phi_a\sigma_3}\psi$, $\bar\psi \to \bar\psi e^{i\phi_a\sigma_3}$. (Notice that this is not a
unitary symmetry, i.e. the matrices transforming $\psi$ and $\bar\psi$ are not inverse to each other.)
Indeed, one may note that $(\bar\psi e^{i\phi_a\sigma_3})\sigma_\mu\partial_\mu(e^{i\phi_a\sigma_3}\psi) = \bar\psi\sigma_\mu\partial_\mu\psi$, where the equality follows
from $[\sigma_\mu, \sigma_3]_+ = 0$. A straightforward
application of Noether’s theorem based on the infinitesimal variant $\psi \to \psi + (i\phi_a)\sigma_3\psi$ gives
the conserved current $j_{a,\mu} = i\bar\psi\sigma_\mu\sigma_3\psi = \epsilon_{\mu\nu}\bar\psi\sigma_\nu\psi$. Introducing a unit vector $e_2$ pointing
into a fictitious third dimension perpendicular to the space-time plane, the current can be
written as $j_a = e_2 \times j_v$. This representation shows that, under rotations, $j_a$ transforms like
an axial vector (similar to, say, a magnetic field). For this reason, $j_a$ is commonly called an
axial current.
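The two matrix statements behind the axial symmetry can again be checked directly. The sketch below verifies that $e^{i\phi_a\sigma_3}\sigma_\mu e^{i\phi_a\sigma_3} = \sigma_\mu$ for the two matrices $\sigma_1, \sigma_2$ appearing in the Dirac form of Eq. (4.43), and that $i\sigma_\mu\sigma_3 = \epsilon_{\mu\nu}\sigma_\nu$. The index assignment ($\mu = 0 \leftrightarrow \sigma_1$, $\mu = 1 \leftrightarrow \sigma_2$), the orientation $\epsilon_{01} = +1$, and the angle used are assumptions of this sketch.

```python
import cmath


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def scaled(c, a):
    """Multiply a 2x2 matrix by the scalar c."""
    return [[c * a[i][j] for j in range(2)] for i in range(2)]


def close(a, b, tol=1e-12):
    """Entry-wise comparison of two 2x2 matrices."""
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))


identity = [[1, 0], [0, 1]]
sigma_1 = [[0, 1], [1, 0]]
sigma_2 = [[0, -1j], [1j, 0]]
sigma_3 = [[1, 0], [0, -1]]
gamma = [sigma_1, sigma_2]       # matrices multiplying d_{x_0} and d_{x_1}
epsilon = [[0, 1], [-1, 0]]      # antisymmetric symbol with epsilon_{01} = +1 (assumed)


def axial_rotation(phi):
    """exp(i*phi*sigma_3), which is diagonal because sigma_3 is."""
    return [[cmath.exp(1j * phi), 0], [0, cmath.exp(-1j * phi)]]


phi_a = 0.73  # arbitrary illustrative angle
rotation = axial_rotation(phi_a)

# Invariance of the kinetic term: the same matrix acts from the left and the right.
kinetic_term_invariant = all(
    close(matmul(matmul(rotation, g), rotation), g) for g in gamma
)

# The two transformation matrices are not inverse to each other.
not_inverse_pair = not close(matmul(rotation, rotation), identity)


def epsilon_contracted(mu):
    """epsilon_{mu nu} gamma_nu, summed over nu."""
    result = [[0, 0], [0, 0]]
    for nu in range(2):
        term = scaled(epsilon[mu][nu], gamma[nu])
        result = [[result[i][j] + term[i][j] for j in range(2)] for i in range(2)]
    return result


axial_current_identity = all(
    close(scaled(1j, matmul(gamma[mu], sigma_3)), epsilon_contracted(mu))
    for mu in range(2)
)

print(kinetic_term_invariant, not_inverse_pair, axial_current_identity)
```

In components the last identity says $j_{a,0} = j_{v,1}$ and $j_{a,1} = -j_{v,0}$, i.e. the axial current is the vector current rotated by ninety degrees in the space-time plane.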

INFO The axial symmetry of the relativistic electron gas is a prominent example of a symmetry
that does not carry over to the quantum level, i.e. the conservation of the axial current breaks
down once quantum fluctuations are taken into account, a phenomenon known as the chiral
or axial anomaly.$^{18}$ Although we will meet with various manifestations of the chiral anomaly,
a thorough discussion of all its implications is beyond the scope of this text. (However, most
textbooks on particle physics contain an extensive coverage of anomalies.)

Finally, notice that the existence of two distinct symmetries, vectorial and axial, affords 
a simple interpretation in the original representation of the theory (the first equality in 
Eq. (4.43)). All it means is that the left- and right-moving fermion states do not couple, 
i.e. that the left- and the right-moving fermion particle currents are separately conserved. 
(Compute the corresponding Noether currents!) 

Given that we are dealing with but the simplest one-dimensional theory one can imagine, 
the formal discussion of symmetries may seem to be a bit of an overkill. However, we shall 
see in a moment that the effort was well invested: as soon as we switch on interactions, the 
fermionic theory ceases to be exactly solvable. It turns out, however, that our symmetry 
discussion above provides the key to a bosonic reformulation of the problem which does enjoy 
exact solvability. Yet, before turning to the bosonic approach, let us briefly recapitulate how 
interactions couple to the model. 


17 Notice that, for a one-dimensional Fermi system with uniform Fermi velocity $v_F = 1$, the current density is
equal to the density of right movers minus that of the left movers.

18 In field theory, the quantum violation of a classical conservation law is generally called an anomaly.

As in Section 2.2 we assume a short-range interaction between the left- and right-moving
densities. Quantitatively, this is described by the coherent state representation of the second
quantized Hamiltonian (2.36), i.e.

$$S_{\rm int}[\psi^\dagger,\psi] = \frac{1}{2}\sum_s\int dx\,d\tau\,\left(g_2\,\rho_s\rho_{\bar{s}} + g_4\,\rho_s\rho_s\right), \tag{4.44}$$

where $\rho_s \equiv \psi_s^\dagger\psi_s$ and $\bar{s} = -s$. Notice that the interaction term leaves the vectorial/axial symmetry of
the system intact (why?). But what else can we say about the interacting system? In fact, 
we have seen in Section 2.2 that it is difficult to understand the physics of the system in the 
microscopic language of interacting fermion states. Rather, one should turn to a formulation 
in terms of the effective long-range degrees of freedom of the model - non-dispersive charge 
density fluctuations. In the next section, we will formulate the dynamics of these excitations 
in a field theoretical language. Remarkably, it will turn out that all we need to extract this 
formulation from the microscopic model is a minimal investment of phenomenological input 
plus symmetry considerations. 


One-dimensional electron gas (bosonic theory)

We have seen in Section 2.2 that the one-dimensional fermion system supports collective
bosonic excitations. In this section, we apply phenomenological and symmetry arguments
to construct a field theory of these excitations.$^{19}$ Consider the electron operators $c^\dagger(x)$ of a
one-dimensional system. As seen in Chapter 2, the operators $c^\dagger$ afford a representation in
terms of bosons - the Jordan-Wigner transformation. As we are after an effective bosonic
theory, it is certainly a good idea to switch to this Bose representation right from the
outset. Expressed in terms of Jordan-Wigner bosons, $c^\dagger(x) = e^{i\pi\int_{-\infty}^{x} dx'\, b^\dagger(x')b(x')}\, b^\dagger(x)$,
where $b^\dagger(x)$ creates a (bosonic) unit-charge excitation at $x$ and the “Jordan-Wigner string”
$\sim \int_{-\infty}^{x} dx'\, b^\dagger(x')b(x')$ implements the fermionic exchange statistics of the $c$s in terms of
Bose operators. Now, we have seen in Section 2.2 that the dominant excitations of the
(spinless) system are charge-density fluctuations. Thus, our next step is to switch to a
representation that separately treats density and phase degrees of freedom. To this end, we
define charge and phase operators through

$$b(x) = \left(\frac{k_F}{\pi} + \rho(x)\right)^{1/2} e^{i\phi(x)}, \qquad b^\dagger(x) = e^{-i\phi(x)}\left(\frac{k_F}{\pi} + \rho(x)\right)^{1/2}, \tag{4.45}$$

where $k_F/\pi$ is the average background density of the system, $\rho$ describes density fluctuations,
and $\phi$ describes the phase of the Bose excitations.

INFO Charge-density representations of this type are frequently employed in bosonic theories.
Importantly, the transformation $(b, b^\dagger) \to (\rho, \phi)$ is canonical, i.e. the density and the phase

19 For a detailed account of the rigorous construction (elements of which have been used in Section 2.2) we refer
to H. Schoeller and J. von Delft, Bosonization for beginners - refermionization for experts, Ann. Phys. 7 (1998),
225-305.

of a bosonic excitation form a canonically conjugate pair, $[\rho(x), \phi(x')] = i\delta(x - x')$.$^{20}$ To check
this assertion, let us temporarily shift the density operator $\rho \to \rho + k_F/\pi$, so that $\rho$ now describes
the total density of the system. (Of course, the shift by a number does not alter the commutation
relations.) We thus write $b = \rho^{1/2}\exp(i\phi)$ and $b^\dagger = \exp(-i\phi)\rho^{1/2}$, where the position index has
been dropped for notational transparency, and check that $[b, b^\dagger] = 1$ indeed requires $[\rho, \phi] = i$:

$$[b, b^\dagger] = \left[\rho^{1/2}e^{i\phi},\, e^{-i\phi}\rho^{1/2}\right] = \rho - e^{-i\phi}\rho\, e^{i\phi}$$
$$\phantom{[b, b^\dagger]} = \rho - e^{-i[\phi,\,\cdot\,]}\rho = \rho - \rho + i[\phi,\rho] + \tfrac{1}{2}[\phi,[\phi,\rho]] + \cdots \overset{[\rho,\phi]=i}{=} 1,$$

as required. (In the last line we have used the general operator identity $e^{A}Be^{-A} = e^{[A,\,\cdot\,]}B$;
all nested commutators beyond the first vanish because $[\phi, \rho]$ is a number.)

To represent the fermions in terms of bosonic degrees of freedom we might now substitute the 
charge/phase decomposition directly into our expression for the Jordan-Wigner transformed 
fermion. It turns out, however, that the resulting representation - though perfectly valid - is 
not quite what we want. To obtain a representation better adjusted to the present problem, 
let us, rather, construct a fermion “from scratch.” To this end, let us think of the fermion as 
a structureless charge endowed with fermionic exchange statistics. The charge is created by 
application of $e^{i\phi}$,$^{21}$ while the fermionic exchange statistics are then generated in a second
step through multiplication by a Jordan-Wigner string. In fact, there is a certain ambiguity
in the definition of the string variable: defining $\theta(x) \equiv \pi\int_{-\infty}^{x} dx'\,\rho(x')$, it is straightforward
to show that for $m, m'$ odd integers

$$\left[e^{im\theta(x)}e^{i\phi(x)},\; e^{im'\theta(x')}e^{i\phi(x')}\right]_+ \propto e^{i\pi m'\Theta(x-x')} + e^{i\pi m\Theta(x'-x)} = 0,$$

which implies that the most general representation of the fermion operator reads as$^{22}$
$c^\dagger(x) = \Gamma\sum_{m=\pm 1,\pm 3,\ldots} e^{im(k_Fx + \theta(x))}e^{i\phi(x)}$. Here we have introduced a scalar prefactor $\Gamma$ whose
presence is needed to regularize the right-hand side of the equation in the limit of vanishing
lattice spacing (the short distance cutoff of the theory, see the problem set). We have also
reinstalled the constant shift, $\rho \to \rho + k_F/\pi \Rightarrow \theta \to \theta + k_Fx$.

EXERCISE Verify the commutator identity above. (Hint: Use the general identities $e^{A}Be^{-A} =
e^{[A,\,\cdot\,]}B$ and $e^{A}e^{B}e^{-A} = \exp\left(e^{[A,\,\cdot\,]}B\right)$.)

For most purposes, it is sufficient to keep the lowest two contributions to the harmonic
expansion of the fermion operator: $c^\dagger(x) = \Gamma\sum_{s=\pm 1} e^{is(k_Fx + \theta(x))}e^{i\phi(x)}$. Comparison with
Eq. (4.42) then leads to the tentative identification

$$c_s^\dagger(x) = \Gamma\, e^{is\theta(x)}e^{i\phi(x)}. \tag{4.46}$$


20 This implies, in particular, that the corresponding field integral transformation $(\bar\psi, \psi) \to (\rho, \phi)$ from complex
integration variables to the two real variables $(\rho, \phi)$ has a unit Jacobian (check this!).

21 Notice that $\phi$ is the “momentum” conjugate to $\rho$, i.e. the “translation operator” $e^{i\phi}$ increases the charge at
$x$ by unity.

22 To obtain the correct normalization of the anti-commutator at coinciding arguments $x = x'$, all expansion
coefficients have to be equal to each other.

Finally, one may notice that the two fields $\theta$ and $\phi$ - our principal degrees of freedom
throughout - are characterized by the commutation relations

$$[\phi(x), \theta(x')] = i\pi\Theta(x' - x). \tag{4.47}$$


At first sight, the results derived above may look strange: we have traded our simple fermion 
operator for a nonlinear representation in terms of bosonic degrees of freedom - but why? 
The point is that we are usually interested not in expressions linear in the fermion oper- 
ators but in fermion bilinears (currents, densities, etc.). In contrast to the case of a single 
fermion, fermion bilinears have very simple expressions in terms of bosons. In particular, 
the Hamiltonian operator of the interacting system becomes quadratic (manifestly solvable) 
when expressed in terms of bosons. This aspect of the theory - undoubtedly the prime 
motivation behind the method of bosonization - will be explored next. 


Non-interacting system 

We now apply a combination of symmetry and dynamical arguments to identify the Hamil- 
tonian of the non-interacting system. The fermionic prototype action is invariant under 
global rotations of space-time, $x \to x' = R\cdot x$, $\psi'(x') = \psi(x)$. This symmetry must carry over to
the bosonic description of the theory. Turning to the system’s “intrinsic” symmetries, we
notice that the vectorial and axial symmetry operations considered in the previous section
act on the left-/right-moving fermion states as $\psi_s \to e^{i\phi_v}\psi_s$ and $\psi_s \to e^{is\phi_a}\psi_s$, respectively.

Of course, these transformations must continue to be symmetries no matter which
representation of the theory (bosonic, fermionic, or whatever) is chosen. A glance at Eq. (4.46)
shows that the symmetry transformation acts on the bosonic variables by a simple shift
operation, vectorial: $(\phi, \theta) \to (\phi + \phi_v, \theta)$, axial: $(\phi, \theta) \to (\phi, \theta + \phi_a)$. For $\phi_{a,v} = \mathrm{const.}$, these
transformations must not change the action, which excludes the presence of non-derivative
terms. (For example, a contribution such as $\sim \int\theta^2$ would not be invariant under a uniform
axial transformation, etc.)

Of course, symmetries alone do not suffice to fix the action of the system. What we need, in
addition, is a minimal amount of dynamical input. More specifically, what we shall build
on is the fact that the creation of a density distortion $\rho = \partial_x\theta/\pi$ in the system costs a certain
amount of energy $U$. Assuming that $U \sim \rho^2$ (i.e. that screening has rendered the Coulomb
interaction effectively short-range), the Lagrangian action of the charge displacement field
$\theta$ will contain a term $\sim(\partial_x\theta)^2$. However, this expression lacks rotational invariance. Its
canonical rotationally invariant extension reads $(\partial_x\theta)^2 + (\partial_\tau\theta)^2$. Thus, up to second order
in derivatives, the Lagrangian action is given by $S_0[\theta] = \frac{c}{2}\int dx\,d\tau\,[(\partial_x\theta)^2 + (\partial_\tau\theta)^2]$, where
the coupling constant $c$ needs to be specified. This expression tells us that the field $\theta$ has
a linear dispersion and propagates at constant velocity (recall our discussion of, e.g., the
phonon action in Chapter 1). Recalling that $\partial_x\theta \sim \rho$, this confirms our results regarding the
behavior of density distortions in one-dimensional electron systems derived in Section 2.2.

To fix the value of the coupling constant $c$, we must compute the “correlation function”[23]

$$C(x,\tau) \equiv \langle(\bar\psi_+\psi_-)(x,\tau)\,(\bar\psi_-\psi_+)(0,0)\rangle_\psi, \tag{4.48}$$

first in the fermionic, then in the bosonic, language, and require coinciding answers.
Referring for a detailed discussion to the problem set, we here merely note that the result obtained
for the free fermion action Eq. (4.43) reads $C(x,\tau) = (4\pi^2(x^2+\tau^2))^{-1}$. The bosonic variant
$C(x,\tau) = \Gamma^4\langle e^{2i\theta(x,\tau)}e^{-2i\theta(0,0)}\rangle_\theta$ leads to the same expression provided that we set $c = 1/\pi$
and $\Gamma = 1/(2\pi a)^{1/2}$.

Thus, our final result for the Lagrangian of the non-interacting system reads

$$S[\theta] = \frac{1}{2\pi}\int dx\,d\tau\,[(\partial_x\theta)^2 + (\partial_\tau\theta)^2].$$

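Before proceeding to include interactions, let us turn from the Lagrangian to the
Hamiltonian formulation. Using the fact that the canonical momentum corresponding to $\theta$ is
$\pi_\theta = \partial_{\partial_\tau\theta}\mathcal{L} = \partial_\tau\theta/\pi$, we obtain the Hamiltonian density $\mathcal{H} = \frac{1}{2\pi}[(\partial_x\theta)^2 + \pi^2\pi_\theta^2]$ and the action

$$S[\theta,\pi_\theta] = \frac{1}{2}\int dx\,d\tau\,\Big[\frac{1}{\pi}(\partial_x\theta)^2 + \pi\,\pi_\theta^2 + 2i\,\partial_\tau\theta\,\pi_\theta\Big].$$

At the same time, the relation (4.47) identifies the canonical momentum as $\pi_\theta = \partial_x\phi/\pi$.
Expressed in terms of the two fundamental degrees of freedom of the theory, the Hamiltonian
action thus reads

$$S[\theta,\phi] = \frac{1}{2\pi}\int dx\,d\tau\,[(\partial_x\theta)^2 + (\partial_x\phi)^2 + 2i\,\partial_\tau\theta\,\partial_x\phi]. \tag{4.49}$$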

It is instructive to inspect the Noether current corresponding to the vectorial symmetry in
the bosonic language. A straightforward application of Noether’s theorem to the
transformation $\phi \to \phi + \phi_v$ obtains $\partial_\mu j_{v,\mu} = 0$, where $j_{v,0} \equiv \rho = \partial_x\theta/\pi$ and $j_{v,1} \equiv ij =
-i\partial_x\phi/\pi$.[24] While the first of these relations merely reiterates the definition of the
displacement field, the second contains new information. Remembering that $\rho = \rho_+ + \rho_-$ and
$j = \rho_+ - \rho_-$, we obtain the important identification

$$\rho_\pm = \frac{1}{2\pi}\,[\partial_x\theta \mp \partial_x\phi] \tag{4.50}$$

of the left- and right-moving densities in terms of the Bose field.

EXERCISE Compute the bosonic representation of the axial Noether current. 


[23] The function $C$ describes the correlation of the bilinear $\bar\psi_+\psi_-$ with itself measured at different values of space
and time. We have more to say on the subject of correlation functions in the next chapter.

[24] The (arbitrary) normalization constant $\pi^{-1}$ has been chosen to obtain consistency with previous definitions,
see below.
Interacting system

We are now in a position to turn to the interacting case. In fact, all the hard work necessary
to get the interacting problem under control has already been done! We simply substitute
the bosonic representation Eq. (4.50) into the Coulomb action Eq. (4.44) and obtain

$$S_{\rm int} = \frac{1}{8\pi^2}\sum_{s=\pm}\int dx\,d\tau\,[g_2\,\partial_x(\theta - s\phi)\,\partial_x(\theta + s\phi) + g_4\,\partial_x(\theta - s\phi)\,\partial_x(\theta - s\phi)]$$

$$= \frac{1}{4\pi^2}\int dx\,d\tau\,[(g_2 + g_4)(\partial_x\theta)^2 + (g_4 - g_2)(\partial_x\phi)^2],$$

i.e. an action that is still quadratic and, thus, exactly solvable.

INFO In the field theoretical literature (especially the literature on conformal field theory), 
interactions of this type are commonly referred to as current-current interactions. This
is because Sint can be expressed as a bilinear form in the Noether currents generated by the 
symmetries of the system. 

Adding $S_{\rm int}$ to the non-interacting action Eq. (4.49), we arrive at the final
expression

$$S[\theta,\phi] = \frac{1}{2\pi}\int dx\,d\tau\,[g^{-1}v(\partial_x\theta)^2 + gv(\partial_x\phi)^2 + 2i\,\partial_\tau\theta\,\partial_x\phi], \tag{4.51}$$

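where we have introduced the parameters

$$v = \Big[\Big(v_F + \frac{g_4}{2\pi}\Big)^2 - \Big(\frac{g_2}{2\pi}\Big)^2\Big]^{1/2}, \qquad g = \Big[\frac{v_F + \frac{g_4 - g_2}{2\pi}}{v_F + \frac{g_4 + g_2}{2\pi}}\Big]^{1/2},$$

and have reinstated $v_F$. Comparison with the results of Section 2.2 identifies $v$ as the
effective velocity of the charge-density wave excitations of the system.[25] Equation (4.51)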
represents the main result of our analysis. We have succeeded in mapping a non-linear (i.e. 
interacting) problem onto a linear bosonic field theory. Critical readers may justly object 
that this result does not contain much new information. After all, a (second quantized) 
representation of the theory in terms of free bosons has been derived previously in Section 
2.2. Nonetheless, the analysis of this section is very valuable from both a methodological
and a conceptual point of view. Methodologically, it turns out that the field integral
formulation $\int D(\theta,\phi)\exp(-S[\theta,\phi])$ is very convenient for calculation. Many intriguing
phenomena displayed by one-dimensional electron systems (we will meet some examples) can
straightforwardly be addressed in this language. From a conceptual point of view, it is 
quite remarkable that no “microscopic” calculations had to be performed to determine the 
effective field theory above. “All” we had to invest was symmetry arguments, a minimal 
amount of phenomenological input, and a number of consistency checks. In Chapter 9 we 
will meet a related problem - the low-energy field theory of quantum spin chains - where
a microscopic construction scheme for the effective field theory is not known at all. In this
case a (suitably generalized) version of the symmetry analysis above will represent the only
viable route towards the solution of the problem.

[25] Of course, we do not need to resort to the results of Section 2.2 to draw this conclusion: the structure of
Eq. (4.51) by itself determines the interpretation of $v$ as an effective velocity. To see this, one may integrate
over $\phi$ to arrive at an effective wave-type Lagrangian for $\theta$ whose characteristic velocity is set by $v$ (exercise).
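
As a small numerical illustration of Eq. (4.51), the following Python sketch evaluates the velocity v and the coupling g from v_F, g_2 and g_4. It assumes the parameter definitions v = [(v_F + g_4/2π)² − (g_2/2π)²]^{1/2} and g = [(v_F + (g_4 − g_2)/2π)/(v_F + (g_4 + g_2)/2π)]^{1/2}; the function name and the sample couplings are hypothetical and serve only as a consistency check.

```python
import math


def luttinger_parameters(v_f, g2, g4):
    """Return (v, g) entering the quadratic action of Eq. (4.51).

    Assumed definitions (hbar = 1, illustrative only):
        v = sqrt((v_f + g4/2pi)^2 - (g2/2pi)^2)
        g = sqrt((v_f + (g4 - g2)/2pi) / (v_f + (g4 + g2)/2pi))
    """
    a = v_f + g4 / (2.0 * math.pi)
    b = g2 / (2.0 * math.pi)
    if a <= abs(b):
        # Outside this window the square roots become imaginary.
        raise ValueError("need v_f + g4/2pi > |g2|/2pi")
    v = math.sqrt(a * a - b * b)
    g = math.sqrt((a - b) / (a + b))
    return v, g


if __name__ == "__main__":
    v, g = luttinger_parameters(v_f=1.0, g2=0.5, g4=0.3)
    # The coefficients of (d_x theta)^2 and (d_x phi)^2 in Eq. (4.51):
    print("v/g =", v / g, " v*g =", v * g)
```

With these definitions, v/g = v_F + (g_2 + g_4)/2π and vg = v_F + (g_4 − g_2)/2π, i.e. exactly the coefficients obtained by adding S_int to the free action. For repulsive g_2 > 0 the sketch returns g < 1, and for g_2 = g_4 = 0 it returns v = v_F, g = 1.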


4.4 Summary and outlook 

This concludes our preliminary introduction to the field integral. We have learned how to 
represent the partition function of a quantum many-body system in terms of a generalized 
path integral. The field integral representation of the partition function will be the basic 
platform on which all our further developments will be based. In fact, we are now in a 
position to face up to the main problem addressed in this text: practically none of the 
“non-trivial” field integrals in which one might be interested can be executed in closed 
form. This reflects the fact that, save for a few exceptions (such as the one considered in 
Section 4.3 above), interacting many-body problems do not admit closed solutions. Before 
employing the field integral to solve serious problems, we need to develop a spectrum of 
approximation strategies - perturbation theory, linear response theory, mean-field methods, 
instanton techniques and the like. The construction and application of such methods will 
be the subject of the following chapters. 


4.5 Problems 

Exercises on fermion coherent states 

To practice the coherent state method, we begin with a few simple exercises on the fermionic coherent 
state which complement the structures discussed in the main text. 

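Considering a fermionic coherent state $|\eta\rangle$, verify the following identities: (a) $\langle\eta|a_i^\dagger =
\langle\eta|\bar\eta_i$, (b) $a_i^\dagger|\eta\rangle = -\partial_{\eta_i}|\eta\rangle$ and $\langle\eta|a_i = \partial_{\bar\eta_i}\langle\eta|$, (c) $\langle\eta|\nu\rangle = \exp(\sum_i\bar\eta_i\nu_i)$, and (d)
$\int d(\bar\eta,\eta)\,e^{-\sum_i\bar\eta_i\eta_i}|\eta\rangle\langle\eta| = 1_F$, where $d(\bar\eta,\eta) \equiv \prod_i d\bar\eta_i\,d\eta_i$. Finally, (e) show that
$\langle n|\psi\rangle\langle\psi|n\rangle = \langle\zeta\psi|n\rangle\langle n|\psi\rangle$, where $|n\rangle$ is an $n$-particle state in Fock space while $|\psi\rangle$ is a
coherent state.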

Answer: 

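Making use of the rules of Grassmann algebra,

(a) $$\langle\eta|a_i^\dagger = \langle 0|\exp\Big(-\sum_j a_j\bar\eta_j\Big)a_i^\dagger = \langle 0|\prod_j(1 - a_j\bar\eta_j)\,a_i^\dagger = \langle 0|(1 - a_i\bar\eta_i)\,a_i^\dagger\prod_{j\neq i}(1 - a_j\bar\eta_j)$$

$$= \langle 0|a_i a_i^\dagger\,\bar\eta_i\prod_{j\neq i}(1 - a_j\bar\eta_j) = \langle 0|\prod_j(1 - a_j\bar\eta_j)\,\bar\eta_i = \langle\eta|\bar\eta_i,$$

where we have used $\langle 0|a_i a_i^\dagger = \langle 0|[a_i,a_i^\dagger]_+ = \langle 0|$.

(b) $$a_i^\dagger|\eta\rangle = a_i^\dagger\prod_j(1 - \eta_j a_j^\dagger)|0\rangle = a_i^\dagger(1 - \eta_i a_i^\dagger)\prod_{j\neq i}(1 - \eta_j a_j^\dagger)|0\rangle = -\partial_{\eta_i}|\eta\rangle,$$

where $a_i^\dagger(1 - \eta_i a_i^\dagger) = a_i^\dagger = -\partial_{\eta_i}(1 - \eta_i a_i^\dagger)$. Similarly,

$$\langle\eta|a_i = \langle 0|\prod_{j\neq i}(1 - a_j\bar\eta_j)\,(1 - a_i\bar\eta_i)\,a_i = \partial_{\bar\eta_i}\langle 0|\prod_j(1 - a_j\bar\eta_j) = \partial_{\bar\eta_i}\langle\eta|,$$

where $(1 - a_i\bar\eta_i)a_i = a_i = -\partial_{\bar\eta_i}a_i\bar\eta_i = \partial_{\bar\eta_i}(1 - a_i\bar\eta_i)$.

(c) $$\langle\eta|\nu\rangle = \langle\eta|\prod_i(1 - \nu_i a_i^\dagger)|0\rangle = \langle\eta|\prod_i(1 + \bar\eta_i\nu_i)|0\rangle = \exp\Big(\sum_i\bar\eta_i\nu_i\Big).$$

(d) To prove the completeness of fermion coherent states, we apply Schur’s lemma, i.e. we
need to show that $[a_j^{(\dagger)}, \int d(\bar\eta,\eta)\,e^{-\sum_i\bar\eta_i\eta_i}|\eta\rangle\langle\eta|] = 0$:

$$a_j^\dagger\int d(\bar\eta,\eta)\,e^{-\sum_i\bar\eta_i\eta_i}|\eta\rangle\langle\eta| = -\int d(\bar\eta,\eta)\,e^{-\sum_i\bar\eta_i\eta_i}\,\partial_{\eta_j}|\eta\rangle\langle\eta|$$

$$= \int d(\bar\eta,\eta)\,\big(\partial_{\eta_j}e^{-\sum_i\bar\eta_i\eta_i}\big)|\eta\rangle\langle\eta| = \int d(\bar\eta,\eta)\,e^{-\sum_i\bar\eta_i\eta_i}|\eta\rangle\langle\eta|\,a_j^\dagger,$$

where $\partial_{\eta_j}e^{-\sum_i\bar\eta_i\eta_i} = \bar\eta_j\,e^{-\sum_i\bar\eta_i\eta_i}$, and

$$a_j\int d(\bar\eta,\eta)\,e^{-\sum_i\bar\eta_i\eta_i}|\eta\rangle\langle\eta| = \int d(\bar\eta,\eta)\,e^{-\sum_i\bar\eta_i\eta_i}\,\eta_j|\eta\rangle\langle\eta|$$

$$= \int d(\bar\eta,\eta)\,e^{-\sum_i\bar\eta_i\eta_i}|\eta\rangle\,\partial_{\bar\eta_j}\langle\eta| = \int d(\bar\eta,\eta)\,e^{-\sum_i\bar\eta_i\eta_i}|\eta\rangle\langle\eta|\,a_j.$$

The constant of proportionality is fixed by taking the expectation value with the vacuum,

$$\langle 0|\int d(\bar\eta,\eta)\,e^{-\sum_i\bar\eta_i\eta_i}|\eta\rangle\langle\eta|0\rangle = \int d(\bar\eta,\eta)\,e^{-\sum_i\bar\eta_i\eta_i} = 1.$$

(e) A general $n$-particle state is given as $|n\rangle = a_1^\dagger\cdots a_n^\dagger|0\rangle$, $\langle n| = \langle 0|a_n\cdots a_1$, where we
neglected a normalization factor. The matrix element $\langle n|\psi\rangle$, thus, reads

$$\langle n|\psi\rangle = \langle 0|a_n\cdots a_1|\psi\rangle = \langle 0|\psi_n\cdots\psi_1|\psi\rangle = \psi_n\cdots\psi_1.$$

Similarly, we obtain $\langle\psi|n\rangle = \bar\psi_1\cdots\bar\psi_n$. Using these results,

$$\langle n|\psi\rangle\langle\psi|n\rangle = \psi_n\cdots\psi_1\,\bar\psi_1\cdots\bar\psi_n = \psi_1\bar\psi_1\cdots\psi_n\bar\psi_n$$

$$= (\zeta\bar\psi_1\psi_1)\cdots(\zeta\bar\psi_n\psi_n) = (\zeta\bar\psi_1)\cdots(\zeta\bar\psi_n)\,\psi_n\cdots\psi_1 = \langle\zeta\psi|n\rangle\langle n|\psi\rangle.$$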


Feynman path integral from the functional field integral 

The abstraction of the coherent state representation betrays the close similarity between the Feynman 
and coherent state path integrals. To help elucidate the connection, the goal of the present problem is 
to confirm that the Feynman path integral of the quantum harmonic oscillator follows from the coherent 
state path integral. 

Consider the simplest bosonic many-body Hamiltonian, $\hat H = \hbar\omega(a^\dagger a + \frac{1}{2})$, where $a^\dagger$ creates
“structureless” particles, i.e. states in a one-dimensional Hilbert space. Note that $\hat H$ can be
interpreted as the Hamiltonian of a single oscillator degree of freedom. Show that the field
integral for the partition function $Z = {\rm tr}[\exp(-\beta\hat H)]$ can be mapped onto the (imaginary-
time) path integral of a harmonic oscillator by a suitable variable transformation. (Hint: Let
yourself be guided by the fact that the conjugate operator pair $(a, a^\dagger)$ is related to the
momentum and coordinate operators $(\hat p, \hat q)$ through a canonical transformation.)
Answer:

In the coherent state representation, the quantum partition function of the oscillator
Hamiltonian is expressed in terms of the path integral ($\hbar = 1$):

$$Z = \int D(\bar\phi,\phi)\,\exp\Big[-\int_0^\beta d\tau\,\bar\phi\,(\partial_\tau + \omega)\,\phi\Big], \tag{4.53}$$

where $\phi(\tau)$ denotes a complex scalar field, the constant factor $e^{-\beta\omega/2}$ has been absorbed
into the measure of the functional integral $D(\bar\phi,\phi)$, and we have set the chemical
potential $\mu = 0$. The connection between the coherent state and Feynman integral
is established by the change of field variables, $\phi(\tau) = (m\omega/2)^{1/2}(q(\tau) + ip(\tau)/m\omega)$,
where $p(\tau)$ and $q(\tau)$ represent real fields. Substituting this representation in Eq. (4.53),
and rearranging some terms by integrating by parts, the connection is established:
$Z = \int D(p,q)\,\exp\big[-\int_0^\beta d\tau\,\big(-ip\dot q + \frac{p^2}{2m} + \frac{m\omega^2}{2}q^2\big)\big]$. (Of course, the “absorption” of the
constant $\hbar\omega/2$ in the Hamiltonian into the measure has been a bit of cheating. In operator
quantum mechanics, the correspondence between the $(q,p)$ and the $(a, a^\dagger)$ representation
of the Hamiltonian includes that constant. Keeping in mind that the constant reflects
the non-commutativity of operators, think how it might be recovered within the path
integral formalism. Hint: it is best to consider the mapping between representations within
a time-slice discretized setting.)


Quantum partition function of the harmonic oscillator 

The following involves a practice exercise on elementary field integral manipulations, and infinite prod- 
ucts. 


Compute the partition function of the harmonic oscillator Hamiltonian in the field integral
formulation. To evaluate the resulting infinite product over Matsubara frequencies apply
the formula $x/\sin x = \prod_{n=1}^\infty(1 - x^2/(\pi n)^2)^{-1}$. (Hint: The normalization of the result can
be fixed by requiring that, in the zero-temperature limit, the oscillator occupy its ground
state.) Finally, compute the partition function by elementary means and check your result.
As an additional exercise, repeat the same steps for the “fermionic oscillator,” i.e. with $a, a^\dagger$
fermion operators. Here you will need the auxiliary identity $\cos x = \prod_{n=1}^\infty(1 - x^2/(\pi(n - \frac{1}{2}))^2)$.

Answer: 

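Making use of the Gaussian functional integral for complex fields, one obtains from
Eq. (4.53) ($\hbar = 1$)

$$Z_B \sim \det(\partial_\tau + \omega)^{-1} = \prod_n(-i\omega_n + \omega)^{-1} \sim \prod_{n=1}^\infty\Big[\Big(\frac{2\pi n}{\beta}\Big)^2 + \omega^2\Big]^{-1} \sim \prod_{n=1}^\infty\Big[1 + \Big(\frac{\beta\omega}{2\pi n}\Big)^2\Big]^{-1} \sim \frac{1}{\sinh(\beta\omega/2)}.$$

Now, in the limit of small temperatures, the partition function is dominated by the ground
state, $\lim_{\beta\to\infty}Z_B = \exp[-\beta\omega/2]$, which fixes the constant of proportionality. Thus, $Z_B =
[2\sinh(\beta\omega/2)]^{-1}$.

In the fermionic case, the Gaussian integration gives a product over eigenvalues in the
numerator and we have to use fermionic Matsubara frequencies, $\omega_n = (2n+1)\pi/\beta$:

$$Z_F \sim \det(\partial_\tau + \omega) = \prod_n(-i\omega_n + \omega) \sim \prod_{n=0}^\infty\Big[\Big(\frac{(2n+1)\pi}{\beta}\Big)^2 + \omega^2\Big] \sim \prod_{n=0}^\infty\Big[1 + \Big(\frac{\beta\omega}{(2n+1)\pi}\Big)^2\Big] \sim \cosh(\beta\omega/2).$$

Fixing the normalization, one obtains $Z_F = 2e^{-\beta\omega}\cosh(\beta\omega/2)$. Taken together, these results
are easily confirmed by direct computation, viz.

$$Z_B = e^{-\beta\omega/2}\sum_{n=0}^\infty e^{-n\beta\omega} = \frac{e^{-\beta\omega/2}}{1 - e^{-\beta\omega}} = \frac{1}{2\sinh(\beta\omega/2)},$$

$$Z_F = e^{-\beta\omega/2}\sum_{n=0}^{1}e^{-n\beta\omega} = e^{-\beta\omega/2}(1 + e^{-\beta\omega}) = 2e^{-\beta\omega}\cosh(\beta\omega/2).$$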
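
The closed forms above are easy to check numerically. The Python sketch below assumes $\hbar = 1$ and the oscillator Hamiltonian $\omega(a^\dagger a + \frac{1}{2})$ for both statistics; it compares the direct sums over occupation numbers with the closed forms and also evaluates a truncated version of the bosonic Matsubara product $\prod_{n\ge1}[1 + (\beta\omega/2\pi n)^2]^{-1}$, which should approach $(\beta\omega/2)/\sinh(\beta\omega/2)$. All function names and cutoffs are illustrative choices.

```python
import math


def z_boson_direct(beta, omega, n_max=2000):
    """Truncated sum over occupation numbers n = 0, 1, 2, ..."""
    return sum(math.exp(-beta * omega * (n + 0.5)) for n in range(n_max))


def z_boson_closed(beta, omega):
    return 1.0 / (2.0 * math.sinh(beta * omega / 2.0))


def z_fermion_direct(beta, omega):
    """Only the occupations n = 0 and n = 1 exist for a fermionic mode."""
    return sum(math.exp(-beta * omega * (n + 0.5)) for n in (0, 1))


def z_fermion_closed(beta, omega):
    return 2.0 * math.exp(-beta * omega) * math.cosh(beta * omega / 2.0)


def matsubara_product(beta, omega, n_max=200000):
    """Truncated product over bosonic frequencies n >= 1."""
    x = beta * omega / (2.0 * math.pi)
    log_product = -sum(math.log1p((x / n) ** 2) for n in range(1, n_max + 1))
    return math.exp(log_product)


if __name__ == "__main__":
    beta, omega = 2.0, 1.0
    print(z_boson_direct(beta, omega), z_boson_closed(beta, omega))
    print(z_fermion_direct(beta, omega), z_fermion_closed(beta, omega))
    print(matsubara_product(beta, omega),
          (beta * omega / 2.0) / math.sinh(beta * omega / 2.0))
```

The truncated product converges only algebraically in the cutoff (the neglected tail is of order $(\beta\omega/2\pi)^2/n_{\max}$), which is why the normalization is fixed analytically in the text rather than by brute-force multiplication.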


Boson-fermion duality 

The equivalence of the bosonic and the fermionic representation of the one-dimensional electron gas is 

exemplified by computation of the correlation function Eq. (4.48) considered in the main text. 

(a) Employ the free fermion field integral with action (4.43) to compute the zero- 
temperature limit of the correlation function (4.48) considered in the text (assume 
x > 0). 

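(b) Considering a free scalar bosonic field $\theta$ with action $S[\theta] = \frac{1}{2c}\int dx\,d\tau\,[(\partial_\tau\theta)^2 + (\partial_x\theta)^2]$,
compute the correlation function $K(x,\tau) \equiv \langle\theta(x,\tau)\theta(0,0) - \theta(0,0)\theta(0,0)\rangle$ for $x > 0$.

(c) Compute the correlation function $C(x,\tau) = \gamma^2\langle\exp[2i\theta(x,\tau)]\exp[-2i\theta(0,0)]\rangle$.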

Answer: 


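(a) Setting $v_F = 1$ and defining $Z_\pm \equiv \int D(\bar\psi,\psi)\exp(-S_\pm[\bar\psi,\psi])$, where $S_\pm[\bar\psi,\psi] =
\int dx\,d\tau\,\bar\psi(\partial_\tau \mp i\partial_x)\psi$, we obtain

$$G_\pm(x,\tau) \equiv Z_\pm^{-1}\int D(\bar\psi,\psi)\,\bar\psi(x,\tau)\,\psi(0,0)\,e^{-S_\pm[\bar\psi,\psi]} = -(\partial_\tau \mp i\partial_x)^{-1}_{(0,0;x,\tau)}$$

$$= -\frac{T}{L}\sum_{p,\omega_n}\frac{e^{-ipx - i\omega_n\tau}}{-i\omega_n \mp p}.$$

Assuming for definiteness that $x > 0$ and integrating over momenta, we arrive at

$$G_\pm(x,\tau) = \mp iT\sum_n\Theta(\pm n)\,e^{-\omega_n(\pm x + i\tau)} \simeq \frac{1}{2\pi}\,\frac{1}{\pm ix - \tau},$$

where in the last equality we have approximated the frequency sum by an integral. Thus,
the correlation function (4.48) is given by $C(x,\tau) = G_+(x,\tau)G_-(-x,\tau) = \frac{1}{(2\pi)^2}\frac{1}{x^2+\tau^2}$.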

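(b) Expressed in a frequency/momentum Fourier representation, $S[\theta] = \frac{1}{2c}\sum_{q,n}|\theta_{q,n}|^2(q^2 +
\omega_n^2)$. Performing the Gaussian integral over $\theta$, we obtain

$$K(x,\tau) = \frac{cT}{L}\sum_{q,n}\frac{e^{iqx + i\omega_n\tau} - 1}{q^2 + \omega_n^2} \simeq \frac{cT}{2}\sum_n\frac{e^{-|\omega_n|x + i\omega_n\tau} - 1}{|\omega_n|}$$

$$\simeq \frac{c}{4\pi}\int_0^{a^{-1}}d\omega\,\frac{e^{-\omega(x - i\tau)} - 1}{\omega} + {\rm c.c.} \simeq -\frac{c}{4\pi}\ln\big((x^2 + \tau^2)/a^2\big),$$

where we have approximated the momentum sum by an integral and the frequency sum
by an integral, cut off at large frequencies by $E_F \simeq v_Fa^{-1} = a^{-1}$ (recall that $v_F = 1$).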

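(c) Using the results derived in (b),

$$C(x,\tau) = \gamma^2\big\langle e^{2i(\theta(x,\tau) - \theta(0,0))}\big\rangle = \gamma^2e^{-2\langle(\theta(x,\tau) - \theta(0,0))^2\rangle} = \gamma^2\exp[4K(x,\tau)] = \gamma^2\Big(\frac{a^2}{x^2 + \tau^2}\Big)^{c/\pi}.$$

Setting $c = \pi$ and $\gamma = 1/(2\pi a)$, we obtain equivalence to the fermionic representation of
the correlation function considered in (a).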


Frequency summations 

Using the frequency summation techniques developed in the text, this problem involves the computation 
of two basic correlation functions central to the theory of the interacting Fermi gas. 


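(a) The pair correlation function $\chi^c_{n,\mathbf{q}}$ is an important building block entering the
calculation of the Cooper pair propagator in superconductors (see Section 6.4). It is given
by

$$\chi^c_{n,\mathbf{q}} \equiv -\frac{T}{L^d}\sum_{m,\mathbf{p}}G_0(\mathbf{p},i\omega_m)\,G_0(-\mathbf{p}+\mathbf{q},-i\omega_m+i\omega_n) = \frac{1}{L^d}\sum_{\mathbf{p}}\frac{1 - n_F(\xi_{\mathbf{p}}) - n_F(\xi_{-\mathbf{p}+\mathbf{q}})}{i\omega_n - \xi_{\mathbf{p}} - \xi_{-\mathbf{p}+\mathbf{q}}},$$

where $G_0(\mathbf{p},i\omega_m) = 1/(i\omega_m - \xi_{\mathbf{p}})$. Verify the second equality. (Note that $\omega_m = (2m +
1)\pi/\beta$ are fermionic Matsubara frequencies, while $\omega_n = 2\pi nT$ is a bosonic Matsubara
frequency.)

(b) Another correlation function central to the theory of the interacting Fermi gas (see
Section 5.2), the so-called density-density response function, is given by

$$\chi^d_{\mathbf{q},\omega_n} \equiv -\frac{T}{L^d}\sum_{\mathbf{p},\omega_m}G_0(\mathbf{p},i\omega_m)\,G_0(\mathbf{p}+\mathbf{q},i\omega_m+i\omega_n) = -\frac{1}{L^d}\sum_{\mathbf{p}}\frac{n_F(\xi_{\mathbf{p}}) - n_F(\xi_{\mathbf{p}+\mathbf{q}})}{i\omega_n + \xi_{\mathbf{p}} - \xi_{\mathbf{p}+\mathbf{q}}}.$$

Again verify the second equality.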
Answer:


(a) To evaluate a sum over fermionic frequencies $\omega_m$, we employ the Fermi function
$\beta n_F(z) = \beta(e^{\beta z} + 1)^{-1}$ defined in the left column of Eq. (4.37). Noting that the
function $G_0(\mathbf{p}_1,z)\,G_0(\mathbf{p}_2,-z+i\omega_n)$ has simple poles at $z = \xi_{\mathbf{p}_1}$ and $z = i\omega_n - \xi_{\mathbf{p}_2}$,
and applying Eq. (4.38) (with the identification $S = T\sum_m h(\omega_m)$ and $h = G_0G_0$), we
obtain $S = \frac{n_F(\xi_{\mathbf{p}_1}) - n_F(-\xi_{\mathbf{p}_2} + i\omega_n)}{i\omega_n - \xi_{\mathbf{p}_1} - \xi_{\mathbf{p}_2}}$. Using the fact that $n_F(x + i\omega_n) = n_F(x)$ and
$n_F(-x) = 1 - n_F(x)$, and restoring the overall factor $-1/L^d$, we arrive at the result.

(b) One may proceed as in part (a).
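
The identity of part (a) can also be checked by brute force for a single momentum. The Python sketch below assumes $G_0(\xi, i\omega_m) = 1/(i\omega_m - \xi)$, fermionic $\omega_m = (2m+1)\pi T$ and bosonic $\omega_n = 2\pi nT$; it sums $-T\sum_m G_0(\xi_1, i\omega_m)G_0(\xi_2, -i\omega_m + i\omega_n)$ over a symmetric frequency window and compares with $[1 - n_F(\xi_1) - n_F(\xi_2)]/(i\omega_n - \xi_1 - \xi_2)$. The function names, the cutoff and the sample energies are hypothetical.

```python
import math


def fermi(xi, temperature):
    """Fermi function n_F(xi) with the chemical potential absorbed into xi."""
    return 1.0 / (math.exp(xi / temperature) + 1.0)


def pair_bubble_numeric(xi1, xi2, n, temperature, m_max=100000):
    """Truncated Matsubara sum -T * sum_m G0(xi1, iw_m) G0(xi2, -iw_m + iw_n)."""
    w_n = 2.0 * math.pi * n * temperature            # bosonic frequency
    total = 0.0 + 0.0j
    for m in range(-m_max, m_max):                   # symmetric window
        w_m = (2 * m + 1) * math.pi * temperature    # fermionic frequency
        total += 1.0 / ((1j * w_m - xi1) * (-1j * w_m + 1j * w_n - xi2))
    return -temperature * total


def pair_bubble_closed(xi1, xi2, n, temperature):
    """Closed form; requires i*w_n - xi1 - xi2 != 0."""
    w_n = 2.0 * math.pi * n * temperature
    numerator = 1.0 - fermi(xi1, temperature) - fermi(xi2, temperature)
    return numerator / (1j * w_n - xi1 - xi2)


if __name__ == "__main__":
    args = (0.3, -0.7, 1, 0.5)
    print(pair_bubble_numeric(*args))
    print(pair_bubble_closed(*args))
```

Because the summand decays as $1/\omega_m^2$, the truncated sum converges without a convergence factor; the residual error scales as $1/(2\pi^2Tm_{\max})$.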


Pauli paramagnetism 

There are several mechanisms whereby a Fermi gas subject to an external magnetic field responds to 
the perturbation. One of these, the phenomenon of Pauli paramagnetism, is purely quantum mechanical 
in nature. Its origin lies in the energy balance of spinful fermions rearranging at the Fermi surface in 
response to the field. We explore the resulting contribution to the magnetic susceptibility of the electron 
gas. 

Fermions couple to a magnetic field by their orbital momentum as well as by their spin.
Concentrating on the latter mechanism, consider the Hamiltonian

$$\hat H_z = -\mu_0\mathbf{B}\cdot\hat{\mathbf{S}}, \qquad \hat{\mathbf{S}} = \frac{1}{2}a^\dagger_{\alpha\sigma}\boldsymbol{\sigma}_{\sigma\sigma'}a_{\alpha\sigma'},$$

where $\boldsymbol{\sigma} = (\sigma_x,\sigma_y,\sigma_z)^T$ is a vector of Pauli matrices, $\alpha$ an orbital quantum number, and
$\mu_0 = e/(2m)$ the Bohr magneton. It turns out that the presence of $\hat H_z$ in the energy balance
leads to the generation of a net paramagnetic response of purely quantum mechanical origin.
To understand the origin of the effect, consider a two-fold (spin!) degenerate single-particle
band of free electron states (see the figure). Both bands are filled up to a certain chemical
potential $\mu$. Upon the switching on of an external field, the degeneracy is lifted and the two
bands shift in opposite directions by an amount $\sim\mu_0B$. While, deep in the bands, the Pauli
principle forbids a rearrangement of spin configurations, up at the Fermi energy, $\downarrow$ states can
turn to energetically more favorable $\uparrow$ states. More precisely, for bands shifted by an amount
$\sim\mu_0B$, a number $\sim\mu_0B\rho(\mu)$ of states may change their spin direction, which leads to a total
energy change of $\Delta E \sim -\mu_0^2B^2\rho(\mu)$. Differentiating twice with respect to the magnetic field
gives a positive contribution $\chi \sim -\partial_B^2\Delta E \sim \mu_0^2\rho(\mu)$ to the magnetic susceptibility of the system.

(a) To convert the qualitative estimate above into a quantitative result, write down the
coherent state action of the full Hamiltonian $\hat H = \hat H_0 + \hat H_z$, where $\hat H_0 = \sum_{\alpha,\sigma}a^\dagger_{\alpha\sigma}\epsilon_\alpha a_{\alpha\sigma}$
is the non-magnetic part of the Hamiltonian. Integrate out the Grassmann fields to
obtain the free energy $F$ as a sum over frequencies.

(b) Show that, at low temperatures, the spin contribution to the magnetic susceptibility
$\chi \equiv -\partial_B^2|_{B=0}F$ is given by

$$\chi \overset{T\to0}{\longrightarrow} \frac{\mu_0^2}{2}\,\rho(\mu), \tag{4.54}$$

where $\rho(\epsilon) = \sum_\alpha\delta(\epsilon - \epsilon_\alpha)$ denotes the single-particle density of states. (Hint: It is
convenient to perform the field derivatives prior to the frequency summation.)


Answer: 


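(a) Choosing the quantization axis parallel to the magnetic field, the Hamiltonian assumes
a diagonal form $\hat H = \sum_{\alpha\sigma}a^\dagger_{\alpha\sigma}\big[\epsilon_\alpha - \frac{\mu_0B}{2}(\sigma_z)_{\sigma\sigma}\big]a_{\alpha\sigma}$ and the (frequency representation
of the) action reads $S[\bar\psi,\psi] = \sum_{\alpha\sigma n}\bar\psi_{\alpha\sigma n}\big(-i\omega_n + \xi_\alpha - \frac{\mu_0B}{2}(\sigma_z)_{\sigma\sigma}\big)\psi_{\alpha\sigma n}$. Integrating
over $\psi$, we obtain the partition function $Z = \prod_{\alpha n}\beta^2\big((-i\omega_n + \xi_\alpha)^2 - \frac{1}{4}(\mu_0B)^2\big)$ and
$F = -T\ln Z = -T\sum_{\alpha n}\ln\big[\beta^2\big((-i\omega_n + \xi_\alpha)^2 - \frac{1}{4}(\mu_0B)^2\big)\big]$.

(b) Differentiating the free energy twice with respect to $B$, we obtain $\chi = -\frac{1}{2}\mu_0^2T\sum_{\alpha n}(-i\omega_n +
\xi_\alpha)^{-2}$. Defining $\chi = \sum_{\alpha,n}h_\alpha(\omega_n)$, where $h_\alpha(\omega_n) = -\frac{1}{2}\mu_0^2T(-i\omega_n + \xi_\alpha)^{-2}$, Eq. (4.38) can
be applied to perform the frequency sum. Noting that the function $h(-iz)$ has poles of
second order at $z = \xi_\alpha$, i.e. ${\rm Res}[h(-iz)g(z)]|_{z=\xi_\alpha} \propto g'(\xi_\alpha)$, we obtain

$$\chi = -\frac{\mu_0^2}{2}\sum_\alpha n_F'(\xi_\alpha) = -\frac{\mu_0^2}{2}\int d\epsilon\,\rho(\epsilon)\,n_F'(\epsilon - \mu).$$

At low temperatures, $T \to 0$, the Fermi distribution function approaches a step function,
$n_F(\epsilon) \to \Theta(-\epsilon)$, i.e. $n_F'(\epsilon) = -\delta(\epsilon)$ and our result reduces to Eq. (4.54).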
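
As a hedged numerical cross-check of Eq. (4.54), the Python sketch below evaluates the free energy $F(B) = -T\sum_{\alpha,\sigma}\ln\big(1 + e^{-(\epsilon_\alpha - \mu - \sigma\mu_0B/2)/T}\big)$ for a hypothetical band of equally spaced levels (so that $\rho(\mu) = 1/\text{spacing}$) and extracts $\chi = -\partial_B^2F|_{B=0}$ by a central finite difference. The level spacing, temperature and step size are illustrative assumptions and not part of the problem.

```python
import math


def free_energy(levels, mu, mu0, b_field, temperature):
    """F = -T * sum over levels and spin of ln(1 + exp(-x)), evaluated stably."""
    total = 0.0
    for eps in levels:
        for sigma in (+1, -1):
            x = (eps - mu - 0.5 * sigma * mu0 * b_field) / temperature
            # ln(1 + e^{-x}) = max(-x, 0) + log1p(e^{-|x|})
            total += max(-x, 0.0) + math.log1p(math.exp(-abs(x)))
    return -temperature * total


def spin_susceptibility(levels, mu, mu0, temperature, step=1e-2):
    """chi = -d^2 F / dB^2 at B = 0 via a central second difference."""
    f_plus = free_energy(levels, mu, mu0, +step, temperature)
    f_zero = free_energy(levels, mu, mu0, 0.0, temperature)
    f_minus = free_energy(levels, mu, mu0, -step, temperature)
    return -(f_plus - 2.0 * f_zero + f_minus) / step ** 2


if __name__ == "__main__":
    spacing = 1e-3
    levels = [-2.0 + spacing * i for i in range(4001)]   # flat band on [-2, 2]
    chi = spin_susceptibility(levels, mu=0.0, mu0=1.0, temperature=0.05)
    print(chi, 0.5 * 1.0 ** 2 / spacing)                 # compare with mu0^2 rho / 2
```

The temperature is chosen large compared with the level spacing but small compared with the bandwidth, so that the sum over levels is well approximated by $\int d\epsilon\,\rho(\epsilon)$ while $n_F'$ remains sharply peaked at the chemical potential.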


Electron-phonon coupling 

As follows from the structure of our prototypical condensed matter “master Hamiltonian" (1.1), mobile 
electrons in solids are susceptible to the vibrations of the host ions, the phonons. This coupling mecha- 
nism generates a net attractive interaction between the electrons. Referring for a qualitative discussion 
of this interaction mechanism to page 266 below, it is the purpose of this problem to quantitatively 
explore the profile of the phonon mediated electron-electron interaction. In Section 6.4 we will see that 
this interaction lies at the root of conventional BCS superconductivity. 

Consider the three-dimensional variant of the phonon Hamiltonian (1.34),

$$\hat H_{\rm ph} = \sum_{\mathbf{q},j}\omega_q\,a^\dagger_{\mathbf{q},j}a_{\mathbf{q},j} + {\rm const.},$$

where $\omega_q$ is the phonon dispersion (here assumed to depend only on the modulus of the
momentum, $|\mathbf{q}| = q$) and the index $j = 1, 2, 3$ accounts for the fact that the lattice ions
can oscillate in three directions in space (i.e. there are three linearly independent
oscillator modes[26]). Electrons in the medium sense the induced charge $\rho_{\rm ind} \sim \nabla\cdot\mathbf{P}$, where
$\mathbf{P} \sim \mathbf{u}$ is the polarization generated by the local distortion $\mathbf{u}$ of the lattice ($\mathbf{u}(\mathbf{r})$ is
the three-dimensional generalization of the displacement field $\phi(\mathbf{r})$ considered in
Chapter 1). Expressed in terms of phonon creation and annihilation operators (cf. Eq. (1.32)),
$\mathbf{u}_{\mathbf{q}} = \mathbf{e}_j(a_{\mathbf{q},j} + a^\dagger_{-\mathbf{q},j})/(2m\omega_q)^{1/2}$, where $\mathbf{e}_j$ is the unit vector in the $j$-direction,[26] and we
conclude that the electron-phonon Hamiltonian reads

$$\hat H_{\rm el-ph} = \gamma\int d^dr\,\hat n(\mathbf{r})\,\nabla\cdot\mathbf{u}(\mathbf{r}) = \gamma\sum_{\mathbf{q},j}\frac{iq_j}{(2m\omega_q)^{1/2}}\,\hat n_{\mathbf{q}}\,(a_{\mathbf{q},j} + a^\dagger_{-\mathbf{q},j}).$$

Here, $\hat n_{\mathbf{q}} \equiv \sum_{\mathbf{k}}c^\dagger_{\mathbf{k}+\mathbf{q}}c_{\mathbf{k}}$ denotes the electronic density expressed in terms of fermion creation
and annihilation operators, and the electron spin has been neglected for simplicity.

(a) Formulate the coherent state action of the electron-phonon system. 

(b) Integrate out the phonon fields, and show that an attractive interaction between elec- 
trons is generated. 


Answer: 


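(a) Introducing a Grassmann field $\psi$ (a complex field $\phi$) to represent the electron (phonon)
operators, one obtains the coherent state field integral

$$Z = \int D[\bar\psi,\psi]\int D[\bar\phi,\phi]\;e^{-S_{\rm el}[\bar\psi,\psi] - S_{\rm ph}[\bar\phi,\phi] - S_{\rm el-ph}[\bar\psi,\psi,\bar\phi,\phi]},$$

where

$$S_{\rm ph}[\bar\phi,\phi] = \sum_{q,j}\bar\phi_{q,j}(-i\omega_n + \omega_q)\phi_{q,j},$$

$$S_{\rm el-ph}[\bar\psi,\psi,\bar\phi,\phi] = \gamma\sum_{q,j}\frac{iq_j}{(2m\omega_q)^{1/2}}\,\rho_q\,(\phi_{q,j} + \bar\phi_{-q,j}),$$

$\rho_q = \sum_k\bar\psi_{k+q}\psi_k$, and the electron action need not be specified explicitly. Here we have
adopted a short-hand convention setting $q = (\omega_n, \mathbf{q})$.[27]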


[26] For more details see N. W. Ashcroft and N. D. Mermin, Solid State Physics (Holt-Saunders International,
1983).

[27] Do not confuse the 4-momentum $q$ with the modulus $|\mathbf{q}| = q$.





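(b) We next perform the Gaussian integration over the phonon fields to obtain the effective
electron action

$$S_{\rm eff}[\bar\psi,\psi] = S_{\rm el}[\bar\psi,\psi] - \ln\int D[\bar\phi,\phi]\;e^{-S_{\rm ph}[\bar\phi,\phi] - S_{\rm el-ph}[\bar\psi,\psi,\bar\phi,\phi]}$$

$$= S_{\rm el}[\bar\psi,\psi] - \frac{\gamma^2}{2m}\sum_q\frac{\mathbf{q}^2}{\omega_n^2 + \omega_q^2}\,\rho_q\rho_{-q},$$

up to an inessential constant. Sloppily transforming from Matsubara to real frequencies, $\omega_n \to -i\omega$, we notice that,
for every momentum mode $\mathbf{q}$, the interaction is attractive at low frequencies, $\omega < \omega_q$.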


Disordered quantum wires 

In this problem, we consider a one-dimensional interacting Fermi system - a "quantum wire” - in the 
presence of impurities. Building on the results obtained in Section 4.3, we derive an effective low-energy 
action of this system. (The actual analysis of the large-scale behavior of the disordered quantum wire 
necessitates the application of renormalization group methods and is postponed to Chapter 8.) 

In Sections 2.2 and 4.3 we discussed the physics of interacting fermions in one dimension. 
We saw that, unlike in a Fermi liquid, the fundamental excitations of the system are charge 
(and spin) density waves - collective excitations describing the wave-like propagation of 
spin and charge degrees of freedom, respectively. Going beyond the level of an idealized 
translationally invariant environment, the question we wish to address currently is to what 
extent the propagation of these modes will be hampered by the presence of spatially localized 
imperfections. This problem is of considerable practical relevance. All physical realizations of 
one-dimensional conductive systems - semiconductor quantum wires, conducting polymers, 
carbon nanotubes, quantum Hall edges, etc. - generally contain imperfections. Further, and 
unlike systems of higher dimensionality, a spin/charge degree of freedom propagating down 
a one-dimensional channel will inevitably hit any impurity blocking its way. We thus expect 
that impurity scattering has a stronger impact on the transport coefficients than in higher 
dimensions. 

However, there is a second and less obvious mechanism behind the strong impact of
disorder scattering on the conduction behavior of one-dimensional quantum wires: imagine a
wavepacket of characteristic momentum $k_F$ colliding with an impurity at position $x = 0$ (see
figure). The total wave amplitude to the left of the impurity, $\psi(x) \sim \exp(ik_Fx) + r\exp(-ik_Fx)$,
will be a linear superposition of the incoming amplitude $\sim\exp(ik_Fx)$ and the reflected
outgoing amplitude $\sim r\exp(-ik_Fx)$, where $r$ is the reflection coefficient. Thus, the electronic
density profile is given by $\rho(x) = |\psi(x)|^2 \sim 1 + |r|^2 + 2\,{\rm Re}(re^{-2ik_Fx})$, which contains an
oscillatory contribution known as a Friedel oscillation. Moreover, a closer analysis (see
exercise below) shows that, in one dimension, the amplitude of these oscillations decays
rather slowly, varying as $\sim|x|^{-1}$. The key point is that, in the presence of electron-electron
interactions, other particles approaching the impurity will notice, not only the impurity
itself, but also the charged density pattern of the Friedel oscillation. The additional scatter- 
ing potential then creates a secondary Friedel oscillation, etc. We thus expect that even a 
weak imperfection in a Luttinger liquid acts as a “catalyst” for the recursive accumulation 
of a strong potential. In this problem, we will derive the effective low-energy action describ- 
ing the interplay of interaction and impurity scattering. The actual catalytic amplification 
mechanism outlined above is then explored in Chapter 8 by renormalization group methods. 

EXERCISE To explore the Friedel oscillatory response of the one-dimensional electron
gas to a local perturbation, consider the connected density-density correlation function

$$\Pi(x,t) = \langle\hat\rho(x,t)\hat\rho(0,0)\rangle - \langle\hat\rho(x,t)\rangle\langle\hat\rho(0,0)\rangle,$$

where $\langle\cdots\rangle$ denotes the ground state expectation value, $\hat\rho = a^\dagger a$ and $a(x) = e^{ik_Fx}a_+(x) +
e^{-ik_Fx}a_-(x)$ splits into a left- and a right-moving part as usual. Using the fact that $\hat H =
\sum_{q,s}v_F(p_F + sq)\,a^\dagger_{sq}a_{sq}$ and the von Neumann equation $\dot a_{sq} = i[\hat H, a_{sq}]$, show that the
time-dependence of the annihilation operators is given by $a_{sq}(t) = e^{-iv_F(p_F + sq)t}a_{sq}$. Use this result,
the canonical operator commutation relations, and the ground state property $a_{\pm,q}|\Omega\rangle = 0$ for
$\pm q > 0$ to show that

$$\Pi(x,t) = \frac{1}{4\pi^2}\Big(\frac{1}{(x - v_Ft)^2} + \frac{1}{(x + v_Ft)^2} + \frac{2\cos(2p_Fx)}{x^2 - (v_Ft)^2}\Big).$$

Use this result to argue why the static response to an impurity potential decays as $\sim|x|^{-1}$.


Consider the one-dimensional quantum wire, as described by the actions Eq. (4.43) and
(4.44). Further, assume that, at $x = 0$, the system contains an imperfection or impurity.
Within the effective action approach, this is described by

$$S_{\rm imp}[\bar\psi,\psi] = \int d\tau\,\big[v_+\bar\psi_+\psi_+ + v_-\bar\psi_-\psi_- + v\,\bar\psi_+\psi_- + \bar v\,\bar\psi_-\psi_+\big],$$

where all field amplitudes are evaluated at $x = 0$ and the constants $v_\pm \in \mathbb{R}$ and $v \in \mathbb{C}$
describe the amplitudes of forward and backward scattering, respectively.


(a) Show that the forward scattering contributions can be removed by a gauge transfor- 
mation. This demonstrates that forward scattering is inessential as long as only gauge 
invariant observables are considered. What is the reason for the insignificance of forward 
scattering? 


We next reformulate the problem in a bosonic language. While the clean system is
described by Eq. (4.51), substitution of (4.46) into the impurity action gives $S_{\rm imp}[\theta] =
\gamma\int d\tau\,\cos(2\theta(\tau))$, where $\gamma = 2v\Gamma^2$ and we have assumed the backward scattering amplitude
to be real. (Any phase carried by the scattering amplitude can be removed by a global
gauge transformation of the fields $\psi_\pm$. How?) Notice the independence of $S_{\rm imp}$ on the field $\phi$.

(b) Integrate out the Gaussian field $\phi$ to obtain the Lagrangian formulation of the action,

$$S[\theta] = \frac{1}{2\pi g}\int dx\,d\tau\,\big[v(\partial_x\theta)^2 + v^{-1}(\partial_\tau\theta)^2\big] + S_{\rm imp}[\theta].$$

The formulation of the problem derived in (b) still contains redundancy. The point is that,
everywhere except for $x = 0$, the action is Gaussian. This observation suggests that one may
integrate out all field degrees of freedom $\theta(x \neq 0)$, thus reducing the problem to one that is
local in space (though, as we shall see, non-local in time). To this end, we reformulate the
field integral as $Z = \int D\tilde\theta\,\exp(-\tilde S[\tilde\theta])$, where

$$\exp(-\tilde S[\tilde\theta]) \equiv \int D\theta\,\prod_\tau\delta\big(\tilde\theta(\tau) - \theta(0,\tau)\big)\exp(-S[\theta])$$

is the action integrated over all field amplitudes save for $\theta(0,\tau)$, and $\prod_\tau\delta(\tilde\theta(\tau) - \theta(0,\tau))$ is a
product of $\delta$-functions (one for each time slice) imposing the constraints $\theta(0,\tau) = \tilde\theta(\tau)$. We
next represent these $\delta$-functions as $\delta(\tilde\theta - \theta(0,\tau)) = \frac{1}{2\pi}\int dk(\tau)\,\exp(ik(\tau)(\tilde\theta(\tau) - \theta(0,\tau)))$ to
obtain

$$\exp(-\tilde S[\tilde\theta]) = \int D\theta\,Dk\,\exp\Big[-S[\theta] + i\int d\tau\,k(\tau)\big(\tilde\theta(\tau) - \theta(0,\tau)\big)\Big]$$

$$= \int D\theta\,Dk\,\exp\Big\{-\int dx\,d\tau\,\Big[\frac{1}{2\pi g}\big(v(\partial_x\theta)^2 + v^{-1}(\partial_\tau\theta)^2\big) + \big(\gamma\cos(2\tilde\theta) + ik(\theta - \tilde\theta)\big)\delta(x)\Big]\Big\}.$$

The advantage gained by this representation is that it permits us to replace $\cos(2\theta(0,\tau)) \to
\cos(2\tilde\theta(\tau))$, whereupon the $\theta$-dependence of the action becomes purely quadratic.



(c) Integrate out the field $\theta(x,\tau)$ to obtain the representation $Z = \int D\theta\,e^{-S_{\rm eff}[\theta]}$, where

$$S_{\rm eff}[\theta] = \frac{1}{\pi Tg}\sum_n\theta_n|\omega_n|\theta_{-n} + \gamma\int d\tau\,\cos(2\theta(\tau)), \tag{4.55}$$

entirely in terms of a single time-dependent degree of freedom $\theta(\tau)$.

Notice that the entire effect of the bulk of the electron gas at $x \neq 0$ went into the first,
dissipative term. We have, thus, reduced the problem to one involving a single time-dependent
degree of freedom subject to a dissipative damping mechanism and a periodic potential (cf. our
discussion of this problem in Problem 3.5 above).

INFO To understand the physical origin of the dissipative damping mechanism notice
that, in the absence of the impurity, the system is described by a set of harmonic oscillators. We
can thus think of the degree of freedom $\theta(0,\tau)$ as the coordinate of a “bead” embedded into an
infinitely extended harmonic chain. From the point of view of this bead, the neighboring degrees
of freedom hamper its free kinematic motion, i.e., in order to move, the bead has to drag an
entire “string” of oscillators behind. In other words, a local excitation of the $x = 0$ oscillator will
lead to the dissipation of kinetic energy into the continuum of neighboring oscillators. Clearly,
the rate of dissipation will increase with both the stiffness of the oscillator chain ($g^{-1}$) and the
frequency of the excitation ($\omega_n$), as described by the first term in the effective action Eq. (4.55).

Answer: 

(a) Consider the gauge transformation $\psi_+(x,\tau) \to e^{-iv_F^{-1}v_+\Theta(x)}\psi_+(x,\tau)$. While $S_{\rm int}$ and $S_{\rm imp}$ are
gauge invariant and do not change, substitution of the transformed field into the
non-interacting action leads to

$$S_0[\bar\psi,\psi] \to S_0[\bar\psi,\psi] - v_+\int d\tau\,\bar\psi_+\psi_+.$$

The induced term cancels against the $v_+$ contribution to $S_{\rm imp}$. A similar transformation
removes the $v_-$ contribution. The physical reason for the insignificance of the forward
scattering operators is that they describe the scattering of states $|\pm k_F\rangle$ into the same
states $|\pm k_F\rangle$. The optional phase shift picked up in these processes is removed by the
transformation above.

(b) This involves an elementary Gaussian integral. 

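(c) Expressed in momentum space, the effective action assumes the form

\[
\begin{aligned}
e^{-S_{\rm eff}[\tilde\theta]} &= \int D\theta\, Dk\, \exp\Big[-\frac{L}{2\pi g T}\sum_{q,n}\big(vq^2 + v^{-1}\omega_n^2\big)|\theta_{q,n}|^2 + i\sum_{q,n} k_n \theta_{q,-n} - i\sum_n k_n \tilde\theta_{-n} - S_{\rm imp}[\tilde\theta]\Big] \\
&= \mathcal{N}\int Dk\, \exp\Big[-\frac{\pi g T}{2L}\sum_{q,\omega_n} k_n \big(vq^2 + v^{-1}\omega_n^2\big)^{-1} k_{-n} - i\sum_n k_n \tilde\theta_{-n} - S_{\rm imp}[\tilde\theta]\Big] \\
&= \mathcal{N}\int Dk\, \exp\Big[-\frac{\pi T g}{4}\sum_n k_n |\omega_n|^{-1} k_{-n} - i\sum_n k_n \tilde\theta_{-n} - S_{\rm imp}[\tilde\theta]\Big] \\
&= \mathcal{N}\exp\Big[-\frac{1}{\pi T g}\sum_n \tilde\theta_n |\omega_n| \tilde\theta_{-n} - S_{\rm imp}[\tilde\theta]\Big].
\end{aligned}
\]

Denoting $\tilde\theta(\tau)$ by $\theta(\tau)$, we obtain the effective action Eq. (4.55).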
In this chapter, we introduce the analytical machinery to investigate the properties of many-body systems
perturbatively. Specifically, employing the “$\phi^4$-theory” as an example, we learn how to describe systems
that are not too far from a known reference state by perturbative means. Diagrammatic methods are
introduced as a tool to efficiently implement perturbation theory at large orders. The new concepts are
then applied to the analysis of various properties of the weakly interacting electron gas.


In previous chapters we have emphasized repeatedly that the majority of many-particle 
problems cannot be solved in closed form. Therefore, in general, one is compelled to 
think about approximation strategies. One promising ansatz leans on the fact that, when 
approaching the low-temperature physics of a many-particle system, we often have some 
idea, however vague, of its preferred states and/or its low-energy excitations. One may 
then set out to explore the system by using these prospective ground state configurations 
as a working platform. For example, one might expand the Hamiltonian in the vicinity 
of the reference state and check that, indeed, the residual “perturbations” acting in the 
low-energy sector of the Hilbert space are weak and can be dealt with by some kind of 
approximate expansion. Consider, for example, the quantum Heisenberg magnet. In dimen- 
sions higher than one, an exact solution of this system is out of the question. However, 
we know (or, more conservatively, “expect”) that, at zero temperature, the spins will be 
frozen into configurations aligned along some (domain-wise) constant magnetisation axes. 
Residual fluctuations around these configurations, described by the Holstein-Primakoff 
boson excitations, or spin waves, discussed before can be described in terms of a controlled 
expansion scheme. Similar programs work for countless other physical systems. 

These considerations dictate much of our further strategy. We will need to construct 
methods to identify and describe the lowest-energy configurations of many-particle systems - 
often called “mean-fields” - and learn how to implement perturbation theory around them. 
In essence, the first part of that program amounts to solving a variational problem, a 
relatively straightforward task. However, the formulation of perturbation strategies requires 
some preparation and, equally important, a good deal of critical caution (because many 
systems notoriously defy perturbative assaults - a fact easily overlooked or misjudged!). We 
thus turn the logical sequence of the two steps upside down and devote this chapter to an 
introduction to many-body perturbation theory. This will include a number of applications, 
i.e. problems where the mean-field is trivial and perturbation theory on its own suffices to 
produce meaningful results. Perturbation theory superimposed on non-trivial mean-fields
will then be the subject of the next chapter. 


5.1 General structures and low-order expansions 

As with any other perturbative approach, many-body perturbation theory amounts to an 
expansion of observables in powers of some parameter - typically the coupling strength of 
an interaction operator. However, before discussion of how this program is implemented in 
practice, it is imperative to develop some understanding of the mathematical status of such 
“series expansions.” (To motivate the point: it may, and often does, happen that the infinite- 
order expansion in the “small parameter” of the problem does not exist in a mathematical 
sense!) This can be achieved by considering the following. 


An instructive integral 


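Consider the integral

\[
I(g) = \int_{-\infty}^{\infty} \frac{dx}{\sqrt{2\pi}}\, \exp\left[-\frac{x^2}{2} - g x^4\right]. \tag{5.1}
\]

This can be regarded as a caricature of a particle subject to some harmonic potential
($x^2$) together with an “interaction” ($x^4$). For small $g \ll 1$, it seems natural to treat the
interaction perturbatively, i.e. to develop the expansion $I(g) \approx \sum_n g^n I_n$, where, applying
Stirling’s approximation, $n! \overset{n \gg 1}{\sim} n^n e^{-n}$,

\[
g^n I_n = \frac{(-g)^n}{n!} \int_{-\infty}^{\infty} \frac{dx}{\sqrt{2\pi}}\, e^{-x^2/2}\, x^{4n} = (-g)^n\, \frac{(4n-1)!!}{n!} \overset{n \gg 1}{\sim} \left(-\frac{gn}{e}\right)^n .
\]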

This estimate should alarm us: strictly speaking, it states that a series expansion in the 
“small parameter” $g$ does not exist. No matter how small $g$, at roughly the $(1/g)$th order
in the perturbative expansion the series begins to diverge. In fact, it is easy to predict 
this breakdown on qualitative grounds: for g > 0 (g < 0), the integral (5.1) is convergent 
(divergent). This implies that the series expansion of the function 1(g) around g = 0 must 
have zero radius of convergence. 

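However, there is also a more “physical” way of understanding the phenomenon. Consider
a one-dimensional version of Eq. (3.16), where the “Gaussian average” is given by Eq. (3.15):

\[
\int_{-\infty}^{\infty} \frac{dx}{\sqrt{2\pi}}\, e^{-x^2/2}\, x^{4n} = \sum_{\substack{\text{all possible pairings} \\ \text{of } 4n \text{ objects}}} 1 = (4n-1)!!.
\]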



The factor (4n — 1)!! measures the combinatorial freedom to pair up 4n objects. This sug- 
gests an interpretation of the breakdown of the perturbative expansion as the result of a 
competition between the smallness of the expansion parameter g and the combinatorial 
proliferation of equivalent contributions, or “pairings,” to the Gaussian integral. Physically, 
the combinatorial factor can be interpreted as the number of different “partial amplitudes” 
contributing to the net result at any given order of perturbation theory. Eventually, the 
factorial growth of this figure overpowers the smallness of the expansion parameter,
which is when perturbation theory breaks down. (Oddly, the existence of this rather general
mechanism is usually not mentioned in textbook treatments of quantum perturbation
theory!)

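Does the ill-convergence of the series imply that perturbative approaches to problems
of the structure of Eq. (5.1) are doomed to fail? Fortunately, this is not the case. While
the infinite series $\sum_{n=0}^{\infty} g^n I_n$ is divergent, a partial resummation $\sum_{n=0}^{n_{\max}} g^n I_n$ can yield
excellent approximations to the exact result $I(g)$. To see this, let us use the fact that
$\left|e^{-gx^4} - \sum_{n=0}^{n_{\max}} \frac{(-gx^4)^n}{n!}\right| \le \frac{(gx^4)^{n_{\max}+1}}{(n_{\max}+1)!}$ to estimate the error

\[
\left| I(g) - \sum_{n=0}^{n_{\max}} g^n I_n \right| \le g^{n_{\max}+1}\, |I_{n_{\max}+1}| \overset{n_{\max} \gg 1}{\sim} \left(\frac{g\, n_{\max}}{e}\right)^{n_{\max}} .
\]

Variation with respect to $n_{\max}$ shows that the error reaches its minimum when $n_{\max} \sim g^{-1}$,
where it scales like $e^{-1/g}$. (Notice the exponential dependence of the error on the coupling
$g$. Taken at face value, the estimate implies that for a small coupling $g \approx 0.01$, the 100th order
of the perturbation theory would lead to an approximation of astronomic absolute precision
$e^{-100}$; numerical factors dropped in the Stirling estimate reduce the optimal order and the
attainable precision, but not the exponential dependence on $1/g$.) By contrast, for $g \approx 0.3$,
perturbation theory becomes poor after the third order at the latest!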
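
The asymptotic character of the series can be made tangible numerically. The following is a minimal sketch, not part of the original text: it evaluates $I(g)$ of Eq. (5.1) by Simpson's rule and compares it with partial sums of the exact terms $g^n I_n = (-g)^n (4n-1)!!/n!$. The function names and the integration parameters (`half_width`, `steps`) are illustrative choices. Keeping the numerical factor that the rough Stirling estimate drops, the ratio of successive terms is $g(4n+1)(4n+3)/(n+1) \approx 16gn$, so the smallest term should be expected near $n \approx 1/(16g)$ rather than at $n \approx 1/g$.

```python
import math


def exact_integral(g, half_width=12.0, steps=24000):
    """Simpson-rule estimate of I(g) = int dx/sqrt(2 pi) exp(-x^2/2 - g x^4)."""
    h = 2.0 * half_width / steps
    total = 0.0
    for k in range(steps + 1):
        x = -half_width + k * h
        weight = 1 if k in (0, steps) else (4 if k % 2 else 2)
        total += weight * math.exp(-0.5 * x * x - g * x ** 4)
    return total * h / 3.0 / math.sqrt(2.0 * math.pi)


def series_term(g, n):
    """n-th term of the expansion, g^n I_n = (-g)^n (4n-1)!! / n!."""
    double_factorial = 1.0
    for k in range(1, 4 * n, 2):  # 1 * 3 * 5 * ... * (4n - 1)
        double_factorial *= k
    return (-g) ** n * double_factorial / math.factorial(n)


def truncation_errors(g, n_max):
    """Absolute error of the partial sums truncated at order 0, 1, ..., n_max."""
    exact = exact_integral(g)
    partial_sum, errors = 0.0, []
    for n in range(n_max + 1):
        partial_sum += series_term(g, n)
        errors.append(abs(partial_sum - exact))
    return errors


errors = truncation_errors(0.01, 25)
best_order = min(range(len(errors)), key=errors.__getitem__)
print("best truncation order:", best_order, "error:", errors[best_order])
print("error at order 25:", errors[25])
```

Running the sketch for other couplings shows the same pattern: the error first shrinks with the order, passes through a minimum, and then grows without bound.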

Summarizing, the moral to be taken from the analysis of the integral (5.1) (and its 
generalizations to theories of a more complex structure) is that perturbative expansions 
should not be confused with rigorous Taylor expansions. Rather they represent asymptotic 
expansions, in the sense that, for weaker and weaker coupling, a partial resummation of 
the perturbation series leads to an ever more precise approximation to the exact result. For 
weak enough coupling the distinction between Taylor expansion and asymptotic expansion 
becomes academic (at least for physicists). However, for intermediate or strong coupling 
theories, the asymptotic character of perturbation theory must be kept in mind. 


$\phi^4$-theory

While the ordinary integral discussed in the previous section conveyed something of the 
general status of perturbation theory, we need to proceed to the level of functional integrals 
to learn more about the practical implementation of perturbative methods. The simplest 
interacting field theory displaying all relevant structures is defined through the field integral 

\[
Z = \int D\phi\, e^{-S[\phi]}, \qquad S[\phi] = \int d^dx \left( \frac{1}{2}(\partial\phi)^2 + \frac{r}{2}\phi^2 + g\phi^4 \right), \tag{5.2}
\]

where $\phi$ is a scalar bosonic field. Owing to the structure of the interaction, this model is
often referred to as the $\phi^4$-theory. The $\phi^4$-model not only provides a prototypical
environment in which features of interacting field theories can be explored, but also appears
in numerous applications. For example, close to its critical point, the $d$-dimensional Ising
model is described by the $\phi^4$-action (see Info below). More generally, it can be shown that
the long-range behavior of classical statistical systems with a single order parameter (e.g.
the density of a fluid, uniaxial magnetization, etc.) is described by the $\phi^4$-action.$^1$ Within
the context of statistical mechanics, $S[\phi]$ is known as the Ginzburg-Landau free energy
functional (and less frequently also as the Landau-Wilson model).

INFO The $d$-dimensional Ising model describes the classical magnetism of a lattice of magnetic
moments $S_i \in \{1, -1\}$ that can take only two values $\pm 1$. It is defined through the Hamiltonian

\[
H_{\rm Ising} = \sum_{ij} S_i C_{ij} S_j - H \sum_i S_i, \tag{5.3}
\]

where $C_{ij} = C(|i - j|)$ is a (translationally invariant) correlation matrix describing the mutual
interaction of the spins, $H$ is an external magnetic field, and the sums run over the sites of a
$d$-dimensional lattice (assumed hypercubic for simplicity). The Ising model represents the simplest
Hamiltonian describing classical magnetism. In low dimensions $d = 1, 2$, it can be solved exactly,
i.e. the partition function and all observables depending on it can be computed rigorously (see
our discussion in Chapter 8). However, for higher dimensions, no closed solutions exist and one
has to resort to approximation strategies to analyze the properties of the partition function.
Below we will show that the long-range physics of the system is described by $\phi^4$-theory. Notice
that (save for the exceptional case $d = 1$ discussed in Section 8.1) the system is expected to
display a magnetic phase transition. As a corollary this implies that the $\phi^4$-model must exhibit
much more interesting behavior than its innocent appearance suggests!

Consider the classical partition function

\[
Z = \sum_{\{S_i\}} e^{\sum_{ij} S_i K_{ij} S_j + \sum_i h_i S_i}, \tag{5.4}
\]

where $K \equiv -\beta C$, and $h_i \equiv \beta H_i$, and we have generalized Eq. (5.3) to the case of a spatially
varying magnetic field, $H_i$. The feature that prevents us from rigorously computing the
configurational sum is, of course, the interaction between the spins. However, at a price, the interaction
can be removed: let us consider the “fat unity,” $1 = \mathcal{N} \int D\psi\, e^{-\frac{1}{4}\sum_{ij} \psi_i (K^{-1})_{ij} \psi_j}$, where
$D\psi \equiv \prod_i d\psi_i$, $K^{-1}$ is the inverse of the correlation matrix, and $\mathcal{N} = 1/\sqrt{\det(4\pi K)}$ is a factor
normalizing the integral to unity. A shift of the integration variables, $\psi_i \to \psi_i - 2\sum_j K_{ij} S_j$, brings the
integral into the form

\[
1 = \mathcal{N} \int D\psi\, e^{-\frac{1}{4}\sum_{ij} \psi_i (K^{-1})_{ij} \psi_j + \sum_i S_i \psi_i - \sum_{ij} S_i K_{ij} S_j}.
\]

Incorporating the fat unity under the spin sum in the partition function, one obtains

\[
Z = \mathcal{N} \int D\psi \sum_{\{S_i\}} e^{-\frac{1}{4}\sum_{ij} \psi_i (K^{-1})_{ij} \psi_j + \sum_i S_i (\psi_i + h_i)}. \tag{5.5}
\]

Thus, we have removed the interaction between the spin variables at the expense of introducing a
new continuous field $\{\psi_i\}$. Why should one do this? A multi-dimensional integral $\int D\psi$ is usually
easier to work with than a multi-dimensional sum $\sum_{\{S_i\}}$ over discrete objects. Moreover, the
new representation may provide a more convenient platform for approximation strategies. The


$^1$ Heuristically, this is explained by the fact that $S[\phi]$ is the simplest interacting (i.e. non-Gaussian) model action
invariant under inversion $\phi \to -\phi$. (The action of a uniaxial magnet should depend on the value of the
local magnetization, but not on its sign.) A purely Gaussian theory might describe wave-like fluctuations of
the magnetization, but not the “critical” phenomenon of a magnetic transition. One thus needs, at least, a
$\phi^4$-interaction term. Later on we will see that more complex monomials of $\phi$, such as $\phi^6$ or $(\partial\phi)^4$, are inessential
in the long-range limit.
transformation leading from Eq. (5.4) to (5.5) is our first example of a Hubbard-Stratonovich
transformation. The interaction of one field is decoupled at the expense of the introduction of
another. Notice that, in spite of the somewhat high-minded designation, the transformation is
tantamount to a simple shift of a Gaussian integration variable, a feature shared by all
Hubbard-Stratonovich transformations!
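
The decoupling can be checked by hand in the simplest conceivable setting. The sketch below is an illustration under stated assumptions, not part of the original derivation: it takes a single spin, so that the matrix $K$ reduces to a positive number, and compares the direct sum $\sum_{S=\pm 1} e^{K S^2 + hS}$ with the decoupled form $\mathcal{N} \int d\psi\, e^{-\psi^2/(4K)}\, 2\cosh(\psi + h)$, $\mathcal{N} = 1/\sqrt{4\pi K}$. The function names and the Simpson-rule parameters are hypothetical choices.

```python
import math


def spin_sum(K, h):
    """Direct sum over S = +1, -1 of exp(K S^2 + h S)."""
    return sum(math.exp(K * S * S + h * S) for S in (1, -1))


def decoupled_integral(K, h, steps=20000):
    """Single-site Hubbard-Stratonovich form, assuming K > 0.

    Evaluates N * int dpsi exp(-psi^2 / (4K)) * 2 cosh(psi + h),
    with N = 1 / sqrt(4 pi K), by Simpson's rule.
    """
    # The integrand is a pair of Gaussians centred at psi = +2K and -2K
    # with width sqrt(2K); the window below covers both generously.
    half_width = 2.0 * K + 12.0 * math.sqrt(2.0 * K)
    step = 2.0 * half_width / steps
    total = 0.0
    for k in range(steps + 1):
        psi = -half_width + k * step
        weight = 1 if k in (0, steps) else (4 if k % 2 else 2)
        total += weight * math.exp(-psi * psi / (4.0 * K)) * 2.0 * math.cosh(psi + h)
    return total * step / 3.0 / math.sqrt(4.0 * math.pi * K)


K, h = 0.7, 0.3
print("direct spin sum    :", spin_sum(K, h))
print("decoupled integral :", decoupled_integral(K, h))
```

For a single site both expressions reduce to $2 e^{K} \cosh h$, which is just the statement that completing the square in $\psi$ reproduces the spin-spin term.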

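The summation $\sum_{\{S_i\}}$ can now be trivially performed:

\[
\begin{aligned}
Z &= \mathcal{N} \int D\psi\, e^{-\frac{1}{4}\sum_{ij} \psi_i (K^{-1})_{ij} \psi_j} \prod_i \big(2\cosh(\psi_i + h_i)\big) \\
&= \mathcal{N} \int D\psi\, e^{-\frac{1}{4}\sum_{ij} (\psi_i - h_i)(K^{-1})_{ij}(\psi_j - h_j) + \sum_i \ln\cosh\psi_i},
\end{aligned}
\]

where we have absorbed the inessential factor $\prod_i 2$ into a redefinition of the normalization
$\mathcal{N}$. Finally, changing integration variables from $\psi_i$ to $\phi_i \equiv \frac{1}{2}\sum_j (K^{-1})_{ij}\psi_j$, one arrives at the
intermediate result

\[
Z = \mathcal{N} \int D\phi\, e^{-\sum_{ij} \phi_i K_{ij} \phi_j + \sum_i \phi_i h_i + \sum_i \ln\cosh\left(2\sum_j K_{ij}\phi_j\right)}.
\]

This representation of the problem still does not look very inviting. To bring it into a form
amenable to further analytical evaluation, we need to make the simplifying assumption that we
are working at low temperatures such that the exponential weight $K_{ij} = \beta C(|i - j|)$ inhibits
strong fluctuations of the field $\phi$. More precisely, we assume that $|\phi_i| \ll 1$ and that the spatial
profile of the field is smooth. To make use of these conditions, we switch to a Fourier
representation, $\phi_i = \frac{1}{\sqrt{N}}\sum_{\mathbf{k}} e^{-i\mathbf{k}\cdot\mathbf{r}_i}\phi(\mathbf{k})$, $K_{ij} = \frac{1}{N}\sum_{\mathbf{k}} e^{-i\mathbf{k}\cdot(\mathbf{r}_i - \mathbf{r}_j)} K(\mathbf{k})$, and expand
$\ln\cosh(x) = \frac{1}{2}x^2 - \frac{1}{12}x^4 + \cdots$. Noting that
$(K\phi)(\mathbf{k}) = K(\mathbf{k})\phi(\mathbf{k}) = K(0)\phi(\mathbf{k}) + \frac{1}{2}\mathbf{k}^2 K''(0)\phi(\mathbf{k}) + O(\mathbf{k}^4)$, we
conclude that the low-temperature expansion of the action has the general structure

\[
S[\phi] = \sum_{\mathbf{k}} \left[ \phi_{\mathbf{k}} (c_1 + c_2\, \mathbf{k}\cdot\mathbf{k}) \phi_{-\mathbf{k}} + c_3\, \phi_{\mathbf{k}} h_{-\mathbf{k}} \right]
+ \frac{c_4}{N} \sum_{\mathbf{k}_1, \ldots, \mathbf{k}_4} \phi_{\mathbf{k}_1}\phi_{\mathbf{k}_2}\phi_{\mathbf{k}_3}\phi_{\mathbf{k}_4}\, \delta_{\mathbf{k}_1 + \mathbf{k}_2 + \mathbf{k}_3 + \mathbf{k}_4, 0} + O(\mathbf{k}^4, h^2, \phi^6).
\]

EXERCISE Show that the coefficients $c_i$ are given by $c_1 = K(0)(1 - 2K(0))$, $c_2 = \frac{1}{2}K''(0)(1 - 4K(0))$,
$c_3 = -1$, $c_4 = 4K(0)^4/3$.

Switching back to a real space representation and taking a continuum limit, $S[\phi]$ assumes the
form of a prototypical $\phi^4$-action

\[
S[\phi] = \int d^dx \left( c_2 (\partial\phi)^2 + c_1 \phi^2 + c_3\, \phi h + c_4 \phi^4 \right).
\]

A rescaling of variables $\phi \to \phi/\sqrt{2c_2}$ finally brings the action into the form Eq. (5.2) with
coefficients $r = c_1/c_2$ and $g = c_4/(2c_2)^2$.$^2$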

We have thus succeeded in describing the low-temperature phase of the Ising model in terms
of a $\phi^4$-model. While the structure of the action could have been guessed on symmetry grounds,
the “microscopic” derivation has the advantage that it yields explicit expressions for the coupling
constants. There is actually one interesting aspect of the dependence of these constants on the


$^2$ The only difference is that the magnetic $\phi^4$-action contains a term linear in $\phi$ and $h$. The reason is that, in the
presence of a finite magnetic field, the action is no longer invariant under inversion $\phi \to -\phi$.
parameters of the microscopic model. Consider the constant $c_1$ controlling the $\mathbf{k}$-independent
contribution to the Gaussian action: $c_1 \propto K(0)(1 - 2K(0)) \propto (1 - 2\beta C(0))$. Since $C(0)$ must be
positive to ensure the overall stability of the model (exercise: why?) the constant $c_1$ will change
sign at a certain critical inverse temperature $\beta^*$. For temperatures below the corresponding critical
temperature (i.e. $\beta > \beta^*$), the Gaussian action is unstable (i.e. fluctuations with low wavevector
become unbound) and the anharmonic term $\phi^4$ alone controls the stability of the model. Clearly, the
behavior of the system will change drastically at this point. Indeed, the critical temperature defined by
$c_1(\beta^*) = 0$ marks the position of the magnetic phase transition, a point to be discussed in more detail below.


Let us begin our primer of perturbation theory by introducing some nomenclature.$^3$ For
simplicity, let us first define the notation

\[
\langle \cdots \rangle \equiv \frac{\int D\phi\, e^{-S[\phi]} (\cdots)}{\int D\phi\, e^{-S[\phi]}} \tag{5.6}
\]

for the functional integral, weighted by the action $S$, of any expression $(\cdots)$. Due to the
structural similarity to thermal averages of statistical mechanics, $\langle \cdots \rangle$ is sometimes called
a functional average or functional expectation value. Similarly, let us define

\[
\langle \cdots \rangle_0 \equiv \frac{\int D\phi\, e^{-S_0[\phi]} (\cdots)}{\int D\phi\, e^{-S_0[\phi]}} \tag{5.7}
\]

for the functional average over the Gaussian action $S_0 \equiv S|_{g=0}$. The average over a product
of field variables,

\[
C_n(x_1, x_2, \ldots, x_n) \equiv \langle \phi(x_1)\phi(x_2) \cdots \phi(x_n) \rangle, \tag{5.8}
\]

is known as an $n$-point correlation function or, for brevity, just the $n$-point function.$^4$

The one-point function $C_1(x) = \langle \phi(x) \rangle$ simply measures the expectation value of the field
amplitude. For the particular case of the $\phi^4$-problem above the phase transition and, more
generally, the majority of field theories with an action even in the field amplitudes, $C_1 = 0$
and the first non-vanishing correlation function is the two-point function

\[
G(x_1 - x_2) \equiv C_2(x_1, x_2). \tag{5.9}
\]

(Why does $C_2$ depend only on the difference of its arguments?) The two-point function is
sometimes also called the propagator of the theory, the Green function or, especially in the
more formal literature, the resolvent operator.


George Green 1793-1841 

His only schooling consisted of four terms in 
1801/1802. He owned and worked a Nottingham 
windmill. Green made major contributions to poten- 
tial theory although where he learnt his mathemat- 
ical skills is a mystery. The inventor of Green func- 
tions, he used the method of sources and sinks in 
potential flows. He published only ten mathemati- 
cal works, the first and most important being pub- 
lished at his own expense in 1828, “An essay on the 
application of mathematical analysis to the theories 
of electricity and magnetism.” He left his mill and 
became an undergraduate at Cambridge in 1833 at 
the age of 40, then a Fellow of Gonville and Caius 
College in 1839. 


$^3$ Needless to say, the jargon introduced below is not restricted to the $\phi^4$ example!

$^4$ Notice that, depending on the context and/or scientific community, the phrase “$n$-point function” sometimes
refers to $C_{2n}$ instead of $C_n$.
The existence of different names suggests that we have met with an important object.
Indeed, we will shortly see that the Green function not only represents a central building 
block of the theory but also carries profound physical significance. 


INFO To develop some understanding of the physical meaning of the correlation function,
let us recall that the average of a linear field amplitude, $\langle \phi(0) \rangle$, vanishes. (See the figure, where
a diagram of a few “typical” field configurations is sketched as functions of a coordinate.)
However, the average of the squared amplitude $\langle \phi(0)^2 \rangle$ is certainly non-vanishing
simply because we are integrating over a positive object. Now, what happens if
we split our single observation point into two, $\langle \phi^2(0) \rangle_0 \to \langle \phi(0)\phi(x) \rangle_0 \equiv G(x)$? For
asymptotically large values of $x$, it is likely that the two amplitudes fluctuate independently
of each other, i.e. $G(x) \overset{|x| \to \infty}{\longrightarrow} 0$. However, this decoupling will not happen locally. The
reason is that the field amplitudes are correlated over a certain range in space. For example,
if $\phi(0) > 0$, the field amplitude will, on average, stay positive in an entire neighborhood of 0
since rapid fluctuations of the field are energetically costly (i.e. due to the gradient term in the
action!). The spatial correlation profile of the field is described by the function $G(x)$.

How does the correlation behavior of the field relate to the basic parameters of the action?
A quick answer can be given by dimensional analysis. The action of the theory must be
dimensionless (because it appears as the argument of an exponential). Denoting the dimension of
any quantity $X$ by $[X]$, and using the fact that $[\int d^dx] = L^d$, $[\partial] = L^{-1}$, inspection of Eq. (5.2)
obtains the set of relations

\[
L^{d-2}[\phi]^2 = 1, \qquad L^d [r][\phi]^2 = 1, \qquad L^d [g][\phi]^4 = 1,
\]

from which it follows that $[\phi] = L^{-(d-2)/2}$, $[r] = L^{-2}$, $[g] = L^{d-4}$. In general, both system
parameters, $g$ and $r$, carry a non-zero length-dimension. However, temporarily concentrating on
the non-interacting sector, $g = 0$, the only parameter of the theory, $r$, has dimensionality $L^{-2}$.
Arguing in reverse, we conclude that any intrinsic length scale produced by the theory (e.g. the
range over which the fields are correlated), must scale as $\sim r^{-1/2}$.


A more quantitative description can be obtained by considering the free propagator of
the theory,

\[
G_0(x) \equiv \langle \phi(0)\phi(x) \rangle_0. \tag{5.10}
\]

Since the momentum representation of the Gaussian action is simply given by
$S_0[\phi] = \frac{1}{2}\sum_p \phi_p (p^2 + r) \phi_{-p}$, it is convenient to first compute $G_0$ in reciprocal space:
$G_{0,p} \equiv \int d^dx\, e^{ip\cdot x} G_0(x) = \sum_{p'} \langle \phi_p \phi_{p'} \rangle_0$. Using the Gaussian contraction rule Eq. (3.14), the free
functional average takes the form $\langle \phi_p \phi_{p'} \rangle_0 = \delta_{p+p',0}\,(p^2 + r)^{-1}$, i.e.$^5$

\[
G_{0,p} = \langle \phi_p \phi_{-p} \rangle_0 = \frac{1}{p^2 + r}. \tag{5.11}
\]

$^5$ The result $G_{0,p} = (p^2 + r)^{-1}$ clarifies why $G$ is referred to as a “Green function.” Indeed, $G_{0,p}$ is (the Fourier
representation of the) Green function of the differential equation $(-\partial_{\mathbf{r}}^2 + r)G(\mathbf{r}, \mathbf{r}') = \delta(\mathbf{r} - \mathbf{r}')$.

To obtain $G_0(x)$, we need to compute the inverse transform

\[
G_0(x) = \frac{1}{L^d}\sum_p e^{-ip\cdot x}\, G_{0,p} \approx \int \frac{d^dp}{(2\pi)^d}\, \frac{e^{-ip\cdot x}}{p^2 + r}, \tag{5.12}
\]

where we have assumed that the system is large, i.e. the sum over momenta can be exchanged
for an integral.

For simplicity, let us compute the integral for a one-dimensional system. (For the two- and
three-dimensional cases see exercise below.) Setting $p^2 + r = (p + ir^{1/2})(p - ir^{1/2})$, we note that
the (complex extension of the) $p$ integral has simple poles at $\pm ir^{1/2}$. For $x$ smaller (larger) than
zero, the integrand decays exponentially in the upper (lower) complex $p$-plane and closure of the
integration contour by a semicircle of infinite radius gives

\[
G_0(x) = \int \frac{dp}{2\pi}\, \frac{e^{-ipx}}{p^2 + r} = \frac{e^{-r^{1/2}|x|}}{2r^{1/2}}. \tag{5.13}
\]

This result conveys an interesting observation: typically, correlations decay exponentially, at
a rate set by the correlation length $\xi \equiv r^{-1/2}$. However, as $r$ approaches 0, the system
becomes long-range correlated. The origin of this phenomenon can be understood by inspecting
the structure of the Gaussian contribution to the action (5.2). For $r \to 0$ (and still neglecting the
$\phi^4$ contribution) nothing prevents the constant field mode $\phi(x) \equiv \phi_0 = \text{const.}$ from becoming
infinitely large, i.e. the fluctuating contribution to the field becomes relatively less important
than the constant offset. The increasing “stiffness” of the field in turn manifests itself in a growth
of spatial correlations (cf. the figure on page 201). Notice that this dovetails with our previous
statement that $r = 0$ marks the position of a phase transition. Indeed, the build-up of infinitely
long-range spatial correlations is known to be a hallmark of second-order phase transitions (see
Chapter 8).
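
The one-dimensional result Eq. (5.13) is easy to cross-check numerically. The following sketch is an illustration rather than part of the original text: it evaluates the Fourier integral $\int \frac{dp}{2\pi}\, e^{-ipx}/(p^2 + r)$ directly, using the evenness of the integrand in $p$ and Simpson's rule on a truncated momentum interval, and compares it with the closed form $e^{-\sqrt{r}|x|}/(2\sqrt{r})$. The function names, the momentum cutoff `p_max` and the grid size are hypothetical choices; the truncation limits the accuracy at very small $|x|$.

```python
import math


def g0_closed_form(x, r):
    """Closed form of Eq. (5.13): exp(-sqrt(r) |x|) / (2 sqrt(r))."""
    return math.exp(-math.sqrt(r) * abs(x)) / (2.0 * math.sqrt(r))


def g0_numeric(x, r, p_max=400.0, steps=80000):
    """Simpson estimate of int dp/(2 pi) exp(-i p x) / (p^2 + r).

    The integrand is even in p, so the integral equals
    (1/pi) * int_0^infinity dp cos(p x) / (p^2 + r); the upper limit is
    replaced by the finite value p_max.
    """
    h = p_max / steps
    total = 0.0
    for k in range(steps + 1):
        p = k * h
        weight = 1 if k in (0, steps) else (4 if k % 2 else 2)
        total += weight * math.cos(p * x) / (p * p + r)
    return total * h / 3.0 / math.pi


r = 1.0
for x in (0.5, 1.0, 2.0):
    print(x, g0_numeric(x, r), g0_closed_form(x, r))

# Correlation length xi = r**(-1/2): a smaller r gives longer-ranged correlations.
print(g0_closed_form(5.0, 1.0), g0_closed_form(5.0, 0.01))
```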


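EXERCISE Referring to Eq. (5.12), show that, in dimensions $d = 2$ and $d = 3$,

\[
G_0(\mathbf{x}) = \int \frac{d^2k}{(2\pi)^2}\, \frac{e^{-i\mathbf{k}\cdot\mathbf{x}}}{k^2 + r} = \frac{1}{2\pi} K_0(\sqrt{r}|\mathbf{x}|) \approx
\begin{cases}
-\dfrac{1}{2\pi} \ln\left(\dfrac{\sqrt{r}|\mathbf{x}|}{2}\right), & |\mathbf{x}| \ll 1/\sqrt{r}, \\[2ex]
\dfrac{1}{2} \left(2\pi\sqrt{r}|\mathbf{x}|\right)^{-1/2} e^{-\sqrt{r}|\mathbf{x}|}, & |\mathbf{x}| \gg 1/\sqrt{r},
\end{cases}
\]

\[
G_0(\mathbf{x}) = \int \frac{d^3k}{(2\pi)^3}\, \frac{e^{-i\mathbf{k}\cdot\mathbf{x}}}{k^2 + r} = \frac{e^{-\sqrt{r}|\mathbf{x}|}}{4\pi|\mathbf{x}|}.
\]

Notice that, in both cases, the Green function diverges in the limit $|\mathbf{x}| \to 0$ and decays
exponentially (at a rate $\sim r^{1/2}$) for $|\mathbf{x}| \gg r^{-1/2}$.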


Perturbation theory at low orders 


Having discussed the general structure of the theory and of its free propagator, let us turn
our attention to the role of the interaction contribution to the action,

\[
S_{\rm int}[\phi] \equiv g \int d^dx\, \phi^4.
\]

Within the jargon of field theory, an integrated monomial of a field variable (like $\phi^4$) is
commonly called an (interaction) operator or a vertex (operator). Keeping in mind the
words of caution given in Section 5.1, we wish to explore perturbatively how the interaction
vertex affects the functional expectation value of any given field observable, i.e. we wish to
analyze expansions of the type

\[
\langle X[\phi] \rangle = \frac{\sum_{n=0} \frac{(-g)^n}{n!} \left\langle X[\phi] \left(\int d^dx\, \phi^4\right)^n \right\rangle_0}{\sum_{n=0} \frac{(-g)^n}{n!} \left\langle \left(\int d^dx\, \phi^4\right)^n \right\rangle_0} \approx \sum_{n=0} X^{(n)}, \tag{5.14}
\]

where $X$ may be any observable and $X^{(n)}$ denotes the contribution of $n$th order to the
expansion in $g$. The limits on the summation in the numerator and denominator are symbolic
because, as explained above, we will need to terminate the total perturbative expansion at
a certain finite order $n_{\max}$.


EXERCISE To navigate the following section, it is helpful to recapitulate Section 3.2 on contin- 
uum Gaussian integration. 

To keep the discussion concrete, let us focus on the perturbative expansion of the
propagator in the coupling constant, $g$. (A physical application relating to this expansion will be
discussed below.) The zeroth-order contribution $G^{(0)} = G_0$ has been discussed before, so
the first non-trivial term we have to explore is $G^{(1)}$:

\[
G^{(1)}(x, x') = -g \left( \left\langle \phi(x) \int d^dy\, \phi(y)^4\, \phi(x') \right\rangle_0 - \left\langle \phi(x)\phi(x') \right\rangle_0 \left\langle \int d^dy\, \phi(y)^4 \right\rangle_0 \right). \tag{5.15}
\]

Since the functional average is now over a Gaussian action, this expression can be evaluated
by Wick’s theorem, Eq. (3.21). For example, the functional average of the first of the two
terms leads to (integral signs and constants stripped off for clarity)

\[
\begin{aligned}
\langle \phi(x)\phi(y)^4\phi(x') \rangle_0 &= 3 \langle \phi(x)\phi(x') \rangle_0 \left[\langle \phi(y)\phi(y) \rangle_0\right]^2 + 12 \langle \phi(x)\phi(y) \rangle_0 \langle \phi(y)\phi(y) \rangle_0 \langle \phi(y)\phi(x') \rangle_0 \\
&= 3 G_0(x - x') G_0(0)^2 + 12 G_0(x - y) G_0(0) G_0(y - x'),
\end{aligned} \tag{5.16}
\]

where we have used the fact that the operator inverse of the Gaussian action is, by definition,
the free Green function (cf. Eq. (5.10)). Further, notice that the total number of terms
appearing on the right-hand side is equal to $15 = (6-1)!!$, which is just the number of distinct
pairings of six objects (cf. Eq. (3.21) and our discussion in Section 5.1). Similarly, the
second contribution to $G^{(1)}$ leads to

\[
\langle \phi(x)\phi(x') \rangle_0 \langle \phi(y)^4 \rangle_0 = 3 \langle \phi(x)\phi(x') \rangle_0 \left[\langle \phi(y)^2 \rangle_0\right]^2 = 3 G_0(x - x') G_0(0)^2.
\]
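
The combinatorial weights 3 and 12 in Eq. (5.16) can be reproduced by brute force. The sketch below is an illustration, not the book's method of presentation: it enumerates all complete pairings of the six fields $\phi(x)\,\phi(y)^4\,\phi(x')$ and groups them according to the product of free Green functions they generate. The helper names `pairings` and `wick_terms` are hypothetical.

```python
from collections import Counter


def pairings(items):
    """Yield every way of grouping the entries of `items` into unordered pairs."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for i, partner in enumerate(rest):
        for tail in pairings(rest[:i] + rest[i + 1:]):
            yield [(first, partner)] + tail


def wick_terms(points):
    """Count the pairings of the fields located at `points`.

    Pairings are grouped by the set of lines (free Green functions) they
    produce; the value stored for each group is its combinatorial weight.
    """
    terms = Counter()
    for pairing in pairings(list(range(len(points)))):
        lines = sorted(tuple(sorted((points[a], points[b]))) for a, b in pairing)
        terms[tuple(lines)] += 1
    return terms


# One field at x, four fields at the interaction point y, one field at x'.
first_order = wick_terms(["x", "y", "y", "y", "y", "x'"])
for lines, weight in sorted(first_order.items()):
    print(weight, " * ".join(f"G0({a},{b})" for a, b in lines))
```

The same function can be applied to the ten fields of a second-order contraction, at the price of a longer (but still instantaneous) enumeration.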

Before analyzing these structures in more detail, let us make some general observations.
The first-order expansion of $G$ contains a number of factors of $G_0(0)$, the free Green function
evaluated at coinciding points. This bears disturbing consequences. To see this, consider
$G_0(0)$ evaluated in momentum space:

\[
G_0(0) = \int \frac{d^dp}{(2\pi)^d}\, \frac{1}{p^2 + r}. \tag{5.17}
\]

For dimensions d > 1, the integral is divergent at large momenta or short wavelengths; we 
have met with an ultraviolet (UV) divergence. Physically, the divergence implies that, 
already at first order, our expansion runs into a difficulty that is obviously related to the
short-distance structure of the system. How can this problem be overcome? One way out
is to remember that field theories like the $\phi^4$-model represent effective low-temperature, or
long-wavelength, approximations to more microscopic models. The range of applicability of
the action must be limited to wavelengths in excess of some microscopic lattice cutoff $a$ (e.g.
the lattice spacing), or momenta $k < a^{-1}$. It seems that, once that cutoff has been built
in, the convergence problem is solved. However, there is something unsatisfactory in this 
argument. All our perturbative corrections, and therefore the final result of the analysis, 
exhibit sensitivity to the microscopic cutoff parameter. But this is not what we expect of a 
sensible low-energy theory (cf. the discussion of Chapter 1)! The UV problem signals that 
something more interesting is going on than a naive cutoff regularization has the capacity 
to describe. We discuss this point extensively in Chapter 8. 
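
The cutoff sensitivity can be made explicit by restricting the momentum integral (5.17) to $|p| < \Lambda$, where $\Lambda \sim a^{-1}$. The following sketch is an illustration under that assumption (a sharp spherical cutoff, which is only one of many possible regularizations); the function name `g0_at_origin` is hypothetical. After angular integration the radial integrals are elementary, giving a finite limit in $d = 1$, logarithmic growth in $d = 2$, and linear growth in $d = 3$.

```python
import math


def g0_at_origin(d, r, cutoff):
    """G_0(0) of Eq. (5.17) with momenta restricted to |p| < cutoff (d = 1, 2, 3)."""
    s = math.sqrt(r)
    if d == 1:
        # (1/pi) * int_0^cutoff dp / (p^2 + r)
        return math.atan(cutoff / s) / (math.pi * s)
    if d == 2:
        # (1/(2 pi)) * int_0^cutoff dp p / (p^2 + r)
        return math.log(1.0 + cutoff ** 2 / r) / (4.0 * math.pi)
    if d == 3:
        # (1/(2 pi^2)) * int_0^cutoff dp p^2 / (p^2 + r)
        return (cutoff - s * math.atan(cutoff / s)) / (2.0 * math.pi ** 2)
    raise ValueError("only d = 1, 2, 3 are implemented in this sketch")


r = 1.0
for cutoff in (1e1, 1e2, 1e3, 1e4):
    values = [g0_at_origin(d, r, cutoff) for d in (1, 2, 3)]
    print(cutoff, values)
```

Letting $r \to 0$ at fixed cutoff in the same expressions shows the complementary small-momentum behavior: the $d = 1$ result grows like $r^{-1/2}$ and the $d = 2$ result like $\ln(1/r)$, while the $d = 3$ result stays finite.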

However, even if we temporarily close our eyes to the UV-phenomenon, there is another 
problem. For dimensions $d \le 2$, and in the limit $r \to 0$, $G_0(0)$ also diverges at small
momenta, an infrared (IR) divergence. Being related to structures at large wavelengths,
this type of singularity should attract our attention even more than the UV-divergence
mentioned above. Indeed, it is intimately related to the accumulation of long-range correlations
in the limit $r \to 0$ (cf. the structure of the integral (5.12)). We come back to the discussion
of the IR singularities, and their connection to the UV phenomenon, in Chapter 8. 

The considerations above show that the perturbative analysis of functional integrals will 
be accompanied by all sorts of divergences. Moreover, there is another, less fundamental, but 
also important, point: referring to Eq. (5.16), we have to concede that the expression does 
not look particularly inviting. To emphasize the point, let us consider the core contribution 
to the expansion at second order in g. 

EXERCISE Show that the 10th-order contraction leads to the $945 = (10-1)!!$ terms

\[
\begin{aligned}
\langle \phi(x)\phi(y)^4\phi(y')^4\phi(x') \rangle_0 ={}& 9\, G_0(x - x') G_0(0)^4 + 72\, G_0(x - x') G_0(y - y')^2 G_0(0)^2 \\
&+ 24\, G_0(x - x') G_0(y - y')^4 + \Big[ 36\, G_0(x - y) G_0(x' - y) G_0(0)^3 \\
&+ 144 \big( G_0(x - y) G_0(x' - y) G_0(y - y')^2 G_0(0) + G_0(x - y) G_0(x' - y') G_0(0)^2 G_0(y - y') \big) \\
&+ 96\, G_0(x - y) G_0(x' - y') G_0(y' - y)^3 + (y \leftrightarrow y') \Big].
\end{aligned} \tag{5.18}
\]

Note: Our further discussion will not rely on this result. It only serves an illustrative purpose.

Clearly Eq. (5.18) is highly opaque. There are seven groups of different terms, but it is
not obvious how to attribute any meaning to these contributions. Further, should we
consider the full second-order Green function $G^{(2)}$, i.e. take account of the expansion of both
numerator and denominator in Eq. (5.14), we would find that some contributions cancel
(see Problem 5.5). Clearly, the situation will not improve at third and higher orders in $g$.
Figure 5.1 Graphical representation of a first-order-in-$g$ contraction contributing to the expansion
of the Green function.


To efficiently apply perturbative concepts beyond lowest orders, a more efficient formulation
of the expansion is needed. The key to the construction of a better language lies in the
observation that our previous notation is full of redundancy, i.e. in the full contraction of a
perturbative contribution, we represent our fields by $\phi(x)$. A more compact way of keeping
track of the presence of that field is shown in the upper portion of the figure to the
right. Draw a point (with an optional “$x$” labeling its position) and attach a little leg to
it. The leg indicates that the fields are sociable objects, i.e. they need to find a partner
with which to pair. After the contraction, a pair $\phi(x)\phi(y) \to G_0(x - y)$ becomes a free
Green function. Graphically, this information can be represented by a pairwise connection
of the legs of the field symbols to lines, where each line is identified with a Green function
connecting the two terminating points. The full contraction of a free correlation function
$\langle \phi(x_1)\phi(x_2) \cdots \phi(x_{2n}) \rangle_0$ is represented by the set of all distinct diagrams formed by pairwise
connection of the field vertices.

Figure 5.1 shows the graphical representation of the contraction of Eq. (5.16). (The cross 
appearing on the left-hand side represents four field operators sitting at the same point y.) 
According to our rule formulated above, each of the two diagrams on the right-hand side 
represents the product of three Green functions, taken between the specified coordinates. 
Further, each contribution is weighted by a combinatorial factor, i.e. the number of identical 
diagrams of that structure. Consider, for example, the second contribution on the right- 
hand side. It is formed by connecting the “external” field vertex at x to any of the legs 
of the internal vertex at y: four possibilities. Next, the vertex at x' is connected with one 
of the remaining three unsaturated vertices at y: three possibilities. The last contraction 
y F* y is fixed, i.e. we obtain altogether 3 x 4 = 12 equivalent diagrams - “equivalent” in 
that each of these represents the same configuration of Green functions. 
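
The counting above can be checked by brute force. The following sketch is not part of the original argument, and the helper names `pairings` and `point` are purely illustrative. It enumerates all complete pairings of the six legs of $\langle \phi(x) \phi(y)^4 \phi(x') \rangle_0$ and groups them according to the lines they draw between the points x, x' and y.

```python
def pairings(items):
    """Yield every way of grouping `items` into unordered pairs (Wick contractions)."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail


def point(leg):
    """Map a leg label to the point it sits at (the four y-legs share one point)."""
    return "y" if leg.startswith("y") else leg


# One leg at x, one at x', four legs at the internal vertex y.
legs = ["x", "x'", "y1", "y2", "y3", "y4"]

counts = {}
for pairing in pairings(legs):
    # A diagram type is fixed by the multiset of lines drawn between points.
    key = tuple(sorted(tuple(sorted((point(a), point(b)))) for a, b in pairing))
    counts[key] = counts.get(key, 0) + 1

for key, weight in sorted(counts.items(), key=lambda kv: kv[1]):
    print(weight, key)
```

Of the 15 pairings, 12 realize the connected diagram discussed above, while the remaining 3 link x directly to x' and close the four legs at y among themselves.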

EXERCISE Verify that the graphical representation of the second-order contraction Eq. (5.18) 
is as shown in Fig. 5.2. 6 Associate the diagrams with individual contributions appearing in
Eq. (5.18) and try to reproduce the combinatorial factors. (For more details, see Problem 5.5.) 

The graphical representation of the contractions shown in Figs. 5.1 and 5.2 provides us with
sufficient background to list some general aspects of the diagrammatic approach:


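[Figure: two field vertices $\phi(x)$ and $\phi(y)$, each drawn as a point with a leg; their contraction $\langle \phi(x) \phi(y) \rangle_0$ is drawn as a line representing $G_0(x, y)$.]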


6 In the figure, the coordinates carried by the field vertices have been dropped for notational simplicity. To restore
the full information carried by any of these “naked” graphs one attaches coordinates $x$ and $x'$ to the external
field vertices and integration coordinates $y_i$ to each of the $i$ nodes that do not connect to an external field vertex.
Since no information is lost, diagrams are often represented without explicit reference to coordinates. 
[Diagrams of Fig. 5.2, each multiplied by its combinatorial weight; the weights 72, 24, and 192 are among them.]



Figure 5.2 Graphical representation of the second-order correction to the Green function. In the 
main text, the seven types of diagram contributing to the contraction will be referred to (in the 
order they appear above) as diagrams 1 to 7. 


> Firstly, diagrammatic methods help to efficiently represent the perturbative expansion. 
However, we are still left with the problem (see the discussion above) of computing the 
analytical expressions corresponding to individual diagrams. To go back from an nth-order 
graph to its analytical representation one (i) attaches coordinates to all field vertices, 
(ii) identifies lines between points with Green functions, (iii) multiplies the graph by the 
overall constant $(-g)^n/n!$, and (iv) integrates over all of the internal coordinates. When one
encounters expressions like $G^{(n)}$ = “sum of graphs,” the operations (i)-(iv) are implicit.

> As should be clear from the formulation of our basic rules, there is no fixed rule as to 
how to represent a diagram. As long as no lines are cut, any kind of reshaping, twisting, 
rotating, etc. of the diagram leaves its content invariant. (At large orders of perturbation 
theory, it often takes a second look to identify two diagrams as equivalent.) 

> From the assembly of diagrams contributing to any given order, a number of internal 
structures common to the series expansion become apparent. For example, looking at the 
diagrams shown in Fig. 5.2, we notice that some are connected, and some are not. Among 
the set of connected diagrams (nos. 5, 6, 7) there are some whose “core portion,” i.e. 
the content of the diagram after the legs connecting to the external vertices have been 
removed, can be cut into two pieces just by cutting one more line (no. 7). Diagrams of 
this type are called one-particle reducible while the others are termed one-particle 
irreducible. More generally, a diagram whose core region can be cut by severing n lines is 
called n-particle reducible. (For example, no. 6 is three-particle reducible, no. 7 one- 
particle reducible, etc.) One can also attach a loop order to a diagram, i.e. the number of 
inequivalent loops formed by segments of Green functions (for Fig. 5.2: 4, 3, 3, 3, 2, 2, 2, 
in that order). One (correctly) expects that these structures, which are difficult to discern 
from the equivalent analytical representation, will reflect themselves in the mathematics 
of the perturbative expansion. We return to the discussion of this point below. 

> Then there is the issue of combinatorics. The diagrammatic representation simplifies 
the determination of the combinatorial factors appearing in the expansion. However, the 
problem of getting the combinatorics right remains non-trivial. (If you are not impressed 
with the factors entering the second-order expansion, consider the $(14 - 1)!! = 135135$
terms contributing at third order!) In some sub-disciplines of theoretical physics, the art
of identifying the full set of combinatorial coefficients at large orders of perturbation 
theory has been developed to a high degree of sophistication. Indeed, one can set up 
refined sets of diagrammatic construction rules which to a considerable extent automate 
the combinatorics. Pedagogical discussions of these rules can be found, for example, in 
the textbooks by Negele and Orland, and Ryder. 7 However, as we will see shortly, the 
need to explicitly carry out a large-order expansion, with account of all diagrammatic 
sub-processes, rarely arises in modern condensed matter physics; mostly one is interested 
in subclasses of diagrams, for which the combinatorics is less problematic. For this reason, 
the present text does not contain a state-of-the-art exposition of all diagrammatic tools 
and interested readers are referred to the literature. 

> Finally, and perhaps most importantly, the diagrammatic representation of a given con- 
tribution to the perturbative expansion often suggests a physical interpretation of the 
corresponding physical process. (After all, any term contributing to the expansion of a 
physical observable must correspond to some “real” physical process.) Unfortunately, the 
$\phi^4$-theory is not well suited to illustrate this aspect, i.e., being void of any dynamical
content, it is a little bit too simple. However, the possibility of “reading” individual diagrams
will become evident in the next section when we discuss an application to the interacting 
electron gas. 
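
To get a feeling for how quickly the number of terms grows, recall that Wick's theorem pairs up $2n$ fields in $(2n - 1)!!$ ways. The short sketch below assumes two external fields plus four fields per $\phi^4$ vertex; the function names are illustrative and not taken from the text.

```python
def double_factorial(k):
    """k!! = k (k - 2) (k - 4) ..., with (-1)!! = 0!! = 1."""
    result = 1
    while k > 1:
        result *= k
        k -= 2
    return result


def n_fields(order):
    """Fields in the numerator at a given order: two external ones plus four per vertex."""
    return 2 + 4 * order


def n_pairings(order):
    """Number of complete Wick pairings of the fields at the given order."""
    return double_factorial(n_fields(order) - 1)


for order in range(4):
    print(order, n_fields(order), n_pairings(order))
```

The loop reproduces the 15 pairings at first order, 945 at second order, and the 135135 terms quoted for third order.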

Above we have introduced the diagrammatic approach on the example of field expectation 
values $\langle \phi(x) (\phi(y)^4)^n \phi(x') \rangle_0$. However, to obtain the Green function to any given order in
perturbation theory, we need to add to these expressions the contributions emanating from 
the expansion of the denominator of the functional average (cf. Eq. (5.14) and (5.15)). 
While, at first sight, the need to keep track of even more terms seems to complicate matters, 
we will see that, in fact, quite the opposite is true! The combined expansion of numerator 
and denominator leads to a miraculous “cancellation mechanism” that greatly simplifies the 
analysis. 




Figure 5.3 Graphical representation of the first-order correction to the Green function: vacuum 
graphs cancel out. 

Let us exemplify the mechanism of cancellation on $G^{(1)}$. The three diagrams corresponding
to the contractions of Eq. (5.15) are shown in Fig. 5.3, where integral signs and coordinates
are dropped for simplicity. On the left-hand side of the equation, the brackets $\langle \cdots \rangle_0$


7 J. W. Negele and H. Orland, Quantum Many Particle Systems (Addison-Wesley, 1988); L. H. Ryder, Quantum
Field Theory (Cambridge University Press, 1996).
indicate that the second contribution comes from the expansion of the denominator. The
point to be noticed is that the graph produced by the contraction of that term cancels 
against a contribution arising from the numerator. One further observes that the canceled 
graph is of a special type: it contains an interaction vertex that does not connect to any of 
the external vertices. Diagrams with that property are commonly termed vacuum graphs. 8

EXERCISE Construct the diagrammatic representation of $G^{(2)}$ and verify that the expansion of
the denominator eliminates all vacuum graphs of the numerator. In particular, show that $G^{(2)}$ is
given by the sum of connected diagrams shown in Fig. 5.4. (For more details, see Problem 5.5.)


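[Diagrams of Fig. 5.4: $G^{(2)}$ written as a sum of connected diagrams, the first of which carries the weight 192.]

Figure 5.4 Graphical representation of the second-order contribution to the Green function.
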
Indeed, the cancellation of vacuum graphs pertains to higher-order correlation functions 
and to all orders of the expansion: 


The contribution to a correlation function $C^{(2n)}(x_1, \ldots, x_{2n})$ at $l$th order of
perturbation theory is given by the sum of all graphs, excluding vacuum graphs.


For example, the first-order expansion of the four-point
function $C^{(4)}(x_1, \ldots, x_4)$ is shown in the figure, where coordinates
$x_i \leftrightarrow i$ are abbreviated by indices and “+ perm.”
stands for the six permutations obtained by interchanging
arguments. In the literature, the statement of vacuum
graph cancellation is sometimes referred to as the linked 
cluster theorem. Notice that the linked cluster feature 
takes care of two problems: firstly we are relieved of the burden of a double expansion of 
numerator and denominator, and secondly only non-vacuum contributions to the expansion 
of the former need to be kept. 
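
The cancellation can be made tangible in a zero-dimensional caricature of the theory, in which the field is a single Gaussian variable and every Green function is the same number $G_0$ (set to 1 below). This toy setting, the weight $e^{-g\phi^4}$, and all names in the sketch are assumptions made for illustration only. The code divides the series of the numerator by that of the denominator and compares the result with a brute-force count of pairings free of vacuum parts.

```python
from fractions import Fraction
from math import factorial


def double_factorial(k):
    result = 1
    while k > 1:
        result *= k
        k -= 2
    return result


def moment(power):
    """Gaussian moment <phi^power>_0 for G0 = 1, i.e. (power - 1)!! for even power."""
    return double_factorial(power - 1) if power % 2 == 0 else 0


def pairings(items):
    """Yield every way of grouping `items` into unordered pairs."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for i, partner in enumerate(rest):
        for tail in pairings(rest[:i] + rest[i + 1:]):
            yield [(first, partner)] + tail


def non_vacuum_count(order):
    """Pairings of phi(x) (phi^4)^order phi(x') in which every vertex is tied to x or x'."""
    legs = ["x", "x'"] + [f"y{v}" for v in range(order) for _ in range(4)]
    count = 0
    for pairing in pairings(legs):
        reached = {"x", "x'"}
        grew = True
        while grew:  # spread outwards from the external points along the lines
            grew = False
            for a, b in pairing:
                if (a in reached) != (b in reached):
                    reached.update((a, b))
                    grew = True
        if len(reached) == 2 + order:
            count += 1
    return count


ORDER = 2
# Coefficients of (-g)^n in <phi^2 exp(-g phi^4)>_0 and in <exp(-g phi^4)>_0.
num = [Fraction(moment(2 + 4 * n), factorial(n)) for n in range(ORDER + 1)]
den = [Fraction(moment(4 * n), factorial(n)) for n in range(ORDER + 1)]

# Power-series division num / den, order by order in (-g).
ratio = []
for n in range(ORDER + 1):
    ratio.append((num[n] - sum(ratio[k] * den[n - k] for k in range(n))) / den[0])

for n in range(ORDER + 1):
    print(n, ratio[n], Fraction(non_vacuum_count(n), factorial(n)))
```

In this toy model the two columns agree order by order: the ratio of the two series is generated by the non-vacuum pairings alone, weighted by $1/n!$.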

INFO The proof of the linked cluster theorem is straightforward. Consider a contribution
of $n$th order to the expansion of the numerator of Eq. (5.14): $\frac{(-g)^n}{n!} \langle X[\phi] (\int d^dx\, \phi^4)^n \rangle_0$. The
contraction of this expression will lead to a sum of vacuum graphs of $p$th-order and non-vacuum
graphs of $(n - p)$th-order, where $p$ runs from 0 to $n$. The $p$th-order contribution is given by

$$\frac{(-g)^n}{n!} \binom{n}{p} \left\langle X[\phi] \left( \int d^dx\, \phi^4 \right)^{n-p} \right\rangle_0^{\rm n.v.} \left\langle \left( \int d^dx\, \phi^4 \right)^p \right\rangle_0,$$

where the superscript $\langle \cdots \rangle^{\rm n.v.}$ indicates that the contraction excludes vacuum graphs and the
combinatorial coefficient counts the number of possibilities to choose $p$ vertices $\phi^4$ of a total of $n$



8 The term “vacuum graph” has its origin in the diagrammatic methods invented in the 1950s in the context
of particle theory. Instead of thermal averages $\langle \cdots \rangle_0$, one considered matrix elements $\langle \Omega | \cdots | \Omega \rangle$ taken on the
ground state or “vacuum” of the field theory. This caused matrix elements $\langle \Omega | (S_{\rm int}[\phi])^n | \Omega \rangle$ not containing an
external field vertex to be dubbed “vacuum graphs.”






Figure 5.5 Momentum space representation of a first-order contribution to the Green function.
Internal momenta $p_i$ are integrated over.


vertices to form a vacuum graph. Summing over p, we find that the expansion of the numerator, 
split into vacuum and non-vacuum contributions, reads 




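$$\sum_{n=0}^{\infty} \sum_{p=0}^{n} \frac{(-g)^n}{(n-p)!\, p!} \left\langle X[\phi] \left( \int d^dx\, \phi^4 \right)^{n-p} \right\rangle_0^{\rm n.v.} \left\langle \left( \int d^dx\, \phi^4 \right)^p \right\rangle_0.$$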

By a straightforward rearrangement of the summations, this can be rewritten as 


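$$\sum_{n=0}^{\infty} \frac{(-g)^n}{n!} \left\langle X[\phi] \left( \int d^dx\, \phi^4 \right)^{n} \right\rangle_0^{\rm n.v.} \sum_{p=0}^{\infty} \frac{(-g)^p}{p!} \left\langle \left( \int d^dx\, \phi^4 \right)^p \right\rangle_0.$$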


The p-summation recovers exactly the expansion of the denominator, so we are left with the sum 
over all non-vacuum contractions. 


Before concluding this section, let us discuss one last technical point. The translational 
invariance of the $\phi^4$-action suggests a representation of the theory in reciprocal space.
Indeed, the momentum space representation of the propagator Eq. (5.11) is much 
simpler than the real space form, and the subsequent analytical evaluation of diagrams will 
be formulated in momentum space anyway (cf. the prototypical expression (5.17)). 


[Figure: the elementary contraction in momentum space, a line labeled by its momentum $p$ and representing $G_0(p)$.]

The diagrammatic formulation of the theory in momentum
space is straightforward. All we need to do is to slightly adjust
the graphical code. Inspection of Eq. (5.11) shows that the elementary
contraction should now be formulated as indicated in the
figure. Only fields with opposite momentum can be contracted;
the line carries this momentum as a label. Notice that the momentum
representation of the field vertex $\phi^4(x)$ is not given by $\phi_p^4$.
Rather, Fourier transformation of the vertex leads to the threefold
convolution

$$\int d^dx\, \phi^4(x) \to \frac{1}{L^d} \sum_{p_1, \ldots, p_4} \phi_{p_1} \phi_{p_2} \phi_{p_3} \phi_{p_4}\, \delta_{p_1 + p_2 + p_3 + p_4, 0}.$$


The graphical representation of the first-order correction to the Green function (i.e. the 
momentum space analog of Fig. 5.3) is shown in Fig. 5.5. It is useful to think about the 
vertices of the momentum-space diagrammatic language in the spirit of “Kirchhoff laws” : 
the sum of all momenta flowing into a vertex is equal to zero. Consequently (exercise) the 
total sum of all momenta “flowing” into a diagram from external field vertices must also 
equal zero: $\langle \phi_{p_1} \phi_{p_2} \cdots \phi_{p_n} \rangle \propto \delta_{\sum_i p_i, 0} (\cdots)$. This fact expresses the conservation of the
total momentum characteristic of theories with translational invariance.
EXERCISE Represent the diagrams of the second-order contraction shown in Fig. 5.2 in momentum
space. Convince yourself that the “Kirchhoff law” suffices to fix the result. Observe that the
number of summations over internal momenta is equal to the number of loops. 
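
The loop number of a diagram follows from simple graph counting: the number of independent loops equals the number of lines minus the number of points plus the number of connected pieces. The sketch below (an illustration added here, with hypothetical names and an arbitrary numbering of the points) applies this to the two first-order diagrams and to one connected second-order diagram.

```python
def loop_count(n_points, lines):
    """Independent loops of a graph: lines - points + connected components."""
    parent = list(range(n_points))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for a, b in lines:
        parent[find(a)] = find(b)
    components = len({find(i) for i in range(n_points)})
    return len(lines) - n_points + components


# First order, points numbered 0 = x, 1 = x', 2 = y.
connected_first_order = [(0, 2), (2, 2), (2, 1)]      # x-y, y-y, y-x'
disconnected_first_order = [(0, 1), (2, 2), (2, 2)]   # x-x' plus a vacuum "figure eight"

# Second order, points 0 = x, 1 = x', 2 = y1, 3 = y2: x-y1, three lines y1-y2, y2-x'.
connected_second_order = [(0, 2), (2, 3), (2, 3), (2, 3), (3, 1)]

print(loop_count(3, connected_first_order))     # 1
print(loop_count(3, disconnected_first_order))  # 2
print(loop_count(4, connected_second_order))    # 2
```

At second order every diagram has five lines and four points, so the loop number is one plus the number of connected pieces. This is consistent with the sequence 4, 3, 3, 3, 2, 2, 2 quoted earlier for Fig. 5.2.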

This concludes the first part of our introduction to the formal elements of perturbation 
theory. Critical readers will object that, while we undertook some efforts to efficiently rep- 
resent the perturbative expansion, we have not in the least addressed the question of how 
interactions will actually modify the results of the free theory. Indeed, we are not yet in 
a position to quantitatively address this problem, the reason being that we first need to 
better understand the origin and remedy of the UV/IR divergences observed above.

However, temporarily ignoring the presence of this roadblock, let us try to outline what 
kind of information can be extracted from perturbative analyses, in principle. One impor- 
tant point to be noted is that, in condensed matter physics, 9 low-order perturbation theory 
is usually not enough to obtain quantitative results. The fact that the “perturbation” cou- 
ples to a macroscopic number of degrees of freedom 10 usually necessitates summation of 
infinite (sub)series of a perturbative expansion or even the application of non-perturbative 
methods. This, however, does not mean that the tools developed above are useless: given 
a system subject to unfamiliar interactions, low-order perturbation theory will usually be 
applied as a first step to qualitatively explore the situation. For example, a malign diver- 
gence of the expansion in the interaction operator may signal the presence of an instability 
towards the formation of a different phase. Or it may turn out that certain contributions to 
the expansion are “physically more relevant” than others. Technically, such contributions 
usually correspond to diagrams of a regular graphical structure. If so, a summation over 
all “relevant processes” may be in reach. In either case, low-order expansions provide vital 
hints as to the appropriate strategy of further analysis. In the following we discuss two 
examples that may help to make these remarks more transparent. 


5.2 Ground state energy of the interacting electron gas 

In Section 2.2 we began to consider the physics of highly mobile electron compounds. We 
argued that such a system can be described in terms of the free particle Hamiltonian (2.18) 
together with the interaction operator (2.19). While we have reviewed the physics of the 
non-interacting system, nothing has hitherto been said about the role of electron-electron 
interactions. Yet by now we have developed enough analytical machinery to begin to address 
this problem. Below we will apply concepts of perturbation theory to estimate the contribu- 
tion of electronic correlations to the ground state energy of a Fermi system. However, before 
plunging into the technicalities of this analysis, it is worthwhile discussing some qualitative 
aspects of the problem. 


9 There are subdisciplines of physics where the situation is different. For example, consider the high-precision
scattering experiments of atomic and sub-atomic physics. In these areas, the power of a theory to quantitatively
predict the dependence of scattering rates on the strength of the projectile/target interaction (the “perturbation”)
is a measure of its quality. Such tests involve large-order expansions in the physical coupling parameters.
10 In contrast, low-order expansions in the external perturbation (e.g. experimentally applied electric or magnetic
fields) are usually secure; see Chapter 7.
Qualitative aspects

A principal question that we will need to address is under what physical conditions
interactions are “weak” (in comparison to the kinetic energy), i.e. when does a perturbative
approach to the interacting electron system make sense at all? To estimate the relative
magnitude of the two contributions to the energy, let us assume that each electron occupies
an average volume $r_0^3$. According to the uncertainty relation, the minimum kinetic energy
per particle will be of order $O(\hbar^2/m r_0^2)$. On the other hand, assuming that each particle
interacts predominantly with its nearest neighbors, the Coulomb energy is of order $O(e^2/r_0)$.
The ratio of the two energy scales defines the dimensionless density parameter (see figure)

$$r_s \equiv \frac{e^2}{r_0} \frac{m r_0^2}{\hbar^2} = \frac{r_0}{a_0},$$

where $a_0 = \hbar^2/e^2 m$ denotes the Bohr radius. 11 Physically, $r_s$ is the radius of the spherical
volume containing one electron on average; for the Coulomb interaction, the denser the
electron gas, the smaller $r_s$. We have thus identified the electron density as the relevant
parameter controlling the relative strength of electron-electron interactions.

Below, we will be concerned with the regime of high density, $r_s \ll 1$, or weak Coulomb
interaction. In the opposite limit, $r_s \gg 1$, properties become increasingly dominated by
electronic correlations. Ultimately, for sufficiently large $r_s$ (low density) it is believed
that the electron gas undergoes a (first order) transition to a condensed phase known as a
Wigner crystal. Although Wigner crystals have never been unambiguously observed, several
experiments performed on low-density electron gases are consistent with a Wigner crystal
ground state. Monte Carlo simulation suggests that Wigner crystallization may occur for
densities with $r_s > 35$ in the two-dimensional electron gas and $r_s > 106$ in three. 12
(Note that this scenario relies on being at low temperature,


Eugene P. Wigner 1902-95

Nobel Laureate in Physics in 1963 “for his contributions to the theory of the atomic
nucleus and the elementary particles, particularly through the discovery and application
of fundamental symmetry principles.” (Image © The Nobel Foundation.)



11 Notice that the estimate of the relative magnitude of energy scales mimics Bohr’s famous qualitative discussion 
of the average size of the hydrogen atom. 

12 With the earliest reference to electron crystallization appearing in E. Wigner, On the interaction of electrons 
in metals, Phys. Rev. 46 (1934), 1002-11, a discussion of the results of quantum Monte Carlo simulations in 
three-dimensions can be found in the following papers: D. M. Ceperley and B. J. Alder, Ground state of the 
electron gas by a stochastic method, Phys. Rev. Lett. 45 (1980), 566-569 and N. D. Drummond, Z. Radnai, J. 
R. Train, M. D. Towler, and R. J. Needs, Diffusion quantum Monte Carlo study of three-dimensional Wigner 
crystals, Phys. Rev. B 69 (2004), 085116, and in two-dimensions in B. Tanatar and D. M. Ceperley, Ground 
state of the two-dimensional electron gas, Phys. Rev. B 39 (1989) 5005-5016. For a further discussion of the 
subtleties of the transition in two-dimensions, we refer to B. Spivak and S. A. Kivelson, Phases intermediate 
between a two-dimensional electron liquid and Wigner crystal, Phys. Rev. B 70 (2004), 155114. 
Table 5.1 Density parameters of a number of metals.

Metal  $r_s$    Metal  $r_s$    Metal  $r_s$    Metal  $r_s$
Li     3.2      Be     1.9      Na     3.9      Al     2.1
K      4.9      Sn     2.2      Cu     2.7      Pb     2.3

Source: Data taken from Ashcroft and Mermin, Solid State Physics. a

a N. W. Ashcroft and N. D. Mermin, Solid State Physics (Holt-Saunders International, 1983).


and on the long-range nature of the Coulomb interaction. In particular, if the Coulomb
interaction is subject to some screening mechanism, $V(r) \sim e^{-r/\lambda}$, then $r_s \sim (r_0/a_0)\, e^{-r_0/\lambda}$, and
the influence of Coulomb interaction at low densities becomes diminished.) 
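
As a rough numerical illustration, the density parameter can be computed directly from a conduction electron density $n$ via $(4\pi/3)(r_s a_0)^3 = 1/n$. The sketch below is an addition for orientation only: the function name is hypothetical, and the three densities are values recalled from the Ashcroft and Mermin tables, so they should be treated as illustrative input rather than as data from the present text.

```python
import math

BOHR_RADIUS_M = 5.29177e-11  # Bohr radius a_0 in metres


def density_parameter(n_per_m3):
    """r_s = r_0 / a_0, where r_0 is the radius of the sphere holding one electron."""
    r0 = (3.0 / (4.0 * math.pi * n_per_m3)) ** (1.0 / 3.0)
    return r0 / BOHR_RADIUS_M


# Conduction electron densities in m^-3 (assumed illustrative values).
densities = {"Na": 2.65e28, "Cu": 8.47e28, "Al": 18.1e28}

for metal, n in densities.items():
    print(metal, round(density_parameter(n), 1))
```

With these inputs the sketch returns values close to the entries 3.9, 2.7, and 2.1 listed for Na, Cu, and Al in Table 5.1.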

For $r_s \sim O(1)$, the potential and kinetic energies are comparable. This regime of intermediate
coupling is notoriously difficult to describe quantitatively. Yet most metals lie in a
regime of intermediate coupling $2 < r_s < 6$ (see Table 5.1). Fortunately, there is overwhelming
evidence to suggest that a weak coupling description holds even well outside the regime
over which the microscopic theory can be justified. The phenomenology of the intermediate
coupling regime is the realm of Landau’s Fermi liquid theory. 13

The fundamental principle under- 
lying the Fermi liquid theory is 
one of “adiabatic continuity”: 14 in 
the absence of an electronic phase 
transition (such as Wigner crystal- 
lization), a non-interacting ground 
state evolves smoothly or adiabat- 
ically into the interacting ground 
state as the strength of interaction 
is increased. 15 An elementary exci- 
tation of the non-interacting system 
represents an “approximate excita- 
tion” of the interacting system (i.e. 
its “lifetime” is long). Excitations 
are quasi-particles (and quasi-holes) 
above a sharply defined Fermi surface. 


Lev D. Landau 1908-68 

Nobel Laureate in Physics in 1962 
“for his pioneering theories for 
condensed matter, especially liq- 
uid helium." Landau’s work covers 
all branches of theoretical physics, 
ranging from fluid mechanics to 
quantum field theory. A large por- 
tion of his papers refer to the theory of the con- 
densed state. They started in 1936 with a formu- 
lation of a general thermodynamical theory of the 
phase transitions of the second order. After P.L. 
Kapitsa's discovery, in 1938, of the superfluidity 
of liquid helium, Landau began extensive research 
which led him to the construction of the complete 
theory of the “quantum liquids" at very low temper- 
atures. (Image © The Nobel Foundation.) 



13 L. D. Landau, The theory of a Fermi liquid, Sov. Phys. JETP 3 (1956), 920.

14 P. W. Anderson, Basic Notions in Condensed Matter Physics (Benjamin, 1984).

15 As a simple non-interacting example, consider the adiabatic evolution of the bound
states of a quantum particle as the confining potential is changed from a box to a 
harmonic potential well (see figure). While the wavefunctions and energies evolve, 
the topological characteristics of the wavefunctions, i.e. the number of the nodes, 
and therefore the assignment of the corresponding quantum numbers, remain 
unchanged. 
The starting point of Fermi liquid theory is a few phenomenological assumptions, all
rooted in the adiabaticity principle. For example, it is postulated that the density of quasi- 
particles can be described in terms of a momentum-dependent density distribution n(p) 
which, in the limit of zero interaction, evolves into the familiar Fermi distribution. From 
this assumption (and a few more postulates) a broad spectrum of observables can be ana- 
lyzed without further “microscopic” calculation. Its remarkable success (as well as the few 
notorious failures) has made Landau Fermi liquid theory a powerful tool in the development 
of modern condensed matter physics but one which we are not going to explore in detail. 16 
Instead, motivated in part by the phenomenological success of the “adiabatic continuity,” 
we will continue with the development of a microscopic theory of the weakly interacting 
three-dimensional electron gas, with $r_s \ll 1$.


Perturbative approach 

The starting point of the perturbative analysis is the functional representation of the free
energy $F = -T \ln Z$ through the quantum partition function. (Here, as usual, we set $k_B = 1$.)
Expressed as a coherent state path integral, $Z = \int D(\bar\psi, \psi)\, e^{-S[\bar\psi, \psi]}$, where

$$S[\bar\psi, \psi] = \sum_p \bar\psi_{p\sigma} \left( -i\omega_n + \frac{\mathbf{p}^2}{2m} - \mu \right) \psi_{p\sigma} + \frac{T}{2L^3} \sum_{pp'q} \bar\psi_{p+q\sigma} \bar\psi_{p'-q\sigma'} V(\mathbf{q}) \psi_{p'\sigma'} \psi_{p\sigma}.$$

Here, for brevity, we have introduced the “4-momentum” $p = (\mathbf{p}, \omega_n)$ comprising both
frequency and momentum. 17 As with the Green function discussed in the previous section,
the free energy can be expanded in terms of an interaction parameter. To fix a reference
scale against which to compare the correlation energies, let us begin by computing the free
energy Eq. (4.41) of the non-interacting electron gas:

$$F^{(0)} = -T \sum_{\mathbf{p}\sigma} \ln\left( 1 + e^{-(\mathbf{p}^2/2m - \mu)/T} \right) \xrightarrow{T \to 0} \sum_{\mathbf{p}^2/2m < \mu,\, \sigma} \left( \frac{\mathbf{p}^2}{2m} - \mu \right) \simeq -\frac{2}{5} N \mu, \qquad (5.19)$$

where $\mu = p_F^2/2m$, $N = (2^{3/2}/3\pi^2)\, m^{3/2} L^3 \mu^{3/2}$ is the number of particles, and the last
estimate is obtained by replacing the sum over momenta by an integral. According to
Eq. (5.19), the average kinetic energy per particle is equal to 3/5 of the Fermi energy. To
relate this scale to the density parameter $r_s$, we choose to measure all energies in units of
the Rydberg energy (alias the ionization energy of hydrogen), $E_{\rm Ry} = me^4/2\hbar^2 = 13.6$ eV,

$$\frac{F^{(0)}}{E_{\rm Ry}} \sim -\frac{N}{r_s^2}. \qquad (5.20)$$


16 Interested readers are referred to one of several excellent reviews, e.g. P. W. Anderson, Basic Notions in 
Condensed Matter Physics (Benjamin, 1984). 

Notice that “4-momentum” p and the modulus of the “3-momentum”, p = |p|, are denoted by the same symbol 
(standard notation!). However, in its context, this should not cause confusion. 
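
The factors quoted for the free electron gas are easy to verify. The following sketch is an added check rather than part of the original derivation. It works in units $p_F = m = \hbar = 1$, drops the common prefactor $L^3/\pi^2$ produced by replacing $\sum_{\mathbf{p}\sigma}$ with an integral, and assumes that $r_0$ is defined through $(4\pi/3) r_0^3 = L^3/N$. All variable names are illustrative.

```python
import math
from fractions import Fraction


def radial_moment(power):
    """Integral of p**power over 0 <= p <= 1, i.e. momenta measured in units of p_F."""
    return Fraction(1, power + 1)


# Units p_F = m = 1, so mu = 1/2. The prefactor L^3 / pi^2 multiplies every line and is dropped.
mu = Fraction(1, 2)
particles = radial_moment(2)                 # N
kinetic = Fraction(1, 2) * radial_moment(4)  # sum of p^2 / 2m over the Fermi sphere
free_energy = kinetic - mu * particles       # sum of (p^2 / 2m - mu)

print(kinetic / (particles * mu))      # kinetic energy per particle in units of mu
print(free_energy / (particles * mu))  # F0 / (N mu)

# (4 pi / 3) r_0^3 = L^3 / N gives p_F r_0 = (9 pi / 4)^(1/3), hence mu / E_Ry = alpha^2 / r_s^2.
alpha = (9 * math.pi / 4) ** (1 / 3)
kinetic_per_particle_ry = 0.6 * alpha ** 2   # coefficient of 1 / r_s^2, in Rydberg
print(round(kinetic_per_particle_ry, 2))
```

The first two lines of output are 3/5 and -2/5, and the last is 2.21: under the stated assumptions the kinetic energy per particle is about $2.21/r_s^2$ in units of the Rydberg energy.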
If we turn now to the discussion of interactions, a formal expansion of $F$ to first order in
$V$ gives

$$F^{(1)} = \frac{T^2}{2L^3} \sum_{pp'q} \left\langle \bar\psi_{p+q\sigma} \bar\psi_{p'-q\sigma'} V(\mathbf{q}) \psi_{p'\sigma'} \psi_{p\sigma} \right\rangle_0, \qquad (5.21)$$


where $\langle \cdots \rangle_0$ denotes the functional average with respect to the non-interacting action. The
two 18 diagrams contributing to this expression are shown in the figure overleaf. To account 
for the specifics of the electron gas, we are using a diagrammatic code slightly different from 
that of the previous section: 


> The Coulomb interaction is represented by a wavy line labeled by the momentum argument $\mathbf{q}$.

> A contraction $\langle \psi_{p\sigma} \bar\psi_{p\sigma} \rangle_0$ is indicated by a solid arrow representing the free Green function
of the electron gas,

$$G_p = \frac{1}{-i\omega_n + \mathbf{p}^2/2m - \mu}, \qquad (5.22)$$

i.e. the inverse of the free action. The labeling of the contraction by an arrow (instead
of an undirected line as in $\phi^4$-theory) is motivated by two considerations: firstly, the
mnemonic has the effect of indicating that a contraction $\langle \psi_\eta \bar\psi_\lambda \rangle_0$ describing the creation
of an electron with quantum numbers $\lambda$, followed by the annihilation of an electron at $\eta$,
is a directed process; secondly, there are situations (e.g. when a magnetic field is present)
where $\langle \psi(\mathbf{r}) \bar\psi(\mathbf{r}') \rangle_0 \neq \langle \psi(\mathbf{r}') \bar\psi(\mathbf{r}) \rangle_0$.

> The sum of all 4-momenta emanating from an interaction vertex formed by a wavy line
and two electron field lines is equal to zero (exercise) - the “Kirchhoff law.”

> Finally, we have to be careful about sign factors arising when Grassmann variables are
interchanged. However, the anti-commutativity of the fields merely leads to an overall
factor $(-1)^{N_l}$, where $N_l$ is the number of loops of a diagram. (To verify this claim, one may
notice that a loop is formed by a “ringwise contraction” $\langle \bar\psi_1 \psi_2 \bar\psi_3 \cdots \psi_N \rangle_0$:
$(2 \to 3)(4 \to 5) \ldots ((N - 2) \to (N - 1))(N \to 1)$. The last contraction introduces the minus sign.)


First-order perturbation theory 

Turning to the discussion of the specific diagrams shown in the figure below, we notice
that the first, generally known as a Hartree contribution, vanishes. Technically, this is a
consequence of the fact that the interaction line connecting the two loops carries zero
momentum. However, as discussed in Section 2.2, $V(\mathbf{q} = 0) = 0$. Physically, the vanishing
of the Hartree contribution is a consequence of charge neutrality.


Douglas R. Hartree, 1897-1958 

Originally appointed as a professor of applied mathematics
he became one of the first “computational
physicists.” He developed methods for numerically
solving the Schrödinger equation, among them the
celebrated “Hartree approximation.”


18 Remember that, in a theory with complex or Grassmann fields, only contractions $\sim \langle \psi \bar\psi \rangle_0$ exist, i.e. there is a
total of $n!$ distinct contributions to a contraction $\langle \bar\psi \psi \cdots \psi \rangle_0$ of $2n$ field operators.
Indeed, the two Green function loops $G_p$ measure the local particle density of the
electron gas (cf. the discussion on page 171). Global charge neutrality requires that the 
electronic density cancels against the density of the ionic background. However, notice that 
this cancellation mechanism relies on our assumption of overall spatial homogeneity. 


Only in a spatially uniform system does the density of the 
electron gas locally compensate the positive counter-density. 
In realistic metals, the inevitable presence of impurities breaks 
translational invariance and there is no reason for the Hartree 
contribution to vanish. Indeed, the analysis of Hartree-type 
contributions to the correlation energy in disordered electronic 
media is a subject of ongoing research. 

While the Hartree term describes the classical interaction 
of charge densities through the Coulomb potential, the second 
diagram shown in the figure, known as a Fock contribution, 
is quantum. Translating the diagrammatic language back into 
Green functions, we obtain 




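$$F^{(1)} = -\frac{T^2}{L^3} \sum_{p, p'} G_p G_{p'} V(\mathbf{p} - \mathbf{p}') = -\frac{1}{L^3} \sum_{\mathbf{p}, \mathbf{p}'} n_F(\epsilon_{\mathbf{p}})\, n_F(\epsilon_{\mathbf{p}'}) \frac{e^2}{|\mathbf{p} - \mathbf{p}'|^2}$$

$$\xrightarrow{T \to 0} -\frac{1}{L^3} \sum_{\epsilon_{\mathbf{p}}, \epsilon_{\mathbf{p}'} < \mu} \frac{e^2}{|\mathbf{p} - \mathbf{p}'|^2} = -\frac{e^2}{(2\pi)^4} L^3 p_F^4. \qquad (5.23)$$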


Here, the sign factor in the first equality is due to the odd number of fermion loops, and the
evaluation of the last sum, which can be found, for example, in the text by Kittel, 19 is left
as an exercise. (Up to numerical factors, the result follows from dimensional considerations.
We are integrating the inverse square of the distance in momentum space ([momentum]$^{-2}$)
over two Fermi spheres ([momentum]$^6$). Since the integral is convergent at low momenta, it
must scale as the fourth power of the upper cutoff $\sim p_F$.)

Division by the Rydberg energy leads to the scaling

$$\frac{F^{(1)}}{E_{\rm Ry}} = \text{const.} \cdot \frac{N}{r_s}, \qquad (5.24)$$

where the constant is of order unity. This result conforms with our previous estimate
of the density dependence of correlation energies.


19 C. Kittel, Quantum Theory of Solids (Wiley, 1963).
INFO A closer analysis shows that other predictions following from Eq. (5.23) are less sensible.
To understand what goes wrong, let us rewrite the sum of $F^{(0)}$ and $F^{(1)}$ as


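$$F^{(0)} + F^{(1)} = \sum_{|\mathbf{p}| < p_F} \left[ 2 \left( \frac{\mathbf{p}^2}{2m} - \mu \right) - \frac{1}{L^3} \sum_{|\mathbf{p}'| < p_F} \frac{e^2}{|\mathbf{p} - \mathbf{p}'|^2} \right],$$

where the factor of 2 accounts for the spin sum in $F^{(0)}$. This way of writing suggests that the
interaction of an electron of momentum $\mathbf{p}$ with all partner
electrons of momentum $\mathbf{p}'$ leads to a reduction 20 of the single particle energy by an amount
$L^{-3} \sum_{|\mathbf{p}'| < p_F} e^2/|\mathbf{p} - \mathbf{p}'|^2$.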

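Evaluation of the internal sum (exercise) leads to 21

$$F^{(0)} + F^{(1)} = \sum_{|\mathbf{p}| < p_F} \left[ 2 \left( \frac{\mathbf{p}^2}{2m} - \mu \right) - \frac{e^2 p_F}{4\pi^2}\, \mathcal{F}(|\mathbf{p}|/p_F) \right],$$

where the function $\mathcal{F}(x) = 1 + \frac{1 - x^2}{2x} \ln\left| \frac{1 + x}{1 - x} \right|$ is shown in the figure. The most important
characteristic of the scaling function $\mathcal{F}$ is a logarithmically diverging derivative at $x = 1$ (exercise). To
appreciate the consequences of this feature, let us consider the single particle density of states
$\rho(\epsilon) = \sum_{\mathbf{p}\sigma} \delta(\epsilon - \epsilon_{\mathbf{p}})$, where $\epsilon_{\mathbf{p}} = \frac{\mathbf{p}^2}{2m} - \frac{e^2 p_F}{4\pi^2} \mathcal{F}(p/p_F)$ is the first-order Fock single-particle energy.
Approximating the sum by an integral, we obtain

$$\rho(\epsilon) = \frac{L^3}{\pi^2}\, p^2 \left( \frac{\partial \epsilon_{\mathbf{p}}}{\partial p} \right)^{-1},$$
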
where $p$ is defined through $\epsilon_{\mathbf{p}} = \epsilon$. The singularity of the derivative of $\mathcal{F}(p/p_F)$ at $p = p_F$
implies that the density of states is predicted to vanish as $\epsilon$ approaches the Fermi energy, clearly
a nonsensical result. 22 The origin of this pathological divergence can be traced back to the
long-range ($|\mathbf{r}|^{-1} \leftrightarrow |\mathbf{q}|^{-2}$) decay of the Coulomb potential (cf. Eq. (5.21)). Yet we know that, in
systems with mobile charge carriers, electrostatic forces are screened, i.e. they decay exponen- 
tially. Microscopically, screening is due to the polarization of a medium in response to an applied 
electric field. This by itself is an interaction effect, albeit one of “higher order” in the Coulomb 
interaction, i.e. if we ever want to observe signatures of the screening mechanism, we need to 
advance to higher orders in the expansion in V. 
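
As a consistency check between the single-particle picture and Eq. (5.23), one can integrate the Fock shift over the Fermi sphere numerically. The sketch below is an added illustration. It assumes the closed form $L^{-3} \sum_{|\mathbf{p}'| < p_F} e^2/|\mathbf{p} - \mathbf{p}'|^2 = (e^2 p_F/4\pi^2)\, \mathcal{F}(|\mathbf{p}|/p_F)$ with $\mathcal{F}(x) = 1 + \frac{1 - x^2}{2x} \ln\left|\frac{1 + x}{1 - x}\right|$, uses units $e = p_F = L = 1$, and relies on a plain midpoint rule; the function names are hypothetical.

```python
import math


def fock_scaling(x):
    """Assumed scaling function F(x) = 1 + (1 - x^2) / (2x) * ln|(1 + x) / (1 - x)|."""
    return 1.0 + (1.0 - x * x) / (2.0 * x) * math.log(abs((1.0 + x) / (1.0 - x)))


def midpoint(f, a, b, n=20000):
    """Midpoint rule; it never evaluates f at the end points x = 0 and x = 1."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


# sum_p -> L^3 int d^3p / (2 pi)^3 = L^3 p_F^3 (4 pi / (2 pi)^3) int_0^1 x^2 dx ...
radial = midpoint(lambda x: x * x * fock_scaling(x), 0.0, 1.0)

# F1 / (e^2 L^3 p_F^4) = -(4 pi / (2 pi)^3) * (1 / (4 pi^2)) * radial
coefficient = -(4 * math.pi / (2 * math.pi) ** 3) * radial / (4 * math.pi ** 2)

print(radial)
print(coefficient, -1 / (2 * math.pi) ** 4)
```

The radial integral evaluates to 1/2 within numerical accuracy, so the summed single-particle shift reproduces the coefficient $-1/(2\pi)^4$ of Eq. (5.23).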


Second-order perturbation theory 
Let us then consider the second-order contribution 

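$$F^{(2)} = -\frac{T}{2!} \left( \frac{T}{2L^3} \right)^2 \left\langle \Big( \sum_{pp'q} \bar\psi_{p+q\,\sigma} \bar\psi_{p'-q\,\sigma'} V(\mathbf{q}) \psi_{p'\sigma'} \psi_{p\sigma} \Big)^2 \right\rangle_0^{c},$$

where the superscript $c$ indicates that only connected diagrams contribute.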


$^{20}$ Why a reduction? Charge neutrality implies that the classical correlation energy vanishes. However, being
fermionic in nature, electrons tend to avoid each other even more than classical particles, i.e. quantum statistics
leads to a lowering of the correlation energy below the classical value of zero.

$^{21}$ See, e.g., J. Hubbard, The description of collective motion in terms of many-body perturbation theory, Proc.
Roy. Soc. (London) A243 (1957), 336.

$^{22}$ The majority of transport observables in metals are proportional to the density of states at the Fermi energy,
i.e. $\rho(\mu) = 0$ would entail the vanishing of almost all transport coefficients!

EXERCISE Using the field integral representation of $Z$, show that second-order expansion of
$F = -T \ln Z$ in the interaction operator $S_{\rm int}$ leads to $F^{(2)} = -(T/2) \left[ \langle S_{\rm int}^2 \rangle_0 - \langle S_{\rm int} \rangle_0^2 \right]$.

Convince yourself that the second term cancels disconnected diagrams. Apply arguments similar
to those involved in the proof of the linked cluster theorem to verify that the cancellation of
disconnected graphs pertains to all orders in the expansion of $F$. That is, the free energy can be
obtained by expanding the partition function $Z$ (not its logarithm) and keeping only connected
diagrams; dropping disconnected contributions is equivalent to taking the logarithm.

Connected contraction of the eight field operators leads to four distinct types of diagram
(exercise) of which two are of Hartree type (i.e. contain a zero-momentum interaction line
$V(\mathbf{q} = 0)$). The non-vanishing diagrams $F^{(2),1}$ and $F^{(2),2}$ are shown in the figure below.$^{23}$
Translating these diagrams to momentum summations over Green functions, one obtains
(exercise)

$$F^{(2),1} = -\frac{T^3}{L^6} \sum_{p_1,p_2,q} G_{p_1} G_{p_1+q} G_{p_2} G_{p_2+q} V(\mathbf{q})^2,$$

$$F^{(2),2} = \frac{T^3}{2L^6} \sum_{p,q_1,q_2} G_p G_{p+q_1} G_{p+q_1+q_2} G_{p+q_2} V(\mathbf{q}_1) V(\mathbf{q}_2). \qquad (5.25)$$


While, at first sight, these expressions do not look very illuminating, closer inspection reveals
some structure. Reflecting the fact that electronic transport in solids is carried by excitations
at the Fermi energy, the electron Green function (5.22) assumes large values for momenta
$|\mathbf{p}| \simeq p_F$. This implies that only configurations where all momentum arguments carried by the
Green functions are close to the Fermi surface contribute significantly to the sums (5.25).
Considering the first sum, we see that, for small $|\mathbf{q}|$ and $|\mathbf{p}_i| \simeq p_F$, this condition is met,
i.e. there are two unbound summations over momentum shells around the Fermi surface.
However, with the second sum, the situation is less favourable. For fixed $|\mathbf{p}| \simeq p_F$, fine-tuning
of both $\mathbf{q}_1$ and $\mathbf{q}_2$ is necessary to bring all momenta close to $p_F$, i.e. effectively one
momentum summation is frozen out. There is no need to enter into detailed calculations to
predict that, due to the relatively larger “phase volume,” $|F^{(2),1}/F^{(2),2}| \gg 1$, where the ratio
will be proportional to the area of the Fermi surface which, in turn, is proportional to the
density of the electron gas. For large densities, the second Fock diagram can be neglected
in comparison with the first.


$^{23}$ In fact, one more non-vanishing contribution is obtained by drawing a single “ring” of Green functions
containing two non-crossing interaction lines. This diagram, and its obvious generalization to higher order processes
containing sequential “self-interactions” of a single Green function, do not play an essential role in the present
context. The reasons why will become clear in section 5.3 when we introduce the notion of self-energies.

Figure 5.6 Diagram of second-order Fock contributions to the free energy of the electron gas.
(a) Interaction process involving a single electron-hole pair. An electron is virtually excited
above the Fermi energy, and subsequently recombines with a hole state left behind by a second
electron excitation which then jumps into the hole created by the initial excitation. (b) Virtual
excitation of independent electron-hole pairs.


Of course, there must be a ‘more physical’ way of understanding this observation. The Green
function lines in the diagrams describe the propagation of quasi-particles and quasi-holes$^{24}$ on
the background of the interacting medium. Now, the diagram $F^{(2),2}$ contains a simply connected
propagator line: a single particle-hole excitation undergoes a second-order interaction process
with itself (see Fig. 5.6(a)). By contrast, the first diagram $F^{(2),1}$ involves two independent
electron-hole excitations, as shown in Fig. 5.6(b). Since, in a dense electron gas, a second-order
interaction process will more likely involve different particles, this type of contribution is more
important. Notice that the process shown in Fig. 5.6(b) can be interpreted as a “polarization” of
the medium due to the excitation of electron-hole pairs.



Higher orders in perturbation theory 

The picture above readily generalizes to interaction processes of higher order. In the high-density
limit, dominant contributions to the free energy should contain one free integration over the
Fermi momentum per interaction process. A moment’s thought shows that only diagrams of
“ring graph” structure (see the figure) meet this condition. Expanding the free energy functional to
$n$th order in the interaction operator, and retaining only diagrams of that structure, one obtains

$$F^{(n)}_{\rm RPA} = -\frac{T}{2n} \sum_q \left( \frac{2T}{L^3} V(\mathbf{q}) \sum_p G_p G_{p+q} \right)^n. \qquad (5.26)$$

(To understand the origin of the multiplicative factor $1/n$, notice that $F^{(n)}_{\rm RPA}$ results from the
connected contraction of an operator $\sim (n!)^{-1} \langle (S_{\rm int}[\bar\psi, \psi])^n \rangle_0$. There are $(n-1)!$ different
ways of arranging the interaction operators $S_{\rm int}$ in a ring-shaped structure, i.e. the diagram
carries a global factor $(n-1)!/n! = 1/n$.)

$^{24}$ In principle, the system consists of physical electrons immersed in a globally positive background. However
(cf. the discussion of Section 2.2), keeping in mind that at low temperatures dynamical processes take place in
the immediate vicinity of the Fermi surface, a more problem-oriented way of thinking about states is in terms
of quasi-particles and quasi-holes, i.e. electronic states immediately above and below the Fermi surface.

In Eq. (5.26) the subscript “RPA” stands for random phase approximation.$^{25}$ However,
more important than the designation are the facts: (a) we have managed to identify
a particularly relevant subclass of diagrams contributing to the plethora of interaction processes;
(b) there is a physical parameter controlling the dominance of these diagrams; and
(c) we are apparently able to sum up the entire series of $n$th-order RPA-interaction contributions.
Indeed, summation over $n$ leads to the simple result

$$F_{\rm RPA} \equiv \sum_n F^{(n)}_{\rm RPA} = \frac{T}{2} \sum_q \ln\left( 1 - V(\mathbf{q}) \Pi_q \right), \qquad (5.27)$$

where we have introduced the polarization operator,$^{26}$

$$\Pi_q \equiv \frac{2T}{L^3} \sum_p G_p G_{p+q}. \qquad (5.28)$$
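The resummation behind Eq. (5.27) is the Taylor series of the logarithm, $-\sum_n x^n/n = \ln(1-x)$, applied mode by mode with $x = V(\mathbf{q})\Pi_q$. A minimal numerical sketch follows, with energies measured in units of $T$ and a single mode $q$ considered; the function names are illustrative, not part of the text.

```python
import math


def rpa_partial_sum(x: float, n_max: int) -> float:
    """Sum over n = 1..n_max of -(1/(2n)) * x**n, where x stands for V(q) * Pi_q."""
    return sum(-(x ** n) / (2.0 * n) for n in range(1, n_max + 1))


def rpa_closed_form(x: float) -> float:
    """Resummed ring series (1/2) * ln(1 - x), defined for x < 1."""
    return 0.5 * math.log(1.0 - x)


if __name__ == "__main__":
    for x in (-0.5, 0.3):
        print(x, rpa_partial_sum(x, 200), rpa_closed_form(x))
    # For |x| > 1 the order-by-order series diverges, the logarithm does not.
    print(rpa_partial_sum(-3.0, 20), rpa_closed_form(-3.0))
```

The last line illustrates why the infinite-order summation matters: for a repulsive interaction with $|V\Pi| > 1$ (small momenta) every finite-order truncation is useless, whereas the resummed expression remains finite.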


Equation (5.27) represents our first example of an infinite-order expansion. However, before
turning to the discussion of further aspects of infinite-order perturbation theory, let us stay
for a moment with the RPA free energy. Specifically, let us investigate whether the RPA
repairs the pathology in the Fermi surface density of states observed above.

The last unknown we need to compute before turning to the discussion of the RPA free
energy is the polarization operator. Drawing on the frequency summation in Problem 4.5,
the polarization operator can be written explicitly as

$$\Pi_{\mathbf{q},\omega_m} = \frac{2T}{L^3} \sum_{\mathbf{p},\omega_n} \frac{1}{-i\omega_n + \xi_{\mathbf{p}}} \, \frac{1}{-i\omega_{n+m} + \xi_{\mathbf{p}+\mathbf{q}}} = \frac{2}{L^3} \sum_{\mathbf{p}} \frac{n_F(\epsilon_{\mathbf{p}+\mathbf{q}}) - n_F(\epsilon_{\mathbf{p}})}{i\omega_m + \xi_{\mathbf{p}+\mathbf{q}} - \xi_{\mathbf{p}}}. \qquad (5.29)$$

The evaluation of the momentum sum is straightforward if a bit tedious. As a result of the
calculation outlined in the Info block below, one obtains

$$\Pi_{\mathbf{q},\omega_m} = -\nu_0 \left[ 1 - \frac{i\omega_m}{2 v_F q} \ln\left( \frac{i\omega_m + v_F q}{i\omega_m - v_F q} \right) \right], \qquad (5.30)$$

where

$$\nu_0 = 2 \int \frac{d^3p}{(2\pi)^3} \, \delta(\epsilon_{\mathbf{p}} - \mu) = \frac{m p_F}{\pi^2} \qquad (5.31)$$


is the density of states per volume of non-interacting electrons at the Fermi surface.

$^{25}$ The attribute “random phase” seems to allude to the fact that the quantum mechanical phase carried by the
particle-hole excitations stirred up by interactions gets lost after each elementary polarization process. This
contrasts with more-generic contributions to $F$ where quantum phases may survive more complex interaction
processes. Also notice that more than one approximation scheme in statistical physics has been dubbed “random
phase.”

$^{26}$ The definition (5.28) applies to the specific case of a three-dimensional translationally invariant system. More
generally, the polarization operator is defined as the frequency/momentum Fourier transform of the connected
average $\langle \bar\psi(\mathbf{x},\tau) \psi(\mathbf{x},\tau) \, \bar\psi(\mathbf{x}',\tau') \psi(\mathbf{x}',\tau') \rangle_c$.


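INFO For an isotropic three-dimensional system at zero temperature, the polarization operator
(5.29) becomes a function of $|\mathbf{q}|$ and $\omega_m$, known as the Lindhard function. The evaluation
of the momentum sum is greatly simplified by noting that, due to the difference of the Fermi
functions, only momenta with $|\mathbf{p}| \simeq p_F$ significantly contribute. Assuming that $|\mathbf{q}| \ll p_F$, we may
thus effect the linearization $\xi_{\mathbf{p}+\mathbf{q}} - \xi_{\mathbf{p}} = (1/m) \mathbf{p} \cdot \mathbf{q} + O(q^2)$, where the term of $O(q^2)$ is negligibly
small. Similarly, $n_F(\epsilon_{\mathbf{p}+\mathbf{q}}) - n_F(\epsilon_{\mathbf{p}}) \simeq \partial_\epsilon n_F(\epsilon_{\mathbf{p}}) (1/m) \mathbf{p} \cdot \mathbf{q} \simeq -\delta(\epsilon_{\mathbf{p}} - \mu) (1/m) \mathbf{p} \cdot \mathbf{q}$, where, in the
zero-temperature limit, the last equality becomes exact. Converting the momentum sum into an
integral, we thus obtain

$$\Pi_{\mathbf{q},\omega_m} = -2 \int \frac{d^3p}{(2\pi)^3} \, \delta(\epsilon_{\mathbf{p}} - \mu) \, \frac{\frac{1}{m} \mathbf{p} \cdot \mathbf{q}}{i\omega_m + \frac{1}{m} \mathbf{p} \cdot \mathbf{q}}.$$

Setting $1/(2\pi)^3 \int d^3p = 1/(2\pi)^3 \int dp \, p^2 \int d\Omega$, where $\int d\Omega$ is the integral over the
three-dimensional unit sphere and $\mathbf{p} = p\mathbf{n}$, with $|\mathbf{n}| = 1$, the integral can be evaluated,

$$\Pi_{\mathbf{q},\omega_m} = -\frac{2}{(2\pi)^3} \int dp \, p^2 \int d\Omega \, \delta(\epsilon_{\mathbf{p}} - \mu) \, \frac{v_F \mathbf{n} \cdot \mathbf{q}}{i\omega_m + v_F \mathbf{n} \cdot \mathbf{q}} = -\frac{\nu_0}{4\pi} \int d\Omega \, \frac{v_F \mathbf{n} \cdot \mathbf{q}}{i\omega_m + v_F \mathbf{n} \cdot \mathbf{q}}$$

$$= -\frac{\nu_0}{2} \int_{-1}^{1} dx \, \frac{v_F x q}{i\omega_m + v_F x q} = -\nu_0 \left[ 1 - \frac{i\omega_m}{2 v_F q} \ln\left( \frac{i\omega_m + v_F q}{i\omega_m - v_F q} \right) \right].$$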
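The last step of this calculation, the angular integral, can be checked numerically. The sketch below compares a simple midpoint-rule evaluation of $-(\nu_0/2)\int_{-1}^{1} dx\, v_F q x/(i\omega_m + v_F q x)$ with the closed form (5.30), taking the principal branch of the complex logarithm. Units with $\nu_0 = 1$ and all function names are assumptions made for illustration.

```python
import cmath
import math


def lindhard_closed(omega_m: float, vf_q: float, nu0: float = 1.0) -> complex:
    """Closed form (5.30): -nu0 * [1 - (i w / (2 vF q)) * ln((i w + vF q) / (i w - vF q))]."""
    iw = 1j * omega_m
    return -nu0 * (1.0 - iw / (2.0 * vf_q) * cmath.log((iw + vf_q) / (iw - vf_q)))


def lindhard_angular(omega_m: float, vf_q: float, nu0: float = 1.0, steps: int = 20000) -> complex:
    """Midpoint-rule estimate of -(nu0 / 2) * int_{-1}^{1} dx  vF q x / (i w + vF q x)."""
    h = 2.0 / steps
    total = 0.0 + 0.0j
    for k in range(steps):
        x = -1.0 + (k + 0.5) * h
        total += vf_q * x / (1j * omega_m + vf_q * x)
    return -0.5 * nu0 * total * h


if __name__ == "__main__":
    w, a = 0.7, 1.0  # Matsubara frequency and vF * q in arbitrary common units
    print(lindhard_closed(w, a), lindhard_angular(w, a))
    # For real omega_m the result is real: -nu0 * [1 - (w / a) * atan(a / w)].
    print(-(1.0 - (w / a) * math.atan(a / w)))
```

On the imaginary frequency axis the function is real and interpolates between $-\nu_0$ in the static limit and $-\nu_0 (v_F q)^2 / 3\omega_m^2$ at high frequencies.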


To obtain the free energy, we need to substitute the Lindhard function into Eq. (5.27) and sum
over $q = (\mathbf{q}, \omega_m)$. Yet, due to the complicated dependence of the function $\Pi_{\mathbf{q},\omega_m}$ on its
arguments, this final part of the calculation is far from trivial. An asymptotic expression
for the high-density expansion of the RPA approximation to $F$ was first obtained in a famous
study by Gell-Mann and Brueckner.$^{27}$ Measuring in atomic units, they obtained the free energy per
particle of

$$\frac{F_{\rm RPA}}{N} = \left( -0.142 + 0.0622 \ln r_s \right) {\rm Ry}. \qquad (5.32)$$

For a derivation of this result we refer the interested reader to the literature. (In the late 
1950s, motivated largely by applications to the physics of heavy nuclei, the perturbative 
calculation of ground state energies had become an industry. A comprehensive discussion of 


Murray Gell-Mann 1929- 

Nobel Laureate in Physics in 1969 
“for his contributions and discov- 
eries concerning the classification 
of elementary particles and their 
interactions.” Among his many 
accomplishments, he formulated 
the quark model of hadronic res- 
onances, and identified the SU(3) flavor symme- 
try of the light quarks, extending isospin to include 
strangeness, which he also discovered. (Image © 
The Nobel Foundation.) 



$^{27}$ M. Gell-Mann and K. Brueckner, Correlation energy of an electron gas at high density, Phys. Rev. 106 (1957),
364-8. To explore this and related literature, it is useful to refer to D. Pines, The Many-Body Problem
(Benjamin, 1962) where the important reprints are collected.

the subject, and details of the asymptotic $r_s$-expansion of $F_{\rm RPA}$ can be found, for example,
in Mahan.$^{28}$) When added to the kinetic energy Eq. (5.20) and the first-order correlation
energy Eq. (5.24), the structure of the density expansion of the free energy becomes clear:
the sum over all RPA diagrams yields the coefficient of $O(r_s^0)$ in the expansion in $r_s$.$^{29}$

More important for our present discussion is the conceptual meaning of the RPA, notably
the connection to screening. To this end, let us temporarily consider the expectation value
of the particle number $N = -\partial_\mu F$, rather than the free energy itself. Specifically, we wish
to compare the first-order correction (to the non-interacting result) $N^{(1)} = -\partial_\mu F^{(1)}$ to the
RPA, $N_{\rm RPA} = -\partial_\mu F_{\rm RPA}$. Noting that $\partial_\mu G_p = -(G_p)^2$ we readily find (cf. Eq. (5.23))

$$N^{(1)} = -\frac{2T^2}{L^3} \sum_{p,q} G_{p+q} (G_p)^2 V(\mathbf{q}).$$

After a second differentiation, $\rho^{(1)} = \partial_\mu N^{(1)}$, this expression would lead to the pathological
density of states, a consequence of the unscreened interaction line. (The diagrammatic
visualization of $N^{(1)}$ is shown in Fig. 5.7(a).) Now, consider the $\mu$-derivative of $F_{\rm RPA}$,
Eq. (5.27):

$$N_{\rm RPA} = \frac{T}{2} \sum_q \frac{V(\mathbf{q}) \, \partial_\mu \Pi_q}{1 - V(\mathbf{q}) \Pi_q} = -\frac{2T^2}{L^3} \sum_q \frac{V(\mathbf{q})}{1 - V(\mathbf{q}) \Pi_q} \sum_p G_{p+q} (G_p)^2 = -\frac{2T^2}{L^3} \sum_{p,q} V_{\rm eff}(q) \, G_{p+q} (G_p)^2, \qquad (5.33)$$

where we have defined the “effective” interaction

$$V_{\rm eff}(q) \equiv \frac{V(\mathbf{q})}{\epsilon(q)}, \qquad (5.34)$$

with the generalized dielectric function

$$\epsilon(q) \equiv 1 - V(\mathbf{q}) \Pi_q. \qquad (5.35)$$


Structurally, the expression for $N_{\rm RPA}$ resembles the first-order expression $N^{(1)}$, only with
the “bare” Coulomb interaction replaced by the effective interaction $V_{\rm eff}$. From its definition,
it is clear that $V_{\rm eff}$ represents a geometric series over polarization bubbles, augmented by
bare interaction lines. This is visualized in Fig. 5.7(b), where the fat line is defined in the
bottom part of the figure. In fact, we do not need to stare at the analytical expression
(5.33) to understand its origin. The $\mu$-differentiation acting on $F_{\rm RPA}$ may pick any of the
$n$ rings contributing to $F^{(n)}_{\rm RPA}$ in Eq. (5.26). The “differentiated ring” becomes the bubble
in Fig. 5.7, while all other rings conspire to form the $(n-1)$th-order contribution to the
effective interaction line.
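The geometric series over polarization bubbles can be made explicit for a single mode $q$. The sketch below treats $V$ and $\Pi$ as plain numbers (with $\Pi < 0$, as in the static limit); the function names and the sample values are illustrative assumptions.

```python
def bubble_series(v: float, pol: float, order: int) -> float:
    """Partial sum V * sum_{k=0}^{order} (V * Pi)^k of the chain of polarization bubbles."""
    return v * sum((v * pol) ** k for k in range(order + 1))


def effective_interaction(v: float, pol: float) -> float:
    """Resummed chain V / (1 - V * Pi), i.e. V divided by the dielectric function."""
    return v / (1.0 - v * pol)


if __name__ == "__main__":
    # Weak bare interaction (large momentum): the chain converges to V / epsilon.
    print(bubble_series(0.5, -1.0, 60), effective_interaction(0.5, -1.0))
    # Strong bare interaction (small momentum): every truncation fails, the resummed form is finite.
    print(bubble_series(5.0, -1.0, 20), effective_interaction(5.0, -1.0))
```

For the Coulomb interaction $V \propto 1/q^2$ the second case is the relevant one at small $q$, which is why the screened interaction cannot be obtained at any finite order of perturbation theory.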


$^{28}$ G. Mahan, Many Particle Physics (Plenum Press, 1981).

$^{29}$ Here we follow a convention (used mostly in the older literature) where the RPA starts from the second-order
ring diagram $F^{(2),1}$. However, henceforth we will refer to the “RPA” as the sum over all ring diagrams, including
the first, $F^{(1)}$.

Figure 5.7 Diagrammatic visualization of the expectation value of the particle number. (a) First-order
Fock correction and (b) the RPA approximation, where the definition of the RPA interaction line
is shown in the bottom part of the figure.


At this stage, the connection between RPA and the collective electromagnetic response
of the charged system becomes discernible. We remember that the electric field $\mathbf{E}$ in a
medium is related to the vacuum field $\mathbf{D}$ through the relation $\mathbf{D}(\mathbf{q}, \omega) = \epsilon(\mathbf{q}, \omega) \mathbf{E}(\mathbf{q}, \omega)$,
where the dielectric function $\epsilon(\mathbf{q}, \omega) = 1 + 4\pi\chi(\mathbf{q}, \omega)$ is determined by the electromagnetic
susceptibility, i.e. a function that measures the tendency of the medium to “respond”
or adjust to an external electromagnetic perturbation. Identifying $\mathbf{E}$ ($\mathbf{D}$) with the gradient
of the “dressed” potential $V_{\rm eff}$ (the “bare” potential $V$), we conclude that, on the level of
the microscopic theory, $4\pi\chi(\mathbf{q}, \omega) = -V(\mathbf{q}) \Pi_{\mathbf{q},\omega}$, i.e. the susceptibility is proportional to
the polarization operator $\Pi_q$. These connections motivate the introduction of the dielectric
function as in Eq. (5.35) above.

The full complexity of the polarization of the dense homogeneous electron gas is encoded
by the Lindhard function (5.30). However, there are some interesting limiting cases, where
the situation simplifies. Inspection of Eq. (5.30) shows that the Lindhard function depends
on the dimensionless ratio between two characteristic length scales: the “wavelength” $q^{-1}$,
and the distance $v_F/\omega$ an excitation propagating with Fermi velocity travels during the time
$\omega^{-1}$. For small frequencies, $v_F/\omega \gg q^{-1}$, the electron gas has enough time to optimally
adjust to the spatial variation $\sim q^{-1}$ of the potential, i.e. to screen out electro-neutrality
violating potential fluctuations. In this static limit, we may expand $\Pi_{\mathbf{q},\omega_m} = -\nu_0 + O(\omega/v_F q)$.
In this case, the electron gas interacts through the effective potential,

$$V_{\rm eff}(\mathbf{q}) \simeq \frac{1}{V(\mathbf{q})^{-1} + \nu_0} = \frac{4\pi e^2}{q^2 + 4\pi e^2 \nu_0} \equiv \frac{4\pi e^2}{q^2 + \lambda^{-2}},$$

where the constant $\lambda \equiv (4\pi e^2 \nu_0)^{-1/2}$ is known as the Thomas-Fermi screening length.
Indeed, it is straightforward to verify that the inverse Fourier transform leads to an effective
interaction potential

$$V_{\rm eff}(\mathbf{r}) = e^2 \, \frac{e^{-|\mathbf{r}|/\lambda}}{|\mathbf{r}|}, \qquad (5.36)$$

i.e. the potential is exponentially suppressed, or screened, on length scales $|\mathbf{r}| > \lambda$.




EXERCISE To verify Eq. (5.36), introduce polar coordinates $\mathbf{q} = (q, \theta, \phi)$ in $\mathbf{q}$-space and perform
first the elementary integrations over $\phi$ and $\theta$. It is then straightforward to complete the
remaining $q$-integration over the radial coordinate by analogy to the contour integral (5.13).
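A numerical cross-check of this transform pair is easiest in the reverse direction, because the radial integral of the exponentially decaying real-space potential converges rapidly. The sketch below transforms $e^2 e^{-r/\lambda}/r$ to momentum space and compares with $4\pi e^2/(q^2 + \lambda^{-2})$; units with $e^2 = 1$, the radial cutoff, and the function names are assumptions.

```python
import math


def yukawa_transform(q: float, lam: float, e2: float = 1.0,
                     r_max_in_lam: float = 60.0, steps: int = 200000) -> float:
    """Numerical 3D Fourier transform of e2 * exp(-r / lam) / r at wave number q.

    After the angular integrations the transform reduces to
    (4 pi e2 / q) * int_0^inf dr sin(q r) exp(-r / lam); the radial integral is
    cut off at r_max_in_lam * lam and evaluated with the midpoint rule.
    """
    r_max = r_max_in_lam * lam
    h = r_max / steps
    total = 0.0
    for k in range(steps):
        r = (k + 0.5) * h
        total += math.sin(q * r) * math.exp(-r / lam)
    return 4.0 * math.pi * e2 / q * total * h


def thomas_fermi_potential(q: float, lam: float, e2: float = 1.0) -> float:
    """Screened interaction in momentum space, 4 pi e2 / (q^2 + lam^-2)."""
    return 4.0 * math.pi * e2 / (q * q + lam ** -2)


if __name__ == "__main__":
    for q in (0.3, 1.0, 2.5):
        print(q, yukawa_transform(q, 1.0), thomas_fermi_potential(q, 1.0))
```

Note that the screened potential stays finite as $q \to 0$, in contrast to the bare $4\pi e^2/q^2$.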


INFO Let us briefly recapitulate the heuristic interpretation of Thomas-Fermi screening.
Imagine a test charge $e$ has been immersed in an electron gas. The host system will respond to
this perturbation by a local distortion of its density. To compute the distortion, we note that
the effective potential $V_{\rm eff}(\mathbf{r})$ created by the test charge changes the electronic energy levels
according to $\epsilon_{\mathbf{p}} \to \epsilon_{\mathbf{p}} - V_{\rm eff}$. (Here, we assume that the external perturbation changes so slowly
that the simultaneous usage of momentum and coordinate quantum numbers is not in conflict
with the uncertainty relation.) The induced charge density thus reads as

$$\rho_{\rm ind}(\mathbf{r}) = -2e \int \frac{d^3p}{(2\pi)^3} \left[ n_F(\epsilon_{\mathbf{p}} - V_{\rm eff}(\mathbf{r})) - n_F(\epsilon_{\mathbf{p}}) \right] \approx 2e V_{\rm eff}(\mathbf{r}) \int \frac{d^3p}{(2\pi)^3} \, \partial_\epsilon n_F(\epsilon_{\mathbf{p}})$$

$$\stackrel{T \to 0}{\simeq} -e V_{\rm eff}(\mathbf{r}) \int d\epsilon \left[ 2 \int \frac{d^3p}{(2\pi)^3} \, \delta(\epsilon - \epsilon_{\mathbf{p}}) \right] \delta(\epsilon - \mu) = -e V_{\rm eff}(\mathbf{r}) \nu_0,$$

where in the second equality we assumed weakness of the induced potential and in the third
equality used the low-temperature approximation $\partial_\epsilon n_F(\epsilon) \simeq -\delta(\epsilon - \mu)$. A substitution of this
result into the Fourier transform of the Poisson equation
$\nabla^2 V_{\rm eff}(\mathbf{r}) = -4\pi e \rho(\mathbf{r}) = -4\pi e (e\delta(\mathbf{r}) + \rho_{\rm ind}(\mathbf{r}))$ gives

$$V_{\rm eff}(\mathbf{q}) = \frac{4\pi e^2}{q^2 + 4\pi e^2 \nu_0}.$$


We also note that Thomas-Fermi screening repairs our problem with the density of states.
The pathology discussed on page 213 found its origin in the $(1/q^2)$-singularity of the
unscreened potential. This IR singularity is now cut off at momenta $q \sim \lambda^{-1}$, implying
benign behavior of the density of states. Using the fact that the density parameter $r_s$ scales
with the Fermi energy as (exercise) $r_s \sim \mu^{-1/2}$, and differentiating $\rho = -\partial_\mu^2 F$, it is indeed
straightforward to verify that the RPA expansion of $F$ no longer leads to singular behavior
of $\rho$ at the Fermi energy.

It is also interesting to consider the limit $v_F/\omega \ll q^{-1}$ of essentially dynamic polarization.
Expansion of $\Pi_{\mathbf{q},\omega_m}$ in Eq. (5.30) to leading order in $v_F q/\omega \ll 1$ leads to the expression

$$V_{\rm eff}(\mathbf{q}, \omega_m) = \frac{4\pi e^2}{q^2} \, \frac{1}{1 + \omega_p^2/\omega_m^2},$$

where $\omega_p = (4\pi n e^2/m)^{1/2}$ denotes the plasma frequency and we have used the fact that
(exercise) the particle density $n = N/L^3 = p_F^3/3\pi^2$. In fact, the form of the denominator
hints at a collective instability of the electron gas at frequencies $\sim \omega_p$. Referring for a more
rigorous discussion to the next chapter, one can understand the origin of the instability by
another heuristic argument. What we need to remember is that the current formalism is
developed for imaginary times $\tau$. Eventually we will be interested in real-time dynamics,
i.e. we need to analytically continue back to $t \sim -i\tau$. In the language of frequencies, this
amounts to an analytic continuation process from Matsubara frequencies to real frequencies,
$i\omega_m \to \omega$. Sloppily substituting a real frequency into the potential, we obtain
$V_{\rm eff}(\mathbf{q}, \omega_m \to -i\omega) = \frac{4\pi e^2}{q^2} (1 - \omega_p^2/\omega^2)^{-1}$, i.e. the system seems to respond singularly
to excitations with frequency $\omega \simeq \omega_p$. The cause of this instability is the well-known
plasmon mode of the electron gas.
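The two limits of the dielectric function $\epsilon = 1 - V(\mathbf{q})\Pi_{\mathbf{q},\omega_m}$ can be verified directly from the Lindhard function. The sketch below works in units $e^2 = m = p_F = 1$ (an assumption), so that $\nu_0 = 1/\pi^2$, $v_F = 1$ and $n = 1/3\pi^2$; the function names are hypothetical.

```python
import cmath
import math


def lindhard(omega_m: float, vf_q: float, nu0: float) -> complex:
    """Lindhard function in the form -nu0 * [1 - (i w / (2 vF q)) * ln((i w + vF q)/(i w - vF q))]."""
    iw = 1j * omega_m
    return -nu0 * (1.0 - iw / (2.0 * vf_q) * cmath.log((iw + vf_q) / (iw - vf_q)))


def dielectric_function(q: float, omega_m: float,
                        e2: float = 1.0, m: float = 1.0, p_f: float = 1.0) -> complex:
    """epsilon(q, omega_m) = 1 - V(q) * Pi with V(q) = 4 pi e2 / q^2."""
    nu0 = m * p_f / math.pi ** 2
    v_f = p_f / m
    v_q = 4.0 * math.pi * e2 / q ** 2
    return 1.0 - v_q * lindhard(omega_m, v_f * q, nu0)


def plasma_frequency_squared(e2: float = 1.0, m: float = 1.0, p_f: float = 1.0) -> float:
    """omega_p^2 = 4 pi n e2 / m with the density n = p_F^3 / (3 pi^2)."""
    n = p_f ** 3 / (3.0 * math.pi ** 2)
    return 4.0 * math.pi * n * e2 / m


if __name__ == "__main__":
    # Dynamic limit, vF q / omega = 0.01: epsilon -> 1 + omega_p^2 / omega_m^2.
    print(dielectric_function(0.01, 1.0), 1.0 + plasma_frequency_squared())
    # Static limit: epsilon -> 1 + 4 pi e2 nu0 / q^2 (Thomas-Fermi).
    print(dielectric_function(0.5, 1e-9), 1.0 + 16.0 / math.pi)
```

After the continuation $\omega_m^2 \to -\omega^2$ the dynamic-limit expression becomes $1 - \omega_p^2/\omega^2$, whose zero at $\omega = \omega_p$ is the plasmon pole of $V_{\rm eff}$.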



INFO The physics of the plasmon mode can be understood as follows. Imagine the electron
gas uniformly displaced by a distance $x$ against the positively charged background (see figure).
This will lead to the formation of oppositely charged surface layers at the two ends of the system.
The surface charge densities $\rho_\pm = \pm exn$ lead to an electric field $E = 4\pi enx$ directed opposite
to the displacement vector. Mobile charge carriers inside the system are thus subject to a force
$-4\pi e^2 n x$. The solution of the equation of motion $m\ddot{x} = -4\pi e^2 n x$ oscillates at a frequency
$\omega_p = (4\pi e^2 n/m)^{1/2}$, the plasma frequency. Since the motion of the charge carriers is in turn
responsible for the accumulation of the charged surface layers, we conclude that the system
performs a collective oscillatory motion, known as the plasmon excitation.
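To get a feeling for the scale of $\omega_p$, the formula can be evaluated in Gaussian (CGS) units, in which it is written here. The sketch below uses rounded values of the physical constants and, purely as an illustrative assumption, a conduction electron density of $8.5 \times 10^{22}\,{\rm cm}^{-3}$, roughly that of copper; none of these numbers appear in the text.

```python
import math

# Rounded physical constants in Gaussian (CGS) units.
E_ESU = 4.80320e-10        # elementary charge in statcoulomb
M_E_G = 9.10938e-28        # electron mass in g
HBAR_ERG_S = 1.05457e-27   # reduced Planck constant in erg s
ERG_PER_EV = 1.60218e-12   # conversion factor, erg per eV


def plasma_frequency(n_cm3: float) -> float:
    """omega_p = sqrt(4 pi e^2 n / m) in rad/s for an electron density n in cm^-3."""
    return math.sqrt(4.0 * math.pi * n_cm3 * E_ESU ** 2 / M_E_G)


def plasmon_energy_ev(n_cm3: float) -> float:
    """Plasmon energy hbar * omega_p in eV."""
    return HBAR_ERG_S * plasma_frequency(n_cm3) / ERG_PER_EV


if __name__ == "__main__":
    n = 8.5e22  # assumed density, roughly that of conduction electrons in copper
    print(f"omega_p = {plasma_frequency(n):.3e} rad/s, hbar*omega_p = {plasmon_energy_ev(n):.2f} eV")
```

For metallic densities the plasmon energy comes out at the order of 10 eV, far above typical thermal energies.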


At this point we conclude our preliminary discussion of the electron gas. We have seen that 
large-order perturbation theory can be applied to successfully explain various features of 
the interacting system: energetic lowering due to quantum correlation, screening, and even 
collective instabilities. 

The interacting electron gas is but one example of the many applications of the diagram- 
matic perturbation theory. After the full potential of the approach had become evident - in 
the late 1950s and early 1960s - diagrammatic techniques of great sophistication were devel- 
oped, and applied to a plethora of many-body problems. Indeed, more than two decades 
passed before large-order perturbation theory eventually ceased to be the most important 
tool of theoretical condensed matter physics. Reflecting the great practical relevance of the 
approach, there is a huge body of textbook literature concentrating on perturbative methods.$^{30}$
Although it would make little sense to develop the field in its full depth once again,
a few generally important concepts of diagrammatic perturbation theory are summarized 
in the next section. 


$^{30}$ See, e.g., A. A. Abrikosov, L. P. Gorkov, and I. E. Dzyaloshinskii, Methods of Quantum Field Theory in
Statistical Physics (Dover Publications, Inc., 1975), A. Fetter and J. D. Walecka, Quantum Theory of
Many-Particle Systems (McGraw-Hill, 1971), D. Pines and P. Nozières, The Theory of Quantum Liquids - Normal
Fermi Liquids (Addison-Wesley, 1989), and S. Doniach and E. H. Sondheimer, Green Functions for Solid State
Physicists (Benjamin/Cummings, 1974).

5.3 Infinite-order expansions

Turning back to the prototypical $\phi^4$-model, it is the purpose of the present section to
introduce a number of general concepts of infinite-order perturbative summations. As should 
be clear from the discussion above, a meaningful summation over an infinite set of diagrams 
necessitates the existence of a class of perturbative corrections that is “more important” 
than others. In practice, what we need is a small parameter that discriminates between 
diagrams of different structure. In our example above, this parameter was the effective 
density r s of the electron gas. However, in other settings, the control parameter N may 
be defined quite differently: large values of a spin, S, the number of colors, N c , in QCD, 
the number of spatial dimensions, d, the number of modes of an optical wave guide, etc. 
Unfortunately, in most physical contexts, these parameters are typically far from large, 
S = 1/2, d = N c = 3, etc. So we have to resort to a “poor man’s” strategy where we 
develop a controlled and self-consistent theory in the limit of asymptotically large control 
parameters and hope that some fragments of truth survive the limit down to more mundane 
values of N. Perhaps unexpectedly, this strategy often works astonishingly well down to 
values N = 0(1). 

So let us, then, begin by introducing a large control parameter into a $\phi^4$-type theory.
This can be achieved by promoting $\phi$ from a scalar to an $N$-component vector field
$\phi = \{\phi^a\}$, $a = 1, \ldots, N$. The self-interaction of the field is modeled as
$g \int d^dx \, \phi^a \phi^a \phi^b \phi^b$, i.e. an expression that is “rotationally” invariant in
$\phi$-space. The action of our modified theory is thus given by

$$S[\phi] = \int d^dx \left( \frac{1}{2} \partial\phi \cdot \partial\phi + \frac{r}{2} \phi \cdot \phi + \frac{g}{N} (\phi \cdot \phi)^2 \right), \qquad (5.37)$$

where the factor $1/N$ in front of the interaction constant has been introduced for later
convenience.

As before, we shall concentrate on the Green function $G^{ab}(\mathbf{x} - \mathbf{y}) = \langle \phi^a(\mathbf{x}) \phi^b(\mathbf{y}) \rangle$ as
a “test observable.” Denoting the Green function by a bold line, the free Green function
$G_0 \equiv \langle \phi^a(\mathbf{x}) \phi^b(\mathbf{y}) \rangle_0 \propto \delta^{ab}$ by a thin line, and the interaction operator by a wavy line,$^{31}$
the structure of the first- and second-order expansion of the Green function is shown in
the upper portion of Fig. 5.8. For simplicity, the combinatorial factors weighting individual
diagrams have been omitted.



Self-energy operator 

Even without resorting to the large-$N$ structure of the theory, it is possible to bring some
order in the spaghetti of diagrams contributing to the expansion. Indeed, there are two 
distinct subclasses of diagrams: diagrams that are one-particle reducible (i.e. can be cut 
into two halves by cutting a single internal line, see the classification on page 204) and 
those that are not. This observation motivates the collection of all one-particle irreducible 


$^{31}$ Since the four field vertices entering the interaction are no longer indiscriminate, the interaction “point”
representation of Section 5.1 is no longer suitable.

Figure 5.8 Expansion of the Green function of $\phi^4$-theory. Bottom: expansion of the self-energy
operator.


subportions of the diagrammatic expansion into a structural unit. In Fig. 5.8 this entity, 
which is commonly called the self-energy operator, and sometimes also the effective 
mass operator, is denoted by a hatched circle. The first- and second-order expansion of 
the self-energy is shown in the bottom part of the figure. 


With that definition, the Green function becomes a “chain” of self-energy operators, separated
by free Green function lines, as shown in the second equality of the figure. A convenient
representation of the expansion is shown in the third equality. An insertion of the full
Green function after the first self-energy correction recursively generates the full series. Let us
translate these statements into the language of formulae. Denoting the set of all self-energy
diagrams by $\hat\Sigma \equiv \{\Sigma^{ab}(\mathbf{x} - \mathbf{y})\}$,$^{32}$ the expansion of the Green function assumes the form

$$\hat{G} = \hat{G}_0 + \hat{G}_0 \hat\Sigma \hat{G}_0 + \hat{G}_0 \hat\Sigma \hat{G}_0 \hat\Sigma \hat{G}_0 + \cdots = \hat{G}_0 + \hat{G}_0 \hat\Sigma \hat{G}. \qquad (5.38)$$

Here, the operator products involve summation over coordinates and internal indices, i.e.
$(\hat{A}\hat{B})^{ab}(\mathbf{x} - \mathbf{y}) = \int d^dz \, A^{ac}(\mathbf{x} - \mathbf{z}) B^{cb}(\mathbf{z} - \mathbf{y})$. Recursion relations of this type are commonly
referred to as Dyson equations. The Dyson equation states that the problem of calculating
$\hat{G}$ is essentially tantamount to that of analyzing the self-energy. To make this point more
explicit, let us reformulate the Dyson equation in momentum space:

$$G_p = G_{0,p} + G_{0,p} \Sigma_p G_p \quad \Leftrightarrow \quad [1 - G_{0,p} \Sigma_p] G_p = G_{0,p}.$$

Freeman Dyson, 1923-

Trained as a mathematician, Dyson turned to physics in the 1940s. His work in condensed
matter physics, statistical mechanics, and several other areas has had lasting influence on
the development of modern theoretical physics. Beyond his professional work in
physics, Dyson has written several books on the social implications of modern science.
(Photo by Randall Hagadorn. Courtesy of the Institute of Advanced Study.)


Here we have used the convolution theorem or, more physically, the fact that all scattering
processes lumped into the self-energy conserve momentum separately. Matrix multiplication
of this identity from the left by $[1 - G_{0,p} \Sigma_p]^{-1}$ leads to the expression

$$G_p = [1 - G_{0,p} \Sigma_p]^{-1} G_{0,p} = [G_{0,p}^{-1} - \Sigma_p]^{-1}.$$

Finally, using the fact that $(G_{0,p}^{-1})^{ab} = (p^2 + r) \delta^{ab}$, we arrive at the formal solution

$$G_p^{ab} = \left[ (p^2 + r - \hat\Sigma_p)^{-1} \right]^{ab}. \qquad (5.39)$$
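For a single field component and fixed momentum all quantities in the Dyson equation are plain numbers, which makes the equivalence between the chain of self-energy insertions and the formal solution (5.39) easy to check. The sketch below does so; the function names and the sample values of $p^2$, $r$ and $\Sigma$ are illustrative assumptions.

```python
def dyson_series(g0: float, sigma: float, order: int) -> float:
    """Partial sum G0 + G0*S*G0 + G0*S*G0*S*G0 + ... with up to `order` self-energy insertions."""
    total, term = 0.0, g0
    for _ in range(order + 1):
        total += term
        term *= sigma * g0  # append one more self-energy insertion and free propagator
    return total


def dyson_resummed(p2: float, r: float, sigma: float) -> float:
    """Formal solution (5.39) for a single field component: 1 / (p^2 + r - Sigma)."""
    return 1.0 / (p2 + r - sigma)


if __name__ == "__main__":
    p2, r, sigma = 1.0, 0.5, -0.3
    g0 = 1.0 / (p2 + r)
    g = dyson_resummed(p2, r, sigma)
    print(dyson_series(g0, sigma, 60), g)
    # The resummed result satisfies the recursion G = G0 + G0 * Sigma * G.
    print(g - (g0 + g0 * sigma * g))
```

The recursion check in the last line holds irrespective of whether the series itself converges, which is the practical content of the statement that the self-energy "adds" to $p^2 + r$.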


This equation provides two lessons: firstly, the full information about the Green function is 
indeed stored in the self-energy; secondly, the self-energy somehow “adds” to the arguments 
p 2 and r entering the quadratic action, a point to be discussed in more detail below. 

But how then do we compute the self-energy operator? In fact, the construction recipe 
follows from what has been said above. By definition, the nth-order contributions to the self- 
energy are generated by a connected and one-particle irreducible contraction of n interaction 
operators (weighted with the appropriate combinatorial factor 1/n!). Two field vertices stay 
uncontracted as connectors to the free Green function lines contacting the self-energy. For 
example, the first-order contribution is given by (exercise) 


[*g>r =-«■*£ . 

p' p' 

where the first (second) contribution corresponds to the first (second) diagram in the self- 
energy expansion of Fig. 5.8. 

EXERCISE Represent the second-order contribution in terms of Green functions. 

Once the self-energy has been computed to any desired order, the result is substituted into 
Eq. (5.39) and one obtains the Green function. 


$^{32}$ The conservation of global momentum of the theory implies (exercise: think about it!) that, like the Green
function, the self-energy depends only on the difference of its coordinate arguments.

INFO The critical reader will note that there are some problems with the line of argument above.
Firstly, we have tacitly ignored the issue of combinatorics. (How do we know that once we have
plugged the expansion of the self-energy into the Dyson equation we get the same result as a
brute-force direct expansion of the Green function would have obtained?) To understand that the
two-step program - “first compute the self-energy, and then substitute into the Dyson series” -
indeed produces correct results, let us consider the $n$th-order contribution to the expansion of
the Green function with its overall combinatorial factor $1/n!$. Now imagine that we want to
distribute those diagrams that contain, say, one free internal Green function over two self-energy
operators according to the expression $\hat{G}_0 \hat\Sigma \hat{G}_0 \hat\Sigma \hat{G}_0$. Assuming that the first self-energy operator
is of order $m < n$ and the second one of order $n - m$, we notice that there are $\binom{n}{m}$ possibilities
to distribute the interaction vertices over the two self-energies. That means that we obtain an
overall combinatorial factor of $\frac{1}{n!} \binom{n}{m} = \frac{1}{m!(n-m)!}$. But $1/m!$ and $1/(n-m)!$ are precisely the
combinatorial factors that appear in the definition of an $m$th- and an $(n-m)$th-order self-energy
operator, respectively. Arguing in reverse, we conclude that the prescription above indeed
produces the correct combinatorics.
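The counting identity used in this argument, $\frac{1}{n!}\binom{n}{m} = \frac{1}{m!\,(n-m)!}$, can be confirmed with exact rational arithmetic. A small sketch follows; the function names are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial


def split_weight(n: int, m: int) -> Fraction:
    """(1/n!) * C(n, m): weight of n interaction vertices split as m + (n - m)."""
    return Fraction(comb(n, m), factorial(n))


def product_of_self_energy_weights(n: int, m: int) -> Fraction:
    """(1/m!) * (1/(n - m)!): product of the prefactors of the two self-energy operators."""
    return Fraction(1, factorial(m) * factorial(n - m))


if __name__ == "__main__":
    for n in range(2, 7):
        for m in range(1, n):
            print(n, m, split_weight(n, m), product_of_self_energy_weights(n, m))
```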

A second objection concerns the consistency of the expansion, i.e. the nth-order expansion 
of the self-energy is, of course, by no means equivalent to nth-order expansion of the Green 
function, nor to any specific order of the expansion. Indeed, when working with the concept of a 
self-energy, structuring the expansion according to its order in the interaction operator does not 
make much sense. We should rather focus on the summation of specific infinite-order diagram 
classes as exemplified in the previous section and discussed in more general terms below. 


Large-N expansion 

So far we have not made reference to the N-component structure of the theory. However, 
let us now assume that N is very large, i.e. that we may be content with an expansion of 
the Green function to leading order in $1/N$. This condition can be made explicit by sending 
$N \to \infty$ and declaring $\lim_{N\to\infty} G^{aa}$ to be our observable of interest. 

The limit of large N entails a drastic simplification of the diagrammatic expansion. Each 
interaction vertex comes with an overall factor of $1/N$ which must be compensated by a 
summation over field components to produce a contribution that survives the limit $N \to \infty$. 
This condition removes numerous diagrams contributing to the series. For example, in the Green 
function expansion of Fig. 5.8 only the first, third, eighth and ninth diagrams survive the limit. 
In all other contributions, interaction and Green function lines are interwoven in a way that 
does not leave room for one field-index summation per interaction vertex. 


Max Born, 1882-1970 

Worked on the mathematical basis of quantum mechanics. Amongst his major contributions 
was the interpretation of the absolute square of the wave function as a measure of the 
probability of finding the particle at a given location. Born shared the 1954 Nobel Prize in 
Physics with Walther Bothe “for his fundamental research in quantum mechanics, especially 
for his statistical interpretation of the wavefunction.” (Image © The Nobel Foundation.) 





Inspection of the series shows that only diagrams void of crossing interaction lines (cf. the 
figure on the right) survive the limit of large N. The approximation - indeed in the limit of 
infinite N it becomes exact - that retains only these contributions is commonly called the 
non-crossing approximation (NCA). More poetically, the diagrams contributing to the reduced 
expansion are sometimes called “rainbow diagrams.” 

Importantly, the NCA self-energy can be computed in closed form. All one has to realize 
is that the summation over all rainbow diagrams amounts to substitution of the full NCA 
Green function under a single interaction line (exercise). Using the fact that in the NCA 
the self-energy is proportional to unity in the field-index space, we can express this fact 
through the formula 

$$\Sigma_p \overset{\mathrm{NCA}}{=} -g L^{-d} \sum_{p'} G_{p'} = -g L^{-d} \sum_{p'} \left( p'^2 + r - \Sigma_{p'} \right)^{-1}. \qquad (5.40)$$

In the literature, this equation goes under the name self-consistent Born approxima- 
tion. It is a “Born approximation” because, formally, it resembles a first-order perturbative 
correction (the overall factor of g). The approximation is “self-consistent” because the self- 
energy recursively appears on the right-hand side of the equation again, i.e. the equation is 
in fact not of first but of infinite order. 

INFO Although the objective of the present section is to expose general structures, let us 
briefly review the solution of a Born equation. To keep things simple, let us assume that 
we are dealing with the low-energy approximation to some microscopic model, i.e. that the 
momentum summations must be cut off at some upper limit $\Lambda$. We further make the assumption 
(to be checked self-consistently) that the solution for the self-energy operator will turn out to be 
momentum-independent: $\Sigma_p = \Sigma$. This leads to the expression 

$$\Sigma \simeq -g \int^{\Lambda} \frac{d^d p}{(2\pi)^d}\, \frac{1}{p^2 + r - \Sigma}.$$

The evaluation of the integral depends on dimensionality and on the analytical structure of 
the energy denominator. For example, taking $d = 2$ and assuming that the parameter $r$ is 
much smaller than the self-energy induced by scattering - an assumption also to be checked 
self-consistently later on - we obtain 

$$\Sigma \simeq -\frac{g}{4\pi} \ln\left( \frac{\Lambda^2}{-\Sigma} \right).$$

A solution $\Sigma(g, \Lambda)$ can now be sought by approximate analytical methods, or graphically. (One 
plots both sides of the equation as functions of $\Sigma$ and seeks a crossing point.) 
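
The graphical solution just described can also be mimicked numerically. The sketch below assumes that, for $d = 2$, negligible $r$ and $\Lambda^2 \gg |\Sigma|$, the self-consistency condition takes the form $\Sigma \simeq -\frac{g}{4\pi}\ln(\Lambda^2/(-\Sigma))$ (obtained by carrying out the momentum integral above), and that $g > 0$. The function name `solve_born_2d` and the parameter values are hypothetical; bisection is merely one convenient root finder, not a method prescribed by the text.

```python
import math


def solve_born_2d(g: float, cutoff: float, iterations: int = 200) -> float:
    """Solve Sigma = -(g / 4 pi) * ln(cutoff**2 / (-Sigma)) for Sigma < 0 by bisection.

    Assumes g > 0. We work with s = -Sigma > 0 and look for the zero of
    h(s) = s - (g / 4 pi) * ln(cutoff**2 / s), which increases monotonically in s.
    """
    def h(s: float) -> float:
        return s - g / (4.0 * math.pi) * math.log(cutoff**2 / s)

    lo, hi = 1e-300, cutoff**2  # h(lo) < 0 and h(hi) = cutoff**2 > 0 bracket the root
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if h(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return -0.5 * (lo + hi)


# Illustrative (assumed) parameter values.
sigma = solve_born_2d(g=1.0, cutoff=10.0)
residual = sigma + (1.0 / (4.0 * math.pi)) * math.log(10.0**2 / (-sigma))
print(sigma, residual)
```

Working with $s = -\Sigma$ makes the sign of the solution explicit: the root is sought among positive $s$ only, in line with the statement that the self-energy comes out negative.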

However, for our present discussion, more important than the detailed dependence of $\Sigma$ on the 
input parameters $g$ and $\Lambda$ is the principal meaning of the self-energy: apparently, $-\Sigma$ adds to the 
parameter $r$ of the naked Green function. (Notice that the solution of the equation determining 
$\Sigma$ will be negative, so that $r - \Sigma > r$.) Remembering that $r \sim \xi^{-2}$, where $\xi$ is the correlation 
length, one concludes that the interaction operator lowers the spatial correlation of the system. 
This is indeed what one should expect intuitively: scattering due to interactions acts as a source 
of “disorder” inside the system. 

Figure 5.9 Expansion of the four-point function. Notice that the arrows represent the full Green 
function, i.e. all diagrams “renormalizing” the two-particle subunits of the diagram are 
automatically included. 

At this stage, it is worthwhile to take a step back and see what we have achieved. We 
have managed to compute the Green function to infinite order in an expansion in the set 
of “relevant” diagrams. How does that fit together with what has been said in Section 5.1 
about the “asymptotic” nature of perturbative series? In fact, the exponential proliferation 
of the number of diagrams, i.e. the mechanism that led to the eventual breakdown of the 
perturbative expansion, is blocked by the limit $N \to \infty$. Only subclasses of diagrams, with 
far fewer members, contribute and the series remains summable. 

The large-N principle is actually not limited to the expansion of the Green function. To 
illustrate the point, let us briefly consider the expansion of the four-point correlation 
function 

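$$C^{(4)}_q \equiv \frac{1}{N L^d} \sum_{ab} \sum_{p,p'} \left\langle \phi^a_p\, \phi^a_{-p-q}\, \phi^b_{p'+q}\, \phi^b_{-p'} \right\rangle. \qquad (5.41)$$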

In the next chapter we will see that objects of this architecture represent the most important 
information carriers of the theory. Unlike the Green function discussed previously, they 
relate directly to observable quantities. In the many-body literature, the four-point function 
is described as a two-particle propagator, indicating that it describes the joint propagation 
of two particles. Leaving a more substantial discussion of the meaning of this object to 
the next chapter, we concentrate here on the formal aspects of its perturbative expansion. 

The structure of the expansion of the four-point function is shown in Fig. 5.9 where, for 
simplicity, momentum/component-indices are indicated only once. The simplest diagram 
contributing to the expansion consists of just two Green functions. It encapsulates all 
disconnected contractions, i.e. $\langle \phi^a_p \phi^b_{-p'} \rangle \langle \phi^a_{-p-q} \phi^b_{p'+q} \rangle \sim \delta^{ab} \delta_{pp'}$, contributing to $C^{(4)}$. All 
other contractions simultaneously involve all four field operators, i.e. they contain interaction 
lines between the Green functions. The sum of all these contributions is represented by 
the diagram containing the hatched surface. A few low-order contributions to the expansion 
are explicitly shown in the second line. Notice that all arrows appearing in these diagrams 
are fat. This means that diagrams “dressing” the two-particle sub-units of the expansion 
are automatically included. For example, the second contribution, containing just a single 
interaction line between the two-particle propagators, in fact represents an entire series of 
diagrams obtained by substituting the expansion of Fig. 5.8 for the full Green function. (In 
an analytical calculation, one will take account of these contributions simply by substitution 
of the self-energy renormalized Green function for each arrow.) 

EXERCISE Write down the analytical expressions contributing to the low-order diagrams shown 
in the first and second lines of the figure. 

All diagrams involving interactions between the two Green functions have the common factor 
that they contain four external “legs,” i.e. the Green function connectors to the external 
field operators. If you imagine these legs removed, you end up with a “core contribution” 
often called the vertex. (The designation is motivated by the fact that the first contribution 
to the vertex is, indeed, the contracted interaction vertex of the action; see diagram no. 2.) 
In Fig. 5.9, the vertex is denoted by a tightly hatched area. 

As with the Green function, the expansion of the vertex can be given some structure. 
To this end, notice that some of the diagrams contributing to the vertex (e.g. the second 
diagram in the second line with external legs removed) can be cut into two just by cutting 
two internal Green function lines. Vertex diagrams of this type are called two-particle 
reducible, by analogy with the “one-particle reducible” contributions to the expansion 
of the Green function. As with the expansion of the latter, one can lump all irreducible 
contributions to the vertex (e.g. the last diagram in the first line, or the first diagram in 
the second line, external legs removed) into a structural subunit called the irreducible 
vertex. In Fig. 5.9, that unit is denoted by a lightly hatched area. The first two diagrams 
contributing to the irreducible vertex are shown in the fourth line of the figure. Here, one can 
see that the irreducible vertex plays a role similar to the self-energy of the Green function. 
Expressed in terms of the irreducible vertex, the expansion assumes the regular form shown 
in the third line. 

To represent these graphical relations in analytical form, we denote the full vertex by the 
symbol $\Gamma = \{\Gamma^{aa',bb'}_{p,p',q}\}$ and the irreducible vertex by $\Gamma_0 = \{\Gamma^{aa',bb'}_{0,p,p',q}\}$, where the 
indices $a, a', b, b'$ keep track of the index labels carried by the four Green functions entering 
the vertex. (Although we have defined our correlation function in such a way that $a = a'$, 
$b = b'$, a generalization to a four-fold index label is necessary to formulate the recursion 
Eq. (5.42).) The three momentum arguments represent the momenta of the Green functions 
connecting to the vertex operators, as indicated explicitly in the first line of the figure. 
(Remember that the theory has overall momentum conservation, i.e. three momentum arguments 
suffice to unambiguously fix the momentum dependence of $\Gamma$ and $C^{(4)}$.) The content of the third 
line of the figure can then be expressed in terms of a closed recursion relation: 

$$\Gamma^{aa',bb'}_{p,p',q} = \Gamma^{aa',bb'}_{0,p,p',q} + \sum_{c,c',p''} \Gamma^{aa',cc'}_{0,p,p'',q}\, G_{p''}\, G_{p''+q}\, \Gamma^{cc',bb'}_{p'',p',q}. \qquad (5.42)$$

Hans Albrecht Bethe, 1906-2005 

Nobel Laureate in physics in 1967 for his “contributions to the theory of nuclear reactions, 
especially his discoveries concerning the energy production in stars.” As well as nuclear 
matter, he has also contributed substantially to atomic and condensed matter physics. 
(Image © The Nobel Foundation.) 


Equations of this type are (often) called Bethe-Salpeter equations. Comparison with 
Eq. (5.38) shows that this equation appears to be conceptually similar to the Dyson equation 
for the one-particle Green function.[33] Indeed, the principle behind most recursion relations 
of perturbation theory is a structure like 

$$X = X_0 + X_0 * Z * X, \qquad (5.43)$$

where $X$ is our object of interest (e.g. $\Gamma$), $X_0$ its “free” version, $Z$ a subunit that is, in 
some sense, “irreducible” ($\Sigma$ or $\Gamma_0 G G$), and $*$ some generalized matrix convolution. 

Figure 5.10 Large-N expansion of the two-particle correlation function into a “ladder structure”. 
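
To see the recursive structure of Eq. (5.43) at work, one may take the generalized convolution $*$ to be ordinary multiplication of 2 × 2 matrices (an assumption made purely for illustration) and compare the formal solution $X = (1 - X_0 Z)^{-1} X_0$ with the result of re-inserting $X$ into the right-hand side many times. The matrices below are arbitrary test values chosen so that the series converges; none of the helper names come from the text.

```python
def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def matadd(a, b):
    """Sum of two 2x2 matrices."""
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]


def inverse(a):
    """Inverse of a 2x2 matrix via the adjugate formula."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det], [-a[1][0] / det, a[0][0] / det]]


def closed_form(x0, z):
    """X = (1 - X0 Z)^{-1} X0, the formal solution of X = X0 + X0 Z X."""
    x0z = matmul(x0, z)
    one_minus = [[(1.0 if i == j else 0.0) - x0z[i][j] for j in range(2)] for i in range(2)]
    return matmul(inverse(one_minus), x0)


def iterated(x0, z, steps):
    """Re-insert X into the right-hand side `steps` times, starting from X = X0."""
    x = x0
    for _ in range(steps):
        x = matadd(x0, matmul(matmul(x0, z), x))
    return x


# Arbitrary (assumed) test matrices with a small product X0 Z, so the series converges.
x0 = [[0.5, 0.1], [0.0, 0.4]]
z = [[0.3, 0.0], [0.2, 0.5]]
exact = closed_form(x0, z)
series = iterated(x0, z, 60)
max_diff = max(abs(exact[i][j] - series[i][j]) for i in range(2) for j in range(2))
print(max_diff)
```

Each pass through the loop in `iterated` adds one more order of the series $X_0 + X_0 Z X_0 + X_0 Z X_0 Z X_0 + \cdots$, so agreement with `closed_form` illustrates what summing the series to infinite order means in the simplest possible setting.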

[33] We can make the analogy perfect by defining a “one-particle vertex” $\Gamma^{(1)} \equiv G_0^{-1}[G - G_0]G_0^{-1}$. 
Inspection of the second equality of Fig. 5.8 shows that the expansion of $\Gamma^{(1)}$ starts and ends 
with a self-energy operator, i.e. the first free Green function line $G_0$ is removed, and so are the 
two external $G_0$s connecting to the self-energy operator. In direct analogy to Eq. (5.42), the 
analytical formula for $\Gamma^{(1)}$ then reads $\Gamma^{(1)} = \Sigma + \Sigma G_0 \Gamma^{(1)}$. 

Owing to the importance of the two-particle propagator, the solution of Bethe-Salpeter 
type equations is a central issue of many areas of many-body physics. In most cases only 
approximate solutions can be obtained. With our present example, “approximate” means 
that one sends N to large values and seeks a solution to leading order in $N^{-1}$. In that limit, 
the only surviving contribution to the irreducible vertex is the first, i.e. a plain interaction 
line (see Fig. 5.10). As with the self-energy operator discussed in the previous section, all 
diagrams with “entangled” interaction lines are frustrated in the sense that we do not have 
as many index summations as interaction constants $\sim g/N$. Such contributions vanish in 
the limit of large N. We thus conclude that the Bethe-Salpeter equation assumes the simple 
form 

$$\Gamma^{ab}_{p,p',q} = -\frac{g}{N L^d} - \frac{g}{N L^d} \sum_{c,p''} G_{p''}\, G_{p''+q}\, \Gamma^{cb}_{p'',p',q},$$

where $\Gamma^{ab} \equiv \Gamma^{aa',bb'} \delta^{aa'} \delta^{bb'}$.[34] This equation can be simplified even further by making the 
ansatz $\Gamma^{ab}_{p,p',q} = \Gamma_q$, where $\Gamma_q$ is independent of discrete indices and input momenta $p$ and 
$p'$. (When solving a perturbative recursion relation, it is always a good idea to try an ansatz 
of maximal simplicity, i.e. one that is no more complex than the constituting elements of 
the equation.) Then, 

$$\Gamma_q = -\frac{g}{N L^d} - g P_q \Gamma_q \quad \Leftrightarrow \quad \Gamma_q = -\frac{g}{N L^d}\, \frac{1}{1 + g P_q}, \qquad (5.44)$$

where we have introduced the abbreviation $P_q = \frac{1}{L^d} \sum_p G_p G_{p+q}$. In principle, one may now 
proceed by substituting the large-N expansion of the Green function (5.39) and computing 
the function $P_q$ by integration over $p$. This would produce a closed expression for $\Gamma$ and, 
by virtue of the relation 

$$C^{(4)}_q = P_q \left[ 1 + N L^d\, \Gamma_q P_q \right],$$

our correlation function. Since the emphasis in this section is on conceptual aspects of 
perturbation theory, we will not pursue the analysis to its very end. (For an analysis of the 
Bethe-Salpeter equation in a context more interesting than $\phi^4$-theory, see the discussion of 
the quantum disorder problem in the next chapter.) Yet there is one aspect of the expression 
for $\Gamma_q$ worth noticing. Consider $P_q$ expanded as a Taylor series in $q$ and focus on the 
zeroth-order contribution, $P_0$. From the definition of the Green function (5.39) we have 

$$P_0 = \frac{1}{L^d} \sum_p G_p^2 = \frac{1}{L^d}\, \partial_\Sigma \sum_p G_p \overset{(5.40)}{=} -g^{-1}\, \partial_\Sigma \Sigma = -g^{-1}. \qquad (5.45)$$

When substituted into the formula for $\Gamma$, this shows that $1 + g P_0 = 0$, i.e. for small momenta 
the expansion of the denominator of the vertex starts with a power of $q$. (By symmetry, the first 
non-vanishing contribution will be of $O(q^2)$.) This means that both the vertex and the four-point 
correlation function are long-ranged objects, i.e. unlike the Green function, they decay not 
exponentially, but as a power law. The long-range character of the four-point function has 
observable consequences, as discussed in the next chapter. 

Summarizing, our discussion of the two- and four-point functions, and of the RPA theory 
of the interacting electron gas has shown that, if a large parameter is present, (a) relevant 
subclasses of perturbative contributions can be identified and (b) they can be summed to 
infinite order. As a matter of fact, there are not too many of these summable diagram classes: 
ring-diagrams, rainbow-diagrams and ladder-diagrams nearly exhaust the set of “friendly” 
corrections, amenable to analytical summation. Notice that, so far, we have largely restricted 
ourselves to the discussion of abstract summation schemes; i.e. we still need to learn more 
about the way intermediate results like Eq. (5.39) or (5.44) can be translated into concrete 
physical information. These aspects are discussed - on applications more rewarding than 
plain $\phi^4$-theory - a little later. However, at this point we leave the discussion of formal 
perturbation theory. While a state-of-the-art exposition of the subject would require much 
more space - for pedagogical and condensed-matter-oriented texts on perturbative methods, 
see Pines and Nozières, and Schulman[35] - the material introduced in this section suffices 
for nearly all purposes of the present text. 

[34] Here we have used the fact that the large-N approximation of the irreducible vertex forces the two 
input indices $a, a'$ to be equal (and the same for the output indices). As a word of caution, strictly 
speaking, our large-N approximation of the irreducible vertex explicitly uses the fact that the 
input/output indices entering our definition of the vertex are equal, i.e. should we compute a 
correlation function where the two input/output indices are different (a rare occurrence in realistic 
applications), the large-N approximation of the four-point functions would no longer assume the 
simple form of a regular ladder. 

Turning back to our prototypical problem of the interacting electron gas, in the next 
chapter we extend the purely diagrammatic methods discussed above to a more powerful 
hybrid approach comprising perturbative concepts in combination with functional methods. 

5.4 Summary and outlook 

This concludes our preliminary introduction to the concepts of perturbation theory. We have 
seen that general perturbative expansions mostly have the status of “asymptotic” rather 
than convergent series. We have learned how to efficiently encode perturbative series by 
graphical methods and how to assess the “importance” of individual contributions. Further, 
we have seen how the presence of a large parameter can be utilized to firmly establish 
infinite-order expansions. A number of recursive techniques have been introduced to sum 
up diagram sequences of infinite order. 

However, a second look at the discussion of the previous sections shows that the central 
tool, the functional integral, did not play much of a role. All it did was to provide the 
combinatorial framework of the perturbative expansion of correlation functions. However, 
for that we hardly need the full machinery of functional integration. Indeed, the foundations 
of the perturbative approach were laid down in the 1950s, long before people even began 
to think about the conventional path integral. (For a pure operator construction of the 
perturbative expansion, see the problem set.) 

More importantly, the analysis so far has a serious methodological weakness: all subclasses 
of relevant diagrams had the common feature that they contained certain “sub-units,” more 
structured than the elementary propagator or the interaction line. For example, the RPA 
diagrams were organized in terms of polarization bubbles, the NCA diagrams had their 
rainbows, and the ladder diagrams their rungs. Within the diagrammatic approach, in each 
diagram these units are reconstructed “from scratch,” i.e. in terms of elementary propaga- 
tors and interaction lines. However, taking seriously the general philosophy on information 
reduction declared at the beginning of the text, we should strive to make the “important” 
structural elements of an expansion our elementary degrees of freedom. This program is 
hardly feasible within purely diagrammatic theory. However, the functional integral is ide- 
ally suited to introducing degrees of freedom hierarchically, i.e. trading microscopic objects 
for entities of higher complexity. The combination of functional integral techniques with 
perturbation theory presents a powerful theoretical machinery which is the subject of the 
next chapter. 


[35] See D. Pines and P. Nozières, The Theory of Quantum Liquids: Normal Fermi Liquids (Addison-Wesley, 
1989), and L. S. Schulman, Techniques and Applications of Path Integration (Wiley, 1981). 


5.5 Problems 

Technical aspects of diagrammatic perturbation theory 

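Taking the second-order expansion of the $\phi^4$ Green function as an example, it is the purpose of this 
problem to discuss a number of technical aspects relating to the classification and the combinatorics of 
diagrams. 

(a) Show that the full contraction of $\langle \phi(x)\, \phi^4(y)\, \phi^4(y')\, \phi(x') \rangle$ generates the 945 terms 

$$\begin{aligned}
&\underbrace{9\, G_0(x - x')\, G_0^4(0)}_{1} + \underbrace{72\, G_0(x - x')\, G_0^2(y - y')\, G_0^2(0)}_{2} + \underbrace{24\, G_0(x - x')\, G_0^4(y - y')}_{3} \\
&+ \Big[ \underbrace{36\, G_0(x - y)\, G_0(x' - y)\, G_0^3(0)}_{4} + \underbrace{144\, G_0(x - y)\, G_0(x' - y)\, G_0^2(y - y')\, G_0(0)}_{5} \\
&\quad + \underbrace{144\, G_0(x - y)\, G_0(x' - y')\, G_0(y - y')\, G_0^2(0)}_{6} + \underbrace{96\, G_0(x - y)\, G_0(x' - y')\, G_0^3(y - y')}_{7} + (y \leftrightarrow y') \Big].
\end{aligned} \qquad (5.46)$$

Try to reproduce the combinatorial prefactors. 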

(b) Check explicitly that the disconnected terms cancel against the “vacuum loops” from 
the expansion of the denominator. 

(c) Represent the corresponding diagrams in momentum space. Convince yourself that the 
“Kirchhoff law” discussed in the main text suffices to unambiguously fix the result. 

Answer: 


(a) The seven distinct pairings of the ten $\phi$-fields are shown in Fig. 5.2 (and referred to here 
as diagrams 1-7 in the order in which they appear in the figure). By way of example, 
let us consider the combinatorial factor of diagram 2. There are $3 \cdot 2 = 6$ ways to pair 
two fields $\phi(y)$ and the same for $\phi(y')$. From these two contractions, we obtain a factor 
$(6 G_0(0))^2$. Then, there are two possible ways of pairing the two remaining $\phi(y)$ with the 
two remaining $\phi(y')$, leading to a factor $2 G_0^2(y - y')$. Finally, the pairing of $\phi(x)$ with 
$\phi(x')$ obtains a single factor $G_0(x - x')$. Multiplying all contributions, we obtain term 2. 
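
The combinatorial prefactors can also be checked by brute force. The sketch below (an illustration, not part of the original solution) enumerates all pairings of the ten fields in Python and groups them by which space-time points each pair connects; pairings related by $y \leftrightarrow y'$ are counted as separate classes. The names `pairings`, `signature`, `diagram_2` and `diagram_7` are illustrative.

```python
from collections import Counter


def pairings(items):
    """Yield every way of grouping the list `items` into unordered pairs."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail


def signature(matching):
    """Classify a pairing by how many propagators connect each pair of points."""
    return frozenset(Counter(tuple(sorted(pair)) for pair in matching).items())


# One field at x, four at y, four at y', one at x'.
fields = ["x"] + ["y"] * 4 + ["y'"] * 4 + ["x'"]
counts = Counter(signature(m) for m in pairings(fields))
total = sum(counts.values())

# Diagram 2: G0(x-x') G0(0)^2 G0(y-y')^2.
diagram_2 = frozenset({(("x", "x'"), 1), (("y", "y"), 1), (("y'", "y'"), 1), (("y", "y'"), 2)})
# Diagram 7: G0(x-y) G0(x'-y') G0(y-y')^3.
diagram_7 = frozenset({(("x", "y"), 1), (("x'", "y'"), 1), (("y", "y'"), 3)})

print(total, len(counts), counts[diagram_2], counts[diagram_7])
```

The enumeration finds 945 pairings in total, of which 72 have the structure of diagram 2 and 96 that of diagram 7. The eleven classes correspond to diagrams 1-3 plus diagrams 4-7 and their images under $y \leftrightarrow y'$.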

(b) To obtain all terms contributing to $G^{(2)}$, both numerator and denominator of Eq. (5.14) 
have to be expanded to second order (where the identification $X[\phi] = \phi(x)\phi(x')$ is 
understood). In symbolic notation, 

$$\frac{N_0 + N_1 + \frac{1}{2} N_2 + \cdots}{1 + D_1 + \frac{1}{2} D_2 + \cdots} = N_0 + N_1 - N_0 D_1 + \frac{1}{2} N_2 - N_1 D_1 + N_0 D_1^2 - \frac{1}{2} N_0 D_2 + \cdots.$$

The expression for $N_2$ is given by formula (5.46). From the main text, we know that 
$N_1 = 3 G_0(x - x') G_0^2(0) + 12 G_0(x - y) G_0(0) G_0(y - x')$ and $D_1 = 3 G_0^2(0)$. Further, 
$N_0 = G_0(x - x')$. Finally, $D_2$ reads 

$$D_2 = \langle \phi^4(y)\, \phi^4(y') \rangle = 9 G_0^4(0) + 72\, G_0^2(0)\, G_0^2(y - y') + 24\, G_0^4(y - y').$$

Collecting all the terms, we obtain 

$$\begin{aligned}
&N_2 - 18\, G_0(x - x')\, G_0^4(0) - \underbrace{72\, G_0(x - y)\, G_0^3(0)\, G_0(y - x')}_{4 + (y \leftrightarrow y')} + 18\, G_0(x - x')\, G_0^4(0) \\
&\quad - \underbrace{9\, G_0(x - x')\, G_0^4(0)}_{1} - \underbrace{72\, G_0(x - x')\, G_0^2(0)\, G_0^2(y - y')}_{2} - \underbrace{24\, G_0(x - x')\, G_0^4(y - y')}_{3} \\
&= \underbrace{144\, G_0(x - y)\, G_0(x' - y)\, G_0^2(y - y')\, G_0(0)}_{5} + \underbrace{144\, G_0(x - y)\, G_0(x' - y')\, G_0(y - y')\, G_0^2(0)}_{6} \\
&\quad + \underbrace{96\, G_0(x - y)\, G_0(x' - y')\, G_0^3(y - y')}_{7} + (y \leftrightarrow y'),
\end{aligned}$$

i.e. the set of connected diagrams shown in Fig. 5.4. 

(c) This translates to a straightforward exercise in Fourier transformation. 


Self-consistent T-matrix approximation 

As well as scattering from lattice vibrations and other electrons, the low-temperature conductivity of 
a metal is heavily influenced by defects or impurities. The general role of weak impurity scattering is 
discussed in some detail in Section 6.5. As a warm-up to our more comprehensive discussion below, 
the aim of the present problem is to determine the scattering rate imposed by a collection of isolated 
impurities using diagrammatic perturbation theory. 

Consider a system of spinless electrons subject to a random pattern of $N_{\mathrm{imp}}$ non-magnetic 
scattering impurities and described by the Hamiltonian $\hat H = \hat H_0 + \hat H_{\mathrm{imp}}$, where 
$\hat H_0 = \sum_k \epsilon_k c_k^\dagger c_k$, 

$$\hat H_{\mathrm{imp}} = v_0 a^d \int d^d r \sum_{i=1}^{N_{\mathrm{imp}}} \delta(r - R_i)\, c^\dagger(r) c(r) = v_0 a^d \sum_{i=1}^{N_{\mathrm{imp}}} c^\dagger(R_i) c(R_i),$$

and we have assumed that an impurity at position $R_i$ creates a local potential $v_0 a^d \delta(r - R_i)$. 
(Here $a$ is a constant, of dimensionality [length], identified roughly with the extension of 
the impurity.) 
Our aim is to compute the single-particle Green function $G_n(r - r') = \langle\langle c_n^\dagger(r) c_n(r') \rangle\rangle_{\mathrm{imp}}$, 
where $n$ is a Matsubara frequency index and the configurational average 
$\langle \cdots \rangle_{\mathrm{imp}} \equiv L^{-d N_{\mathrm{imp}}} \int \prod_{i=1}^{N_{\mathrm{imp}}} d^d R_i\, (\cdots)$ is defined by integration over all impurity coordinates. 

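(a) To begin, let us consider scattering from a single impurity (i.e. $N_{\mathrm{imp}} = 1$). By developing 
a perturbative expansion in the impurity potential, show that the Green function can 
be written as $\hat G_n = \hat G_{0,n} + \hat G_{0,n} \hat T_n \hat G_{0,n}$, where 

$$\hat T_n = \left\langle \hat H_{\mathrm{imp}} + \hat H_{\mathrm{imp}} \hat G_{0,n} \hat H_{\mathrm{imp}} + \hat H_{\mathrm{imp}} \hat G_{0,n} \hat H_{\mathrm{imp}} \hat G_{0,n} \hat H_{\mathrm{imp}} + \cdots \right\rangle_{\mathrm{imp}} \qquad (5.47)$$

denotes the T-matrix,[36] and $\hat G_n = \{ G_n(r - r') \}$. Show that the T-matrix equation is 
solved by $T_n(r, r') = \delta(r - r')\, L^{-d} \left( (v_0 a^d)^{-1} - G_{0,n}(0) \right)^{-1}$. 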

(b) For a collection of random impurities, the capacity for multiple scattering makes the 
concept of the T-matrix defined for the full operator $\hat H_{\mathrm{imp}}$ unilluminating. Instead, one 
would like to find an expression for the configurational average of the Green function 
involving the T-matrix of a single isolated impurity. Presenting the Green function as 
a series expansion in the impurity potential $v$, enumerate and classify the contributions 
entering the configurational average up to third order in $v$. Organising the series at this 
order in a diagrammatic expansion, show that a subset (all but one) of the diagrams can 
be incorporated into a Dyson series expansion, $G_k = G_{0,k} + G_{0,k} \Sigma_k G_k$, where $G_k \equiv G_{n,\mathbf{k}}$ 
denotes the Green function in a four-momentum representation, $\Sigma_k = N_{\mathrm{imp}} \langle \mathbf{k} | \hat T_n | \mathbf{k} \rangle$ is 
the self-energy and $\hat T_n$ represents the T-matrix for a single isolated impurity. 

The Dyson series expansion can be inverted to give $G_k = (i\omega_n - \epsilon_{\mathbf{k}} - \Sigma_k)^{-1}$. When 
Fourier transformed back to time/space, one may note that the imaginary part of the 
self-energy can be associated with (one half of) the scattering rate, $\mathrm{Im}\, \Sigma_k = -\frac{1}{2\tau} \mathrm{sgn}(n)$ 
(cf. the Info block on page 379 for further discussion of this point). To understand the 
sign factor $\mathrm{sgn}(n)$, notice that the imaginary part of $\hat T_n$ is determined by the imaginary 
part of $\hat G_{0,n}$. However, it is straightforward to verify that $\mathrm{sgn}(\mathrm{Im}\, G_{0,n}) = -\mathrm{sgn}(n)$. (For 
a comprehensive discussion of the analyticity properties of the self-energy, we refer to 
Chapter 7.) 

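(c) Sometimes, one is interested in the T-matrix for a fixed (unaveraged) pattern of dilute 
impurities. In this case, the single-impurity T-matrix is given by the right hand side of 
Eq. (5.47), without the averaging brackets; it is no longer diagonal in momentum space, 
i.e. $\hat T_n = \{ \langle k | \hat T_n | k' \rangle \} \equiv \{ T_{n,kk'} \}$. 

Making use of the operator identity $(1 - \hat O_2)^{-1} - (1 - \hat O_1)^{-1} = (1 - \hat O_2)^{-1} (\hat O_2 - \hat O_1)(1 - \hat O_1)^{-1}$, 
and defining the real-frequency analytical continuation $\hat T^{\pm}(\epsilon) = \hat T(i\omega_n \to \epsilon \pm i\delta)$ 
(the same for $\hat G$), show that 

$$\hat T^+(\epsilon) - \hat T^-(\epsilon) = \hat T^-(\epsilon) \left( \hat G_0^+(\epsilon) - \hat G_0^-(\epsilon) \right) \hat T^+(\epsilon). \qquad (5.48)$$

Using this result, show that the scattering rate is given by 

$$\frac{1}{2\tau} = -\mathrm{Im}\, \Sigma(\epsilon = \epsilon_k + i0) = \pi \sum_{k'} \left| \langle k' | \hat T^+(\epsilon_k) | k \rangle \right|^2 \delta(\epsilon_k - \epsilon_{k'}). \qquad (5.49)$$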


[36] For a discussion of the physical content of the T-matrix and its connection to the scattering matrix, 
we refer to any elementary text on scattering theory. 

Answer: 

(a) Representing the Green function as a coherent state path integral and performing the 
Gaussian integral over the Grassmann field, we obtain the formal result 
$\hat G_n = (i\omega_n - \hat H_0 - \hat H_{\mathrm{imp}})^{-1}$. We next expand $\hat G_n$ in $\hat H_{\mathrm{imp}}$ and compare with the definition of the 
T-matrix to identify the series in Eq. (5.47). Substituting $H_{\mathrm{imp}} = v_0 a^d \delta(r - R)$, where $R$ 
is the impurity position, we obtain $\hat T = \langle \delta(r - R)\, \delta(r' - R) \rangle_{\mathrm{imp}}\, v_0 a^d \left( 1 - v_0 a^d G_{0,n}(0, 0) \right)^{-1}$. 
Performing the average and rearranging terms we arrive at the result (see Fig. 5.11). 

Figure 5.11 Diagrammatic series expansion of the impurity averaged Green function (for a 
discussion, see the main text). 

(b) Presented as a series expansion in the random potential, 

$$\hat G_n = \left\langle \hat G_{0,n} + \hat G_{0,n} \hat H_{\mathrm{imp}} \hat G_{0,n} + \hat G_{0,n} \hat H_{\mathrm{imp}} \hat G_{0,n} \hat H_{\mathrm{imp}} \hat G_{0,n} + \cdots \right\rangle_{\mathrm{imp}}.$$

Turning to a momentum space representation and using the fact that 
$\langle k | \hat H_{\mathrm{imp}} | k' \rangle = v_0 (a/L)^d \sum_i e^{i R_i \cdot (k - k')}$, the first-order term in the series assumes the form 

$$G^{(1)}_{kk'} = G_{0,k}\, v_0 \left( \frac{a}{L} \right)^d \sum_i \left\langle e^{i R_i \cdot (k - k')} \right\rangle_{\mathrm{imp}} G_{0,k'} = G_{0,k}\, v_0 N_{\mathrm{imp}} \left( \frac{a}{L} \right)^d G_{0,k}\, \delta_{kk'}.$$

At second order, we obtain 

$$\begin{aligned}
G^{(2)}_{kk'} &= G_{0,k}\, v_0^2 \left( \frac{a}{L} \right)^{2d} \sum_{k''} \sum_{i,i'} \left\langle e^{i R_i \cdot (k - k'')}\, G_{0,k''}\, e^{i R_{i'} \cdot (k'' - k')} \right\rangle_{\mathrm{imp}} G_{0,k'} \\
&= G_{0,k}\, v_0^2 \left( \frac{a}{L} \right)^{2d} \left[ N_{\mathrm{imp}} \sum_{k''} G_{0,k''} + N_{\mathrm{imp}} (N_{\mathrm{imp}} - 1)\, G_{0,k} \right] G_{0,k}\, \delta_{kk'},
\end{aligned}$$

where the first (second) contribution describes the two-fold scattering off a single (one-fold 
scattering off two different) impurities. Notice that, upon averaging, both types 
of scattering conserve momentum, i.e. $G_{kk'} \propto \delta_{kk'}$. A graphical representation of the 
two processes is shown in the first line of Fig. 5.11 where crosses represent individual 
impurities (averaging over position understood) while the dashed lines denote scattering 
events. At third order in $v$, one may identify contributions from where all three impurities 
are distinct, where only two are distinct, and where the scattering occurs at the 
same impurity (see the bottom line of the figure). Of these terms, one may note that 
the penultimate diagram is special as it involves a “crossing” of the impurity (i.e. the 
dotted) lines. Later, in the discussion of the weakly disordered system, we show that 
such contributions enter with a suppression factor of $(k_F \ell)^{-1} \ll 1$, where $\ell = v_F \tau$ is the 
elastic mean free path, $k_F$ the Fermi wavevector, and the inequality holds in all but the 
dirtiest metals. Physically, diagrams with crossed lines represent processes wherein an 
electron alternately scatters off different impurities. Clearly, such events become highly 
improbable in the limit of a low impurity concentration. 

Since impurity averages engaging a single impurity involve zero momentum transfer, one 
may note that the remaining series can be organised into the form of the required Dyson 
series expansion. Here, since the blocks entering the series do not correlate different 
impurities, the self-energy may be identified as $N_{\mathrm{imp}}$ times the single-impurity T-matrix; 
the approximation neglecting crossed lines is called the T-matrix approximation. 

Now, when pursuing to fourth order and higher, one may note the existence of a second 
class of “rainbow” diagrams which (a) do not involve crossed impurity lines and (b) 
are not included in the Dyson series (see the figure above). These contributions can be 
incorporated by replacing the bare Green function $G_{0,k}$ by the impurity averaged Green 
function $G_k$ in the T-matrix expansion Eq. (5.47), an exercise that can be performed only 
self-consistently. Such a scheme is known as the self-consistent T-matrix approximation 
(SCTA). 

(c) Noting that $\hat T^{\pm}(\epsilon) = \hat H_{\mathrm{imp}} \left( 1 - \hat G_0^{\pm}(\epsilon) \hat H_{\mathrm{imp}} \right)^{-1}$, a straightforward application of the 
given identity gives the required expression. Now, applying this result to the self-energy, 
substituting $\epsilon = \epsilon_k$ for the energy arguments of the Green functions, and noting that 
$\mathrm{Im} \langle k' | \hat G_0^+(\epsilon_k) | k' \rangle = -\pi \delta(\epsilon_k - \epsilon_{k'})$, one obtains Eq. (5.49), a manifestation of the “Golden 
Rule.” 


Kondo effect: perturbation theory 

In Problem 2.4, we introduced the Kondo effect, describing the interaction of a local impurity with an 
itinerant band of carriers. (Those unfamiliar with the physical context and background to the problem 
are referred back to that section.) There we determined an effective Hamiltonian for the coupled system, 
describing the spin exchange interaction that acts between the local moment of the impurity state and 
the itinerant band. In the following, motivated by the seminal work of Kondo, we employ methods 
of perturbation theory to explore the impact of magnetic fluctuations on transport. In doing so, we 
elucidate the mechanism responsible for the observed minimum of electrical resistance found in magnetic 
impurity systems. 

The perturbation theory of the Kondo effect is one of those problems where the traditional formalism 
of second quantized operators is superior to the field integral (the reason being that perturbative 
manipulations on quantum spins are difficult to formulate in a field integral language). More specifically, 
the method of choice would be second quantized perturbation theory formulated in the interaction 
picture, a concept discussed in great detail in practically any textbook on many-body perturbation 
theory, but not in this text. For this reason, the solution scheme discussed below is somewhat awkward 
and surely not very efficient (yet fully sufficient for understanding the essence of the phenomenon). 





Although the phenomenology of the Kondo system can be explored in several different 
ways ranging from the exact analytical solution of the Kondo Hamiltonian system 37 to a 
variational analysis of the Anderson impurity Hamiltonian, 38 in the following we focus on 
the perturbative scheme developed in the original work of Kondo. Later, in Problem 8.8.5, we 
will introduce a more advanced approach based on the renormalization group. The starting 
point of the analysis is the effective sd-Hamiltonian (2.51) introduced in Problem 2.4. 
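Setting $\hat H_{sd} = \hat H_0 + \hat H_{\rm imp}$, where

$$\hat H_0 = \sum_{\mathbf k\sigma}\epsilon_{\mathbf k}\,c^\dagger_{\mathbf k\sigma}c_{\mathbf k\sigma}, \qquad \hat H_{\rm imp} = 2J\,\hat{\mathbf S}\cdot\hat{\mathbf s}(\mathbf r = 0),$$

and $\hat{\mathbf s}(0) = \frac12\sum_{\mathbf k\mathbf k'\sigma\sigma'}c^\dagger_{\mathbf k\sigma}\boldsymbol\sigma_{\sigma\sigma'}c_{\mathbf k'\sigma'}$, our aim here is to develop a perturbative expansion in $J$
to explore the scattering properties of the model. Here we have assumed that the exchange
constant $J$ is characterized by a single parameter, positive in sign (i.e. antiferromagnetic).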

In Problem 5.5, we have seen that the scattering rate associated with a system of impurities
is specified by the T-matrix:

$$\frac{1}{2\tau} = \pi N_{\rm imp}\sum_{\mathbf k',\sigma'}\big\langle|\langle\mathbf k,\sigma|\hat T^{+}(\epsilon)|\mathbf k',\sigma'\rangle|^2\big\rangle_S\,\delta(\epsilon_{\mathbf k} - \epsilon_{\mathbf k'}),$$

where the symbol $\langle\cdots\rangle_S = \mathrm{tr}_S(\cdots)/\mathrm{tr}_S(1)$ indicates that the calculation of the electron
self-energy implies an average over all configurations of the impurity spin. (Indeed, it is the
presence of an internal impurity degree of freedom which makes the problem distinct from
conventional disorder scattering.) Via the Drude formula, $\rho = m/(e^2 n\tau)$ ($n$ is the electron
density), the scattering time can be used to estimate the effect of the impurities on the
electric resistivity $\rho$ of the system. 39

(a) Show that to leading order in the exchange constant $J$ the scattering rate is given by

$$\frac{1}{2\tau} = \pi c_{\rm imp}\,\nu J^2 S(S+1),$$

where $\nu$ denotes the density of states at the Fermi level and $c_{\rm imp} = N_{\rm imp}/L^d$ the impurity
concentration. From this result, one can infer
a resistivity $\rho_{\rm imp} \sim (m/e^2\epsilon_F)\,c_{\rm imp}J^2S(S+1)$, independent of temperature. Taken at
this order, scattering off magnetic impurities clearly does not explain the existence of a
resistance minimum.

(b) At second order in the expansion in $\hat H_{\rm imp}$, the T-matrix assumes the form $\hat T^{(2)} =
\hat H_{\rm imp}(\epsilon^+ - \hat H_0)^{-1}\hat H_{\rm imp}$. Assuming for simplicity that we are working at zero temperature,
$|\mathbf k,\sigma\rangle = c^\dagger_{\mathbf k\sigma}|\Omega\rangle$ is a configuration where a single particle with momentum $\mathbf k$, $|\mathbf k| > p_F$,
and spin $\sigma$ is superimposed on the filled Fermi sphere. Convince yourself that the state
$\hat H_{\rm imp}|\mathbf k,\sigma\rangle$ is a linear combination of a single-particle state and a two-particle-one-hole

37 N. Andrei, Diagonalization of the Kondo Hamiltonian, Phys. Rev. Lett. 45 (1980), 379-82, P. B. Wiegmann, 
Exact solution of the s-d exchange model (Kondo problem), J. Phys. C. 14 (1981), 1463-78. 

38 C. M. Varma and Y. Yafet, Magnetic susceptibility of mixed-valence rare-earth compounds, Phys. Rev. B 13 
(1976), 2950-4. 

39 Notice that, besides magnetic impurities, the system will typically contain non-magnetic scattering centers, i.e.
$\rho_{\rm total} = \rho_0 + \delta\rho$, where $\rho_0$ is the resistivity due to conventional disorder and we used the fact that the inverse
scattering times corresponding to different types of disorder add independently, $\tau_{\rm total}^{-1} = \tau_0^{-1} + \tau^{-1}$.

Figure 5.12 The two spin-scattering processes contributing to the electron T-matrix at second-order
perturbation theory. The second process involves a two-particle-one-hole configuration as an
intermediate state.


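state (see Fig. 5.12). Determine the excitation energies of the two configurations. Use
this result to compute $(\epsilon_{\mathbf k}^+ - \hat H_0)^{-1}\hat H_{\rm imp}|\mathbf k,\sigma\rangle$. Then show that the real part of the
second-order matrix element is given by

$$\mathrm{Re}\,\langle\mathbf k',\sigma'|\hat T^{(2)}|\mathbf k,\sigma\rangle = J^2\sum_{\mathbf p}\frac{1}{\epsilon_{\mathbf k} - \epsilon_{\mathbf p}}\Big[S(S+1) - \mathbf S\cdot\boldsymbol\sigma_{\sigma'\sigma}\big(\Theta(\epsilon_{\mathbf p}) - \Theta(-\epsilon_{\mathbf p})\big)\Big]. \qquad (5.50)$$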

(Hint: (i) In deriving this result, all “vacuum contributions” have to be discarded [for
they cancel against the partition function denominator of the properly normalized perturbation
series]. The defining property of a vacuum contribution is that it factorizes
into two independent ground state matrix elements. For example,
$\langle\Omega|c_{\mathbf k\sigma}\hat H_{\rm imp}(\epsilon^+ - \hat H_0)^{-1}c^\dagger_{\mathbf k'\sigma'}|\Omega\rangle\,\langle\Omega|\hat H_{\rm imp}|\Omega\rangle$ is of this type. (ii) Note that $(\mathbf S\cdot\boldsymbol\sigma)^2 = \mathbf S^2 - \boldsymbol\sigma\cdot\mathbf S$. (iii)
Taking the real part of the T-matrix amounts to omitting the infinitesimal imaginary
increment in the energy denominators.)

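At finite temperature, the Pauli blocking factors generalize to Fermi functions,
$\Theta(\epsilon) \to 1 - n_F(\epsilon)$ and $\Theta(-\epsilon) \to n_F(\epsilon)$, and we obtain

$$\mathrm{Re}\,\langle\mathbf k',\sigma'|\hat T^{(2)}|\mathbf k,\sigma\rangle = J^2\sum_{\mathbf p}\frac{1}{\epsilon_{\mathbf k} - \epsilon_{\mathbf p}}\Big[S(S+1) + (2n_F(\epsilon_{\mathbf p}) - 1)\,\mathbf S\cdot\boldsymbol\sigma_{\sigma'\sigma}\Big]. \qquad (5.51)$$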

Neglecting the first (non-singular, why?) contribution, one may note that the second
can be absorbed into a renormalization of the term first-order in $J$ derived above, viz.
$1/2\tau = \pi\nu c_{\rm imp}J_{\rm eff}^2S(S+1)$, where $J_{\rm eff} = J(1 + 2Jg(\epsilon,T))$ and the function

$$g(\epsilon,T) = \frac{\nu}{2}\,\mathrm P\!\int_{-D}^{D}d\xi\,\frac{\tanh(\beta\xi/2)}{\xi - \epsilon}$$

(P denoting the principal value) depends sensitively on the bandwidth $2D$ of the itinerant electrons
and on the energy $\epsilon = \epsilon_{\mathbf k}$ of the reference state. Noting that

$$\lim_{\epsilon\to 0}g(\epsilon,0) \simeq \nu\ln\frac{D}{|\epsilon|}, \qquad \lim_{T\to 0}g(0,T) \simeq \nu\ln\frac{D}{T},$$

the effective exchange constant can be written as

$$J_{\rm eff} = J\left(1 + 2\nu J\ln\frac{D}{\max(|\epsilon|,T)}\right).$$

On substituting into the expression for the scattering rate, one finds that the resistivity
diverges logarithmically as the temperature is lowered,

$$\rho(T) \simeq \rho(0)\left[1 - 4\nu J\ln\frac{T}{D}\right].$$
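
The logarithm quoted for g(0, T) can be checked numerically. The following Python sketch is not part of the original problem: it assumes the reading g(0, T) = ν ∫_0^D dξ tanh(ξ/2T)/ξ of the definition above (the integrand is even in ξ, so the principal value is harmless at ε = 0), and the names `g_over_nu` and `g_asymptotic` are illustrative. For D ≫ T the integral equals ln(D/T) plus the constant ln(2e^γ/π) ≈ 0.126, which is why the text quotes ν ln(D/T) to logarithmic accuracy.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant


def g_over_nu(T, D, n=100_000):
    """Return g(0, T)/nu = int_0^D dxi tanh(xi / 2T) / xi via the midpoint rule."""
    X = D / (2.0 * T)          # upper limit after substituting x = xi / (2T)
    h = X / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * h      # midpoints avoid the removable point x = 0
        total += math.tanh(x) / x
    return total * h


def g_asymptotic(T, D):
    """Large-D/T form: ln(D/T) + ln(2 e^gamma / pi)."""
    return math.log(D / T) + math.log(2.0 * math.exp(EULER_GAMMA) / math.pi)


if __name__ == "__main__":
    for ratio in (10.0, 100.0, 1000.0):
        print(ratio, g_over_nu(1.0, ratio), g_asymptotic(1.0, ratio))
```

The two columns agree closely once D/T is large, so the temperature dependence is indeed purely logarithmic.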

Finally, on combining with the effect of electron scattering from phonons, which provides 
a contribution to the resistivity that scales as $T^5$, one may confirm that the resistance
develops a shallow minimum. 
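
To see the minimum explicitly, one can add a phonon term a T^5 to the logarithmic impurity contribution and minimize. The sketch below is an illustration under stated assumptions: it takes the resistivity to be ρ0[1 - 4νJ ln(T/D)] + a T^5, and the parameter values (`rho0`, `nuJ`, `D`, `a`) are arbitrary choices, not values from the text. Setting the derivative to zero gives T_min = (4νJ ρ0 / 5a)^(1/5), which the grid search reproduces.

```python
import math


def resistivity(T, rho0=1.0, nuJ=0.05, D=1.0e4, a=1.0e-9):
    """Logarithmic Kondo term plus a T^5 phonon term (illustrative units)."""
    return rho0 * (1.0 - 4.0 * nuJ * math.log(T / D)) + a * T**5


def t_min_analytic(rho0=1.0, nuJ=0.05, a=1.0e-9):
    """Stationary point of the expression above: T^5 = 4 nuJ rho0 / (5 a)."""
    return (4.0 * nuJ * rho0 / (5.0 * a)) ** 0.2


def numerical_minimum(t_lo=1.0, t_hi=100.0, step=0.01):
    """Brute-force grid search for the temperature of minimal resistivity."""
    n = int(round((t_hi - t_lo) / step))
    temps = [t_lo + step * i for i in range(n + 1)]
    return min(temps, key=resistivity)


if __name__ == "__main__":
    print(numerical_minimum(), t_min_analytic())
```

Because T_min scales only with the fifth root of the impurity strength, the position of the minimum depends weakly on concentration, and the minimum itself is shallow.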

Although the perturbation theory suggests a divergence of the resistivity at low temperature,
the result remains valid only down to the characteristic Kondo temperature
scale,

$$T_K = D\exp\left[-\frac{1}{2\nu J}\right],$$

i.e. a temperature where the logarithmic correction becomes of the order of the first-order
term. Experimentally, it is found that, at temperatures $T < T_K$, the resistance
saturates. In fact, the low-energy physics can be understood in terms of the formation of
a resonance in which an electron in the itinerant band combines with the electron
on the impurity site to form a singlet bound state effectively screening the magnetic
impurity. Formally, one can explore the formation of the resonance state by developing
a variational analysis based on the trial wavefunction $|\psi\rangle = [a_0 + \sum_{\mathbf k\sigma}a_{\mathbf k}\,d^\dagger_\sigma c_{\mathbf k\sigma}]|\Omega\rangle$,
where $|\Omega\rangle$ represents a filled Fermi sea. In doing so, one may confirm that a bound state
develops with a binding energy $\Delta_K = T_K$ set by the Kondo temperature.
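
The statement that the logarithmic correction matches the first-order term at the Kondo temperature is easy to verify. In the sketch below the function names and the sample values of D and νJ are assumptions made for illustration; the formulas are T_K = D exp(-1/2νJ) and J_eff/J = 1 + 2νJ ln(D/T) as quoted above.

```python
import math


def kondo_temperature(D, nuJ):
    """T_K = D exp(-1 / (2 nu J)), with nuJ the dimensionless coupling."""
    return D * math.exp(-1.0 / (2.0 * nuJ))


def j_eff_over_j(T, D, nuJ):
    """Perturbative ratio J_eff / J = 1 + 2 nu J ln(D / T)."""
    return 1.0 + 2.0 * nuJ * math.log(D / T)


if __name__ == "__main__":
    D, nuJ = 1.0e4, 0.05
    T_K = kondo_temperature(D, nuJ)
    # At T = T_K the correction 2 nu J ln(D/T) equals one, i.e. J_eff = 2 J.
    print(T_K, j_eff_over_j(T_K, D, nuJ))
```

The exponential dependence on 1/νJ is what makes T_K so much smaller than the bandwidth for weak coupling.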

INFO Although the Kondo effect was originally conceived for the problem of magnetic impurities in metals,
lately the effects of Kondo resonance formation have been observed in artificial quantum
dot structures. 40 Here a microscopic quantum dot (with dimensions of ca. 1 μm) is sandwiched
between two metallic leads. In the so-called Coulomb blockade regime (see Problem 6.7), the
charging energy of the dot plays the role of the local Hubbard interaction while the leads act
as the Fermi sea. The development of a Kondo resonance below $T_K$ appears as a signature in
the quantum transport through the dot. In particular, for temperatures $T < T_K$, the differential
conductance $dI/dV$ shows a peak corresponding to the suppression of scattering off the impurity
state.


40 D. Goldhaber-Gordon, H. Shtrikman, D. Mahalu, et al., Kondo effect in a single-electron transistor, Nature
391 (1998), 156-9.

Answer:


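(a) To leading order in $\hat H_{\rm imp}$, the T-matrix is given by $\hat T \simeq \hat H_{\rm imp}$. Using the fact
that $\langle\mathbf k',\sigma'|\hat H_{\rm imp}|\mathbf k,\sigma\rangle = JL^{-d}\langle\sigma'|\mathbf S\cdot\boldsymbol\sigma|\sigma\rangle$ we then obtain

$$\begin{aligned}
\frac{1}{2\tau} &\simeq \pi N_{\rm imp}\sum_{\mathbf k',\sigma'}\big\langle|\langle\mathbf k,\sigma|\hat H_{\rm imp}|\mathbf k',\sigma'\rangle|^2\big\rangle_S\,\delta(\epsilon_{\mathbf k} - \epsilon_{\mathbf k'})\\
&= \pi c_{\rm imp}J^2L^{-d}\sum_{\sigma'}\big\langle\langle\sigma|\mathbf S\cdot\boldsymbol\sigma|\sigma'\rangle\langle\sigma'|\mathbf S\cdot\boldsymbol\sigma|\sigma\rangle\big\rangle_S\sum_{\mathbf k'}\delta(\epsilon_{\mathbf k} - \epsilon_{\mathbf k'})\\
&= \pi c_{\rm imp}\nu J^2\big\langle\langle\sigma|(\mathbf S\cdot\boldsymbol\sigma)(\mathbf S\cdot\boldsymbol\sigma)|\sigma\rangle\big\rangle_S = \pi J^2S(S+1)\,c_{\rm imp}\nu,
\end{aligned}$$

where in the last line we have used the fact that $\sum_{\mathbf k'}\delta(\epsilon_{\mathbf k} - \epsilon_{\mathbf k'}) = L^d\nu$ and
$\langle(\mathbf S\cdot\boldsymbol\sigma)(\mathbf S\cdot\boldsymbol\sigma)\rangle_S = S(S+1)\cdot\mathbf 1$.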
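
Both spin identities used here, (S·σ)^2 = S^2 - σ·S and the impurity average ⟨(S·σ)(S·σ)⟩_S = S(S+1), can be confirmed by brute force for the simplest case S = 1/2. The sketch below is illustrative only: it represents the impurity spin as one half of the Pauli matrices on the first tensor factor, and the helper names (`kron`, `matmul`, `impurity_average`) are assumptions of this example.

```python
# Pauli matrices as nested lists (complex entries where needed).
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]
PAULI = [SX, SY, SZ]


def kron(a, b):
    """Kronecker product a (x) b of two square matrices."""
    n, m = len(a), len(b)
    return [[a[i // m][j // m] * b[i % m][j % m] for j in range(n * m)]
            for i in range(n * m)]


def matmul(a, b):
    """Plain matrix product of two square matrices of equal size."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def s_dot_sigma():
    """S.sigma for S = 1/2: sum_i (sigma_i / 2) (x) sigma_i, impurity factor first."""
    total = [[0] * 4 for _ in range(4)]
    for p in PAULI:
        term = kron([[0.5 * x for x in row] for row in p], p)
        total = [[t + u for t, u in zip(rt, ru)] for rt, ru in zip(total, term)]
    return total


def impurity_average(m):
    """tr_S(m) / tr_S(1): normalized partial trace over the impurity factor."""
    return [[0.5 * sum(m[2 * k + s][2 * k + t] for k in range(2)) for t in range(2)]
            for s in range(2)]


S_SIG = s_dot_sigma()
LHS = matmul(S_SIG, S_SIG)                                   # (S.sigma)^2
RHS = [[(0.75 if i == j else 0.0) - S_SIG[i][j] for j in range(4)]
       for i in range(4)]                                    # S(S+1) - sigma.S
MAX_DIFF = max(abs(LHS[i][j] - RHS[i][j]) for i in range(4) for j in range(4))
AVERAGE = impurity_average(LHS)                              # expected: 0.75 * identity

if __name__ == "__main__":
    print(MAX_DIFF, AVERAGE)
```

The term linear in S drops out of the average because the trace of each spin component vanishes, leaving S(S+1) = 3/4 times the unit matrix in electron spin space.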

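(b) Substituting the explicit form of $\hat H_{\rm imp}$ and using the anti-commutation relations of
fermions, we obtain (summation convention)

$$\begin{aligned}
\hat H_{\rm imp}|\mathbf k',\sigma'\rangle &= J\sum_{\mathbf p_1\mathbf p_2}(\mathbf S\cdot\boldsymbol\sigma_{\mu_1\mu_2})\,c^\dagger_{\mathbf p_1\mu_1}c_{\mathbf p_2\mu_2}c^\dagger_{\mathbf k'\sigma'}|\Omega\rangle\\
&= J\Big[\sum_{\mathbf p_1}(\mathbf S\cdot\boldsymbol\sigma_{\mu_1\sigma'})\,c^\dagger_{\mathbf p_1\mu_1}|\Omega\rangle - \sum_{\mathbf p_1\mathbf p_2}(\mathbf S\cdot\boldsymbol\sigma_{\mu_1\mu_2})\,c^\dagger_{\mathbf p_1\mu_1}c^\dagger_{\mathbf k'\sigma'}c_{\mathbf p_2\mu_2}|\Omega\rangle\Big],
\end{aligned}$$

i.e. a linear combination of a one-particle state and a two-particle-one-hole state. Noting
that the energies of the two contributions are given by $\epsilon_{\mathbf p_1}$ and $\epsilon_{\mathbf p_1} + \epsilon_{\mathbf k'} - \epsilon_{\mathbf p_2}$, respectively,
multiplying by the “bra” $\langle\mathbf k,\sigma|\hat H_{\rm imp}$, and observing that the overlap between a one-particle
state and a two-particle-one-hole state vanishes, we obtain

$$\begin{aligned}
\mathrm{Re}\,\langle\mathbf k,\sigma|\hat T^{(2)}|\mathbf k',\sigma'\rangle &= J^2(\mathbf S\cdot\boldsymbol\sigma_{\sigma\mu_1'})(\mathbf S\cdot\boldsymbol\sigma_{\mu_1\sigma'})\sum_{\mathbf p_1\mathbf p_1'}\frac{\langle\Omega|c_{\mathbf p_1'\mu_1'}c^\dagger_{\mathbf p_1\mu_1}|\Omega\rangle}{\epsilon_{\mathbf k} - \epsilon_{\mathbf p_1}}\\
&\quad + J^2(\mathbf S\cdot\boldsymbol\sigma_{\mu_2'\mu_1'})(\mathbf S\cdot\boldsymbol\sigma_{\mu_1\mu_2})\sum_{\mathbf p_1\mathbf p_2\mathbf p_1'\mathbf p_2'}\frac{\langle\Omega|c^\dagger_{\mathbf p_2'\mu_2'}c_{\mathbf k\sigma}c_{\mathbf p_1'\mu_1'}c^\dagger_{\mathbf p_1\mu_1}c^\dagger_{\mathbf k'\sigma'}c_{\mathbf p_2\mu_2}|\Omega\rangle}{\epsilon_{\mathbf k} - \epsilon_{\mathbf p_1} - \epsilon_{\mathbf k'} + \epsilon_{\mathbf p_2}}\\
&\to J^2(\mathbf S\cdot\boldsymbol\sigma_{\sigma\mu})(\mathbf S\cdot\boldsymbol\sigma_{\mu\sigma'})\sum_{\mathbf p}\frac{\Theta(\epsilon_{\mathbf p})}{\epsilon_{\mathbf k} - \epsilon_{\mathbf p}} + J^2(\mathbf S\cdot\boldsymbol\sigma_{\mu\sigma'})(\mathbf S\cdot\boldsymbol\sigma_{\sigma\mu})\sum_{\mathbf p}\frac{\Theta(-\epsilon_{\mathbf p})}{\epsilon_{\mathbf k} - \epsilon_{\mathbf p}},
\end{aligned}$$

where the arrow indicates that vacuum contributions have been discarded and that we have set
$\epsilon_{\mathbf k'} = \epsilon_{\mathbf k}$ (elastic scattering). Application of
the spin identity then leads directly to Eq. (5.50).


Previously, we have seen how the field integral method can be deployed to formulate perturbative
approximation schemes to explore weakly interacting theories. In this chapter, we will learn how elements
of the perturbative approach can be formulated more efficiently by staying firmly within the framework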
of the field integral. More importantly, in doing so, we will see how the field integral provides a method 
for identifying and exploring non-trivial reference ground states - "mean-fields." A fusion of perturbative 
and mean-field methods will provide us with analytical machinery powerful enough to address a spectrum 
of rich applications ranging from superfluidity and superconductivity to metallic magnetism and the 
interacting electron gas. 


As mentioned in Chapter 5, the perturbative machinery is but one part of a larger frame- 
work. In fact, the diagrammatic series already contained hints indicating that a straight- 
forward expansion of a theory in the interaction operator might not always be an optimal 
strategy: all previous examples that contained a large parameter “N” - and usually it is
only problems of this type that are amenable to controlled analytical analysis - shared the 
property that the diagrammatic analysis bore structures of “higher complexity.” (For exam- 
ple, series of polarization operators appeared rather than series of the elementary Green 
functions, etc.) This phenomenon suggests that large-N problems should qualify for a more
efficient and, indeed, a more physical formulation. 

While these remarks appear to be largely methodological, the rationale behind search- 
ing for an improved theoretical formulation is, in fact, much deeper. With our previous 
examples, the perturbative expansion was benign. However, we already saw some glimpses 
indicating that more drastic things may happen. For example, for frequencies approach- 
ing the plasma frequency, the polarization operator of the weakly interacting electron gas 
developed an instability. The appearance of such instabilities usually indicates that one is 
formulating a theory around the “wrong” reference state (in that case, the uniformly filled 
Fermi sphere of the non-interacting electron gas). Thus, what we would like to develop 
is a theoretical framework (a) that is capable of detecting the “right” reference states, or 
“mean-fields” of a system, and that enables us (b) to efficiently apply perturbative methods 
around these states and, finally, (c) to do this in a language that draws upon the “physical” 
rather than the plain microscopic degrees of freedom as the fundamental units. 

To this end, in the following sections we develop a functional-integral-based approach that 
meets these criteria. In contrast to the previous chapters, the discussion here is decidedly 
biased towards concrete application to physically motivated problems. After the formulation
of the general strategy of field-integral-based mean-field methods, the next section addresses
a problem that we have encountered before, the weakly interacting electron gas. The exem- 
plification of the new concepts on a known problem enables us to better understand the 
intimate connection between the mean-field approach and straightforward perturbation the- 
ory. In subsequent sections we then turn to the discussion of problems that lie firmly outside 
the range of direct perturbative summation, superfluidity and superconductivity. 


6.1 Mean-field theory 

Roughly speaking, the functional approach to problems with a large parameter proceeds
according to the following program:

1. In the first place, one must identify the relevant structural units of the theory. (That part 
of the program can be efficiently carried out by the straightforward methods discussed 
above.) 

2. Secondly, it is necessary to introduce a new field - let us call it $\phi$ for concreteness - that
encapsulates the relevant degrees of freedom of the low-energy theory.

3. With this in hand, one can then trade integration over the “microscopic fields” for an integration
over $\phi$, a step often effected by an operation known as the Hubbard-Stratonovich
transformation.

4. The low-energy content of the theory can often be explored by subjecting the resulting
action $S[\phi]$ to a stationary phase analysis. (The justification for applying stationary phase
methods is provided by the existence of a large parameter $N \gg 1$.) Often, at this stage,
instabilities in the theory show up - an indication of a physically interesting problem!

5. Finally, the nature of the elementary (collective) excitations above the ground state can be 
explored by expanding the functional integral around the solution of the stationary phase 
equations - the “mean-field.” From this low-energy effective action, one can compute 
physical observables. 

In the next section, we will illustrate how such a program can be implemented on a specific
example, studied earlier by diagrammatic means.
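
Before turning to the electron gas, the role of the large parameter in step 4 can be seen in a zero-dimensional toy integral. The sketch below is an illustration chosen for this purpose, not an example from the text: writing N! = N^(N+1) ∫ dφ exp(-N S(φ)) with S(φ) = φ - ln φ, the stationary point is φ = 1, and Gaussian fluctuations around it reproduce Stirling's formula with a relative error that falls off as 1/N. The function names are assumptions of the example.

```python
import math


def log_factorial_saddle(N):
    """ln N! from a stationary phase evaluation of N^(N+1) int dphi exp(-N S(phi))."""
    phi_bar = 1.0                               # solves S'(phi) = 1 - 1/phi = 0
    action = phi_bar - math.log(phi_bar)        # S(phi_bar) = 1
    curvature = 1.0 / phi_bar**2                # S''(phi_bar) = 1
    gaussian = 0.5 * math.log(2.0 * math.pi / (N * curvature))
    return (N + 1) * math.log(N) - N * action + gaussian


def relative_error(N):
    """Relative deviation of the saddle-point estimate from the exact N!."""
    exact = math.lgamma(N + 1)                  # ln N!
    return abs(math.exp(log_factorial_saddle(N) - exact) - 1.0)


if __name__ == "__main__":
    for N in (5, 10, 100, 1000):
        print(N, relative_error(N))
```

The error shrinks roughly like 1/(12N), which is the sense in which a large parameter justifies truncating the expansion around the mean-field at Gaussian order.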


6.2 Plasma theory of the interacting electron gas 

To begin, let us return to the field theory of the interacting electron gas (see the action
defined in section 5.2),

$$S[\psi,\bar\psi] = \sum_p\bar\psi_{p\sigma}\left(-i\omega_n + \frac{\mathbf p^2}{2m} - \mu\right)\psi_{p\sigma} + \frac{T}{2L^d}\sum_{pp'q}\bar\psi_{p+q\sigma}\bar\psi_{p'-q\sigma'}V(\mathbf q)\psi_{p'\sigma'}\psi_{p\sigma},$$

where the pairwise Coulomb interaction assumes the form $V(\mathbf q) = 4\pi e^2/|\mathbf q|^2$. Here the
summation runs over the 4-momenta $p = (\omega_n,\mathbf p)$ comprising Matsubara and momentum
components, and summation over repeated spin indices is implied. Being quartic in the fields
$\psi_\sigma$, the Coulomb interaction prevents an explicit computation of the $\psi$-integral along the
lines of the free fermion integrals discussed in the previous chapter. However, it is actually
a straightforward matter to reduce, or “decouple,” the interaction operator, bringing it to
a form quadratic in the fields $\psi$. Let us multiply the functional integral by the “fat unity”

$$1 = \int D\phi\,\exp\left[-\frac{e^2\beta}{2L^d}\sum_q\phi_qV^{-1}(\mathbf q)\phi_{-q}\right],$$

where $\phi$ represents a real bosonic field variable, and a normalization constant has been
absorbed in the definition of the functional measure $D\phi$. Notice that the summation runs
over a 4-momentum $q = (\omega_m,\mathbf q)$ comprising a vectorial momentum and a bosonic Matsubara
frequency. Employing the variable shift $\phi_q \to \phi_q + ie^{-1}V(\mathbf q)\rho_q/\beta$, where $\rho_q = \sum_p\bar\psi_{p\sigma}\psi_{p+q\sigma}$
(summation on $\sigma$ implied), one obtains

$$1 = \int D\phi\,\exp\left[\sum_q\left(-\frac{e^2\beta}{2L^d}\phi_qV^{-1}(\mathbf q)\phi_{-q} - \frac{ie}{L^d}\rho_q\phi_{-q} + \frac{1}{2L^d\beta}\rho_qV(\mathbf q)\rho_{-q}\right)\right].$$

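The rationale behind this exercise can be seen in the last contribution to the exponent: this
term is equivalent to the quartic interaction contribution to the fermionic path integral,
albeit with opposite sign. Therefore, multiplication of our unity by Z leads to the field
integral $Z = \int D\phi\int D(\bar\psi,\psi)\,e^{-S[\phi,\bar\psi,\psi]}$, where

$$S[\phi,\bar\psi,\psi] = \frac{\beta}{8\pi L^d}\sum_q\phi_q\,\mathbf q^2\phi_{-q} + \sum_{pp'}\bar\psi_{p\sigma}\left[\left(-i\omega_n + \frac{\mathbf p^2}{2m} - \mu\right)\delta_{pp'} + \frac{ie}{L^d}\phi_{p-p'}\right]\psi_{p'\sigma} \qquad (6.1)$$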

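denotes the action, i.e. an expression that is free of quartic field interactions of $\psi_\sigma$. Before
proceeding, to acquire some intuition for the nature of the action it is helpful to rewrite S
in a real space representation. With $\phi_q = T\int_0^\beta d\tau\int d^dr\,e^{-i\mathbf q\cdot\mathbf r + i\omega_m\tau}\phi(\mathbf r,\tau)$, one may confirm
that (exercise)

$$S[\phi,\bar\psi,\psi] = \int d\tau\int d^dr\left\{\frac{1}{8\pi}(\partial\phi)^2 + \bar\psi_\sigma\left[\partial_\tau - \frac{\partial^2}{2m} - \mu + ie\phi\right]\psi_\sigma\right\}.$$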

Physically, $\phi$ couples to the electron degrees of freedom as a space/time-dependent (imaginary)
potential, while the first term reflects the Lagrangian energy density associated with
the electric component of the electromagnetic field. Before proceeding, let us step back and 
discuss the general philosophy of the manipulations that led from the original partition 
function to the two-field representation Eq. (6.1). 

INFO The sequence of manipulations developed above, i.e. the “decoupling” of a quartic interaction
through an auxiliary field, is known more generally as a Hubbard-Stratonovich transformation
(see our discussion of the Ising model on page 196). The essence of the transformation
is a straightforward manipulation of a Gaussian integral. To make this point more transparent,
let us reformulate the Hubbard-Stratonovich transformation in a notation that is not burdened
by the presence of model-specific constants. Consider an interaction operator of the form $S_{\rm int} =
V_{\alpha\beta\gamma\delta}\bar\psi_\alpha\psi_\beta\bar\psi_\gamma\psi_\delta$ (summation convention implied), where $\bar\psi$ and $\psi$ may be either bosonic or
fermionic field variables, the indices $\alpha,\beta,\ldots$ refer to an unspecified set of quantum numbers,
Matsubara frequencies, etc., and $V_{\alpha\beta\gamma\delta}$ is an interaction matrix element. Now, let us introduce
composite operators $\hat\rho_{\alpha\beta} \equiv \bar\psi_\alpha\psi_\beta$ to rewrite the interaction as $S_{\rm int} = V_{\alpha\beta\gamma\delta}\hat\rho_{\alpha\beta}\hat\rho_{\gamma\delta}$. The notation
can be simplified still further by introducing composite indices $m \equiv (\alpha\beta)$, $n \equiv (\gamma\delta)$, whereupon
the action $S_{\rm int} = \hat\rho_mV_{mn}\hat\rho_n$ acquires the structure of a generalized bilinear form. To reduce the
action to a form quadratic in the $\psi$s, one may simply multiply the exponentiated action by unity,
i.e.

$$e^{-\hat\rho_mV_{mn}\hat\rho_n} = \int D\phi\,\exp\left[-\frac14\phi_mV^{-1}_{mn}\phi_n\right]e^{-\hat\rho_mV_{mn}\hat\rho_n},$$


where $\phi \equiv \{\phi_m\}$ is bosonic and the Gaussian integral is normalized to unity. (Notice that here $V^{-1}_{mn}$ represents the matrix elements of the inverse matrix and
not the inverse $(V_{mn})^{-1}$ of individual matrix elements.) Finally, applying the variable change
$\phi_m \to \phi_m + 2i(V\hat\rho)_m$, where the notation $(V\hat\rho)_m$ is shorthand for $V_{mn}\hat\rho_n$, one obtains

$$\exp\left[-\hat\rho_mV_{mn}\hat\rho_n\right] = \int D\phi\,\exp\left[-\frac14\phi_mV^{-1}_{mn}\phi_n - i\phi_m\hat\rho_m\right],$$

i.e. the term quadratic in $\hat\rho$ is cancelled. 1 This completes the formulation of the Hubbard-Stratonovich
transformation. The interaction operator has been traded for (a) an integration
over an auxiliary field (b) coupled to a $\psi$-bilinear (the operator $\hat\rho_m$).

In essence, the Hubbard-Stratonovich transformation is tantamount to the Gaussian integral 
identity Eq. (3.13) but read in reverse. An exponentiated square is removed in exchange for 
a linear coupling. (In (3.13) we showed how terms linear in the integration variable can be 
removed.) 
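
For a single mode the transformation reduces to an ordinary Gaussian integral that can be checked by direct quadrature. The following sketch is illustrative: it takes one real variable with V > 0 and an ordinary number ρ in place of the bilinear, restores the normalization factor (4πV)^(-1/2) that the text absorbs into the measure, and uses the hypothetical name `hs_rhs` for the right-hand side.

```python
import cmath
import math


def hs_rhs(rho, V, half_width=60.0, n=20_000):
    """Trapezoid estimate of (4 pi V)^(-1/2) int dphi exp(-phi^2/(4V) - i phi rho)."""
    h = 2.0 * half_width / n
    total = 0j
    for k in range(n + 1):
        phi = -half_width + k * h
        weight = 0.5 if k in (0, n) else 1.0    # trapezoid end-point weights
        total += weight * cmath.exp(-phi * phi / (4.0 * V) - 1j * phi * rho)
    return total * h / math.sqrt(4.0 * math.pi * V)


if __name__ == "__main__":
    rho, V = 0.8, 1.5
    # Left-hand side of the identity: the exponentiated "interaction" exp(-rho V rho).
    print(hs_rhs(rho, V), math.exp(-V * rho * rho))
```

The quadratic dependence on ρ on the left is generated entirely by integrating out φ, which couples to ρ only linearly; the imaginary unit in the coupling is what produces the decaying exponential for positive V.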

To make the skeleton outlined above a well-defined prescription, one has to be more specific
about the meaning of the Gaussian integration over the kernel $\phi_mV^{-1}_{mn}\phi_n$, i.e. the integration
variables can be real or complex, and $V$ must be a positive matrix (which is usually the case
on physical grounds).

There is some freedom as to the choice of the integration variable. For example, the factor of
1/4 in front of the Gaussian weight $\phi_mV^{-1}_{mn}\phi_n$ was introduced for mere convenience (i.e. to
generate a coupling $\phi_m\hat\rho_m$ free of numerical factors). If one does not like to invert the matrix
kernel $V_{mn}$, one can scale $\phi_m \to (V\phi)_m$, whereupon the key formula reads

$$\exp\left[-\hat\rho_mV_{mn}\hat\rho_n\right] = \int D\phi\,\exp\left[-\frac14\phi_mV_{mn}\phi_n - i\phi_mV_{mn}\hat\rho_n\right].$$

EXERCISE Show that the passage from the Lagrangian to the Hamiltonian formulation of the 
Feynman path integral can be interpreted as a Hubbard-Stratonovich transformation. 

As defined, the Hubbard-Stratonovich transformation is exact. However, to make it a meaningful
operation, it must be motivated by some physical considerations. In our discussion above, we
split up the interaction by choosing $\hat\rho_{\alpha\beta}$ as a composite operator. However, there is clearly some
arbitrariness with this choice. Why not, for example, pair the fermion-bilinears according to
$(\bar\psi_\alpha\psi_\delta)(\bar\psi_\gamma\psi_\beta)$, or otherwise? The three inequivalent choices of pairing up operators are shown
in Fig. 6.1 where, as usual, the wavy line with attached field vertices represents the interaction,
and the dashed ovals indicate how the field operators are paired.


1 Here we have assumed that the matrix V is symmetric. If it is not, we can apply the relation $\hat\rho_mV_{mn}\hat\rho_n \equiv
\hat\rho^TV\hat\rho = \frac12\hat\rho^T(V + V^T)\hat\rho$ to symmetrize the interaction.

Figure 6.1 On the different channels of decoupling an interaction by Hubbard-Stratonovich transformation.
(a) Decoupling in the “density” channel; (b) decoupling in the “pairing” or “Cooper”
channel; and (c) decoupling in the “exchange” channel.


The version of the transformation discussed above corresponds to Fig. 6.1(a). That type of
pairing is sometimes referred to as decoupling in the direct channel. The designation becomes
more transparent if we consider the example of the spinful electron-electron interaction,

$$S_{\rm int} = \frac12\int d\tau\int d^dr\,d^dr'\,\bar\psi_\sigma(\mathbf r,\tau)\bar\psi_{\sigma'}(\mathbf r',\tau)V(\mathbf r - \mathbf r')\psi_{\sigma'}(\mathbf r',\tau)\psi_\sigma(\mathbf r,\tau),$$

i.e. here $\alpha = \beta = (\mathbf r,\tau,\sigma)$, $\gamma = \delta = (\mathbf r',\tau,\sigma')$, and $V_{\alpha\beta\gamma\delta} = V(\mathbf r - \mathbf r')$. The “direct” decoupling
proceeds via the most obvious choice, i.e. the density operator $\hat\rho(\mathbf r,\tau) = \bar\psi_\sigma(\mathbf r,\tau)\psi_\sigma(\mathbf r,\tau)$. One
speaks about decoupling in a “channel” because, as will be elucidated below, the propagator of
the decoupling field can be interpreted in terms of two Green function lines tied together by
multiple interactions, a sequential object reminiscent of a “channel.”

However, more important than the terminology is the fact that there are other choices for $\hat\rho$.
Decoupling in the exchange channel (Fig. 6.1(c)) is generated by the choice $\hat\rho_{\alpha\delta} \sim \bar\psi_\alpha\psi_\delta$ where,
in the context of the Coulomb interaction, the reversed pairing of field operators is reminiscent
of the exchange contraction generating Fock-type contributions. Finally, one may decouple in
the Cooper channel (Fig. 6.1(b)), $\hat\rho_{\alpha\gamma} \sim \bar\psi_\alpha\bar\psi_\gamma$, $\hat\rho_{\beta\delta} \sim \psi_\beta\psi_\delta$. Here, the pairing field is conjugate
to two creation operators. Below we will see that this type of decoupling is tailored to problems
involving superconductivity.

The remarks above may convey the impression of a certain arbitrariness inherent in the 
Hubbard-Stratonovich scheme. Indeed, the “correct” choice of decoupling can be motivated only 
by physical reasoning, not by plain mathematics. Put differently, the transformation as such is 
exact, no matter what channel we choose. However, later on we will want to derive an effective 
low-energy theory based on the decoupling field. In cases where one has accidentally decoupled 
in an “unphysical” channel, it will be difficult, if not impossible, to distill a meaningful low-energy
theory for the field $\phi$ conjugate to $\hat\rho$. Although the initial model still contains the full
microscopic information (by virtue of the exactness of the transformation) it is not amenable to 
further approximation schemes. 

In fact, one is frequently confronted with situations where more than one Hubbard- 
Stratonovich field is needed to capture the full physics of the problem. To appreciate this point, 
consider the Coulomb interaction in momentum space,

$$S_{\rm int}[\bar\psi,\psi] = \frac{T}{2L^d}\sum_{p_1,\ldots,p_4}\bar\psi_{\sigma p_1}\bar\psi_{\sigma'p_3}V(\mathbf p_1 - \mathbf p_2)\psi_{\sigma'p_4}\psi_{\sigma p_2}\,\delta_{p_1-p_2+p_3-p_4}. \qquad (6.2)$$

In principle, we can decouple this interaction in any one of the three channels discussed
above. However, “interesting” physics is usually generated by processes where one of the three
unbounded momenta entering the interaction vertex is small. Only these interaction processes
have a chance to accumulate an overall collective excitation of low energy (see our previous
discussion of the RPA, the interacting electron gas, and many of the examples to follow). It may
be instructive to imagine the situation geometrically: in the three-dimensional Cartesian space
of free momentum coordinates $(p_1,p_2,p_3)$ entering the vertex, there are three thin layers, where
one of the momenta is small, $(q,p_2,p_3)$, $(p_1,q,p_3)$, $(p_1,p_2,q)$, $|q| \ll |p_i|$. (Why not make all
momenta small? Because that would be in conflict with the condition that the Green functions
connecting to the vertex be close to the Fermi surface.) One will thus often choose to break down
the full momentum summation to a restricted summation over the small-momentum sublayers:

$$\begin{aligned}
S_{\rm int}[\bar\psi,\psi] \simeq \frac{T}{2L^d}\sum_{p,p',q}\Big(&\bar\psi_{\sigma p}\psi_{\sigma p+q}V(\mathbf q)\bar\psi_{\sigma'p'}\psi_{\sigma'p'-q} - \bar\psi_{\sigma p}\psi_{\sigma'p+q}V(\mathbf p' - \mathbf p)\bar\psi_{\sigma'p'+q}\psi_{\sigma p'}\\
&- \bar\psi_{\sigma p}\bar\psi_{\sigma'-p+q}V(\mathbf p' - \mathbf p)\psi_{\sigma p'}\psi_{\sigma'-p'+q}\Big).
\end{aligned}$$

Now, each of these three contributions has its own predestined choice of a slow decoupling field.
The first term should be decoupled in the direct channel $\hat\rho_{d,q} \sim \sum_p\bar\psi_{\sigma p}\psi_{\sigma p+q}$; the second in
the exchange channel $\hat\rho_{x,\sigma\sigma'q} \sim \sum_p\bar\psi_{\sigma p}\psi_{\sigma'p+q}$; and the third in the Cooper channel $\hat\rho_{c,\sigma\sigma'q} \sim
\sum_p\bar\psi_{\sigma p}\bar\psi_{\sigma'-p+q}$. One thus winds up with an effective theory that contains three independent
slow Hubbard-Stratonovich fields. (Notice that the decoupling fields in the exchange and in the
Cooper channel explicitly carry a spin structure.)

In our discussion of the high-density limit of the electron gas above, we effected a decoupling in 
the direct channel. That choice is made because, drawing on our previous discussion, we already 
know in advance that relevant contributions to the free energy of the system are generated by 
RPA-type contraction of operators $\hat\rho_{d,q} \sim \sum_p\bar\psi_{\sigma p}\psi_{\sigma p+q}$, where $q$ is small. If we had not known,
a more careful three-fold Hubbard-Stratonovich decoupling, followed (see below) by a careful 
analysis of the decoupled action, would have identified the density channel as relevant. More 
generally, if in doubt, one should decouple in all available channels, and let the mean-field analysis 
discriminate between the relevant fields. 

After this digression on the principles of the Hubbard-Stratonovich transformation, let us now 
return to the discussion of the electron gas. 

At the expense of introducing a second field, the Hubbard-Stratonovich transformation 
provides an action quadratic in the fermion fields. In this case, the fermion integration 
can be undertaken exactly. Making use of the Gaussian functional integral Eq. (4.19), one 
obtains 

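$$Z = \int D\phi\,\exp\left[-\frac{\beta}{8\pi L^d}\sum_q\phi_q\,\mathbf q^2\phi_{-q}\right]\det\left[-i\hat\omega + \frac{\hat{\mathbf p}^2}{2m} - \mu + \frac{ie}{L^d}\hat\phi\right],$$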

where, as usual, the circumflexes appearing in the argument of the determinant indicate 
that symbols have to be interpreted as operators (acting in the space of Matsubara and 
Hilbert space components). 

The standard procedure to deal with the determinants generated at intermediate stages
of the manipulation of a field integral is to simply re-exponentiate them. This is achieved
by virtue of the identity

$$\ln\det\hat A = \mathrm{tr}\ln\hat A, \qquad (6.3)$$

valid for arbitrary (non-singular) operators $\hat A$. 2 Thus, the quantum partition function takes
the form $Z = \int D\phi\,e^{-S[\phi]}$, where

$$S[\phi] = \frac{\beta}{8\pi L^d}\sum_q\phi_q\,\mathbf q^2\phi_{-q} - \mathrm{tr}\ln\left(-i\hat\omega + \frac{\hat{\mathbf p}^2}{2m} - \mu + \frac{ie}{L^d}\hat\phi\right). \qquad (6.4)$$

This is as far as purely formal exact manipulations can carry us. We have managed to trade
the integration over the interacting Grassmann field $\psi$ for an integration over an auxiliary
field $\phi$, a field that we believe encapsulates the relevant degrees of freedom of the model.
This completes steps 1, 2, and 3 of the general program outlined above.

The next step in the program is to subject the action to a stationary phase analysis, i.e.
to seek solutions of the set of saddle-point equations such that

$$\forall q \equiv (\mathbf q \neq 0, \omega):\qquad \frac{\delta S[\phi]}{\delta\phi_q} = 0.$$

The solution $\phi(\mathbf r,\tau) \leftrightarrow \phi_q$ is commonly referred to as the mean-field. This terminology can
be understood by inspection of the argument of the “tr ln” above. The structure $\hat{\mathbf p}^2/2m -
\mu + ie\hat\phi/L^d$, where $\phi$ is a fixed configuration (to be determined by solving the saddle-point
equations), resembles the Hamiltonian operator of particles subject to some background
potential, or “mean” field. The notation on the left-hand side of the saddle-point equations
indicates that our original interaction $V(\mathbf q)$ and, therefore, the decoupling field $\phi$ do not
possess a zero-momentum mode (a consequence of charge neutrality).

The concrete evaluation of the functional derivative $\delta S/\delta\phi$ leads us to question how one
differentiates the trace of the logarithm of the operator,

$$\hat G^{-1}[\phi] \equiv -i\hat\omega + \frac{\hat{\mathbf p}^2}{2m} - \mu + \frac{ie}{L^d}\hat\phi,$$

with respect to its argument. (The notation $\hat G^{-1}$ is motivated by its structural similarity
to an inverse Green function.) Owing to the presence of the trace, the differentiation can
be carried out as if $\hat G$ were a function: 3 $(\delta/\delta\phi_q)\,\mathrm{tr}\ln(\hat G^{-1}) = \mathrm{tr}(\hat G\,(\delta/\delta\phi_q)\hat G^{-1})$, where we


2 Equation (6.3) is readily established by switching to an eigenbasis, whereupon one obtains $\ln\det\hat A = \sum_a\ln\epsilon_a =
\mathrm{tr}\ln\hat A$, where $\epsilon_a$ are the eigenvalues of $\hat A$ and we have used the fact that the eigenvalues of $\ln\hat A$ are $\ln\epsilon_a$.

3 Consider an operator $\hat A(x)$ depending on some parameter $x$. Let $f(\hat A)$ be an arbitrary function ($f(\hat A) = \ln\hat A$ in
the present application). Then

$$\begin{aligned}
\partial_x\mathrm{tr}(f(\hat A)) &= \sum_n\frac{1}{n!}f^{(n)}(0)\,\mathrm{tr}(\partial_x\hat A^n) = \sum_n\frac{1}{n!}f^{(n)}(0)\,\mathrm{tr}\big((\partial_x\hat A)\hat A^{n-1} + \hat A(\partial_x\hat A)\hat A^{n-2} + \cdots\big)\\
&= \sum_n\frac{1}{(n-1)!}f^{(n)}(0)\,\mathrm{tr}(\hat A^{n-1}(\partial_x\hat A)) = \mathrm{tr}(f'(\hat A)\,\partial_x\hat A),
\end{aligned}$$

where, in the third equality, we have used the cyclic invariance of the trace.

omitted the argument $[\phi]$ for notational clarity. Then, making use of the identity

$$\mathrm{tr}\Big(\hat G\,\frac{\delta}{\delta\phi_q}\hat G^{-1}\Big) = \frac{2ie}{L^d}\sum_{q_1q_2}G[\phi]_{q_1q_2}\,\delta_{q_2-q_1,q},$$

the saddle-point equation assumes the form

$$\frac{\beta}{4\pi L^d}\,\mathbf q^2\phi_{-q} - \frac{2ie}{L^d}\sum_{q_1}G[\phi]_{q_1,q_1+q} = 0. \qquad (6.5)$$

Here, the factor of two appearing in front of the double summation accounts for the electron
spin. Further, we have used the fact that $\hat G^{-1}$ is linear in its argument $\phi$ and that, following
the rules of functional differentiation, $(\delta_{\phi_q}\hat\phi)_{q_1q_2} = \delta_{q_1-q_2,q}$. In fact (owing to the discreteness
of the momentum) we are not even differentiating functionals: $\hat\phi$ is a matrix in momentum
space. Its matrix elements are given by components $\phi_q$: $(\hat\phi)_{q_1q_2} = \phi_{q_1-q_2}$. Differentiation of
this matrix with respect to $\phi_q$ then produces another matrix with matrix elements one for
$q_1 - q_2 = q$ and zero otherwise. (Admittedly, it takes some time to get used to differentiation
formulae of this type.)
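
Both the identity ln det = tr ln and the rule for differentiating under the trace can be tested on a small matrix. The sketch below is only an illustration: the 2x2 symmetric matrix family `A(x)` is an arbitrary choice made for this example, and the derivative is compared with a central finite difference.

```python
import math


def A(x):
    """An arbitrary symmetric, positive 2x2 matrix depending on a parameter x."""
    return [[2.0 + x, 0.5 * x], [0.5 * x, 1.0 + x * x]]


def dA(x):
    """Element-wise derivative of A with respect to x."""
    return [[1.0, 0.5], [0.5, 2.0 * x]]


def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def tr_ln(m):
    """tr ln M for a symmetric positive 2x2 matrix, from its two eigenvalues."""
    mean = 0.5 * (m[0][0] + m[1][1])
    delta = math.hypot(0.5 * (m[0][0] - m[1][1]), m[0][1])
    return math.log(mean + delta) + math.log(mean - delta)


def tr_inv_times(m, d):
    """tr(M^{-1} D) for 2x2 matrices, using the explicit inverse."""
    det = det2(m)
    inv = [[m[1][1] / det, -m[0][1] / det], [-m[1][0] / det, m[0][0] / det]]
    return sum(inv[i][k] * d[k][i] for i in range(2) for k in range(2))


X0, EPS = 0.3, 1e-6
NUMERIC = (tr_ln(A(X0 + EPS)) - tr_ln(A(X0 - EPS))) / (2.0 * EPS)
ANALYTIC = tr_inv_times(A(X0), dA(X0))     # tr(f'(A) dA/dx) with f = ln

if __name__ == "__main__":
    print(tr_ln(A(X0)), math.log(det2(A(X0))))
    print(NUMERIC, ANALYTIC)
```

The agreement holds even though A and its derivative do not commute, which is the point of the footnote: under the trace, cyclic invariance lets one differentiate as if the operator were a number.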

Equations of this form are solved not from scratch, but by making a physically motivated
ansatz, i.e. by guessing the solution! Naturally, the first guess is a homogeneous solution,
$\phi(\mathbf r,\tau) \equiv \bar\phi = \text{const.}$, i.e. one relies on the picture that a spatially and temporally varying
field configuration is energetically more costly than a constant one, and therefore cannot
provide a stable extremum.


INFO Be aware that there exist translationally invariant problems with inhomogeneous mean- 
fields; or a homogeneous solution exists, but it is energetically inferior to a textured field con- 
figuration. Indeed, there may be sets of degenerate solutions, etc. Often, when new theories 
describing an unknown territory have been developed, the search for the “correct” mean-field 
turns out to be a matter of long, and sometimes controversial, research. 


In the present context, spatial and temporal homogeneity translates to the solution $\phi_{\mathbf q,\omega} = 0$
if either $\omega$ or $\mathbf q$ is non-zero. Indeed, such an ansatz solves the mean-field equation: assuming
that all fluctuating components of $\phi$ are switched off, the Green function operator becomes
diagonal in momentum space, $(\hat G[\phi])_{q_1,q_2} \propto \delta_{q_1,q_2}$. We thus see that, for non-vanishing $q$,
both terms in the equation vanish. Moreover, since charge neutrality requires $\phi_{q=0} = 0$ one
may identify $\phi = 0$ as the solution of the mean-field equation, settling step 4 of the general
program.

We now proceed to expand the functional in fluctuations around $\phi = 0$. Since the mean-
field solution vanishes, it makes no sense to introduce new notation, i.e. we will denote the
fluctuations again by the symbol $\phi$. As regards the first term in the action (6.4), it already
has a quadratic form. The logarithmic contribution can be expanded as if we were dealing
with a function (again, a consequence of the trace), i.e.

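$$\operatorname{tr}\ln\hat G^{-1} = \operatorname{tr}\ln\hat G_0^{-1} + ie\,\operatorname{tr}(\hat G_0\hat\phi) + \frac{e^2}{2}\operatorname{tr}(\hat G_0\hat\phi\,\hat G_0\hat\phi) + \cdots,$$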

where $\hat G_0^{-1} \equiv i\hat\omega - \hat{\mathbf{p}}^2/2m + \mu$ defines the momentum and frequency diagonal operator $\hat G_0$ whose matrix
elements give the free Green function of the electron gas. Now, let us discuss the terms
appearing on the right-hand side in turn. Being $\phi$-independent, the first term generates an
overall constant multiplying the functional integral, i.e. a constant that must describe the
non-interacting content of the theory. Indeed, one may note that $e^{\operatorname{tr}\ln\hat G_0^{-1}} = e^{-\operatorname{tr}\ln\hat G_0} =
\det\hat G_0^{-1} \equiv Z_0$ is just the partition function of the non-interacting electron gas. Linear in $\phi$,
the second term of the expansion must, by virtue of the mean-field analysis, vanish after all.
(We are expanding around an extremum! Exercise: Write out the momentum representation
of the first-order contribution to confirm that it vanishes identically.) The third term is the
interesting one. Remembering that $\phi$ couples to the theory as a voltage, this term describes
the way potential fluctuations are affected by the presence of the electron gas, i.e. it must
encode screening.

To resolve this connection, let us make the momentum dependence of the second-order 
term explicit (exercise): 

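$$\frac{e^2}{2}\operatorname{tr}(\hat G_0\hat\phi\,\hat G_0\hat\phi) = \frac{e^2}{L^3}\sum_{p,q} G_{0,p}\,\phi_q\,G_{0,p+q}\,\phi_{-q} = \frac{e^2}{2T}\sum_q \Pi_q\,\phi_q\phi_{-q},$$

where, once again, we encounter the polarization operator (5.28). Combining with the first
term in the action, one finally obtains

$$S_{\mathrm{eff}}[\phi] = \frac{1}{2T}\sum_q \phi_q\left(\frac{q^2}{4\pi} - e^2\Pi_q\right)\phi_{-q} + O(\phi^4), \tag{6.6}$$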

where we note that odd powers of $\phi$ vanish by the symmetry of the problem (exercise).
Indeed, the form of the effective action (6.6) is suggestive. Ignoring the coupling to the
electron gas, $\phi$ represents a field with bare “propagator” $\sim q^{-2}$, i.e. the long-range
correlation mediated by the electric field in vacuum. Coupling the field to a medium leads
to the appearance of a “self-energy” $\sim \Pi_q$. As we saw above, the self-energy converts the
long-range power law correlation into something exponential - screening. The form of the
screened propagator exactly coincides with the effective RPA interaction (5.34) derived
diagrammatically above. In passing we note that the structures emerging in the functional
integral analysis support a unified way of “reading” the theory: there appear “propagators,”
“self-energies,” etc. irrespective of the concrete context, i.e. irrespective of whether we are
dealing with a fermionic field ($\psi$) or a bosonic field ($\phi$).
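
To make the statement about screening concrete, the following sketch compares the bare and the screened real-space potentials. It rests on an assumption that is not derived in this section: in the static, long-wavelength (Thomas-Fermi) limit the polarization operator is replaced by a negative constant, $\Pi_q \to -\nu$, so that $(q^2/4\pi - e^2\Pi_q)^{-1} \to 4\pi/(q^2 + \kappa^2)$ with $\kappa^2 = 4\pi e^2\nu$. The names `kappa`, `bare_potential` and `screened_potential` are illustrative only, and units are chosen arbitrarily.

```python
import math

def bare_potential(r):
    """Three-dimensional Fourier transform of 4*pi/q^2: the Coulomb law 1/r."""
    return 1.0 / r

def screened_potential(r, kappa):
    """Three-dimensional Fourier transform of 4*pi/(q^2 + kappa^2): exp(-kappa*r)/r.

    kappa is an assumed inverse screening length (Thomas-Fermi limit)."""
    return math.exp(-kappa * r) / r

kappa = 1.0  # assumed value, arbitrary units
for r in (0.5, 1.0, 2.0, 4.0, 8.0):
    ratio = screened_potential(r, kappa) / bare_potential(r)
    print(f"r = {r:4.1f}   bare = {bare_potential(r):.4f}   "
          f"screened = {screened_potential(r, kappa):.3e}   ratio = {ratio:.3e}")
```

The ratio of the two potentials is $e^{-\kappa r}$: a constant "mass" term in the propagator is what turns the power law into an exponential.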

From here it is a one-line calculation to reproduce the result for the RPA free energy of
the electron gas discussed previously. Gaussian integration over the field $\phi$ (step 5 of the
program) leads to the expression $Z_{\mathrm{RPA}} = Z_0\prod_q\left(1 - \frac{4\pi e^2}{q^2}\Pi_q\right)^{-1/2}$, where we have noted that
the $\phi$-integration was normalized to unity, i.e. for $\Pi = 0$, the integral collapses to unity.
Taking the logarithm, we obtain the free energy

$$F_{\mathrm{RPA}} = -T(\ln Z_{\mathrm{RPA}} - \ln Z_0) = \frac{T}{2}\sum_q \ln\left(1 - \frac{4\pi e^2}{q^2}\Pi_q\right),$$
in agreement with Eq. (5.27).

At this point, it is instructive to pause and compare the two approaches to the problem:
diagrammatics and field integration. We first note that the functional integral formulation
indeed leads to the “reduced” description we sought: the minimal degree of freedom ($\phi$)
directly couples to the physically relevant entities of the theory ($\Pi_q$). The downside is
that, to formulate the approach, we had to go through some preparatory analysis, notably
work that required prior physical insight into the problem (i.e. the correct choice of the
Hubbard-Stratonovich decoupling). However, this turned out to be an effort well invested.
After the identification of $\phi$ as the “appropriate” field, the further construction of the
theory proceeded along the lines of a largely “automated program” (seeking saddle-points,
expanding, etc.). In particular, there was no need to do battle with combinatorial problems.
Further, in functional integral approaches, the risk of missing relevant contributions, or
diagrams, in the expansion of the theory is far less pronounced than in direct diagrammatic
expansions. But undoubtedly the most important advantage of the functional integral route
is its extensibility. For example, an expansion of the theory to higher orders in $\phi$ would
have generated an interacting theory of voltage fluctuations. Correlations on the level of
that theory correspond to diagrams beyond the RPA level, i.e. diagrams whose direct and
error-free summation would require more refined skills, a program beyond the scope of the
present text.

The mean-field optimizing the problem above was particularly simple, with $\phi = 0$. More
interesting situations arise when one encounters non-vanishing mean-field configurations, i.e.
the perturbation theory has to be organized around a state different from the trivial vacuum
of the theory. Referring to Problem 6.7, we will discuss how, under certain circumstances,
Coulomb interaction can favor an itinerant magnetic state.


6.3 Bose-Einstein condensation and superfluidity

Previously, we have considered the influence of weak Coulomb interaction on the properties
of the electron gas. In the following, our goal will be to consider the phases realized by a
weakly interacting Bose gas. To this end, let us introduce the quantum partition function
$Z = \int D(\bar\psi,\psi)\, e^{-S[\bar\psi,\psi]}$, where

$$S[\bar\psi,\psi] = \int d^dr \int d\tau \left[\bar\psi(\mathbf{r},\tau)(\partial_\tau + \hat H_0 - \mu)\psi(\mathbf{r},\tau) + \frac{g}{2}\big(\bar\psi(\mathbf{r},\tau)\psi(\mathbf{r},\tau)\big)^2\right]. \tag{6.7}$$

Here $\psi$ represents a complex field subject to the periodic boundary condition $\psi(\mathbf{r},\beta) = \psi(\mathbf{r},0)$.
The functional integral $Z$ describes the physics of a system of bosonic particles in $d$ dimensions
subject to a repulsive contact interaction of strength $g > 0$. For the moment the specific
structure of the one-body operator $\hat H_0$ need not be specified.

The most remarkable phenomena displayed by systems of this type are Bose-Einstein
condensation and superfluidity. However, contrary to a widespread belief, these two effects
do not depend on each other: superfluidity can arise without condensation and vice versa.
We begin our discussion with the more elementary of the two phenomena.


Satyendranath Bose 1894-1974

He undertook important work in quantum theory, in particular on Planck’s black body
radiation law. His work was enthusiastically endorsed by Einstein. He also published on
statistical mechanics, leading most famously to the concept of Bose-Einstein statistics.
Dirac coined the term "boson" for particles obeying such statistics.


Bose-Einstein condensation 


As may be recalled from elementary statistical mechanics, at sufficiently low temperatures the
ground state of a bosonic system can involve the condensation of a macroscopic fraction of
particles into a single state. This phenomenon, predicted in a celebrated work by Einstein, is
known as Bose-Einstein condensation. To see how this phenomenon is born out of the functional
integral formalism, let us temporarily switch off the interaction and turn to the basis in which
the one-particle Hamiltonian is diagonal. Expressed in the frequency representation, the
partition function of the non-interacting system is given by


Albert Einstein 1879-1955

Nobel Laureate in Physics in 1921 “for his services to theoretical physics, and especially for
his discovery of the law of the photoelectric effect." He is perhaps best known for his theory
of relativity and specifically mass-energy equivalence, expressed by the equation $E = mc^2$.
His work on the low-temperature behavior of the bosonic quantum gas is published in
A. Einstein, Quantentheorie des einatomigen idealen Gases, Sitzungsber. Preuss. Akad. Wiss.
(1925), 261-7. (Image © The Nobel Foundation.)



$$Z_0 \equiv Z|_{g=0} = \int D(\bar\psi,\psi)\,\exp\left[-\sum_{a,n}\bar\psi_{an}\,(-i\omega_n + \epsilon_a - \mu)\,\psi_{an}\right].$$


Without loss of generality, we may assume that the eigenvalues $\epsilon_a \ge 0$ are non-negative, with
a ground state $\epsilon_0 = 0$.$^4$ (In contrast to the fermionic systems discussed above, we should
not have in mind low-energy excitations superimposed on high-energy microscopic degrees
of freedom. Here, everything will take place in the vicinity of the ground state of the
microscopic single-particle Hamiltonian.) Further, we note that, to ensure stability, the
chemical potential determining the number of particles in the system must be negative for,
otherwise, the Gaussian weight corresponding to the low-lying states $\epsilon_a < \mu$ would change
sign, resulting in an ill-defined theory.

4 The chemical potential $\mu$ can always be adjusted to meet this condition.

From our discussion of Section 4.2 we recall that the number of particles in the system is
set by the relation

$$N(\mu) = -\partial_\mu F = -T\sum_{n,a}\frac{1}{i\omega_n - \epsilon_a + \mu} = \sum_a n_{\mathrm B}(\epsilon_a),$$

where, as usual, $n_{\mathrm B}(\epsilon) = (e^{\beta(\epsilon-\mu)} - 1)^{-1}$ denotes the Bose distribution. For a given number
of particles, this equation determines the temperature dependence of the chemical potential,
$\mu(T)$. As the temperature is reduced, the distribution function controlling the population of
individual states decreases. Since the number of particles must be kept constant, this scaling
must be counter-balanced by a corresponding increase in the chemical potential.
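
As a plausibility check of the relation between the frequency sum and the Bose distribution, the sketch below evaluates the Matsubara sum for a single level numerically. It makes one simplification that should be kept in mind: the sum is truncated symmetrically in $n$, whereas the field integral carries an implicit convergence factor $e^{i\omega_n 0^+}$. The symmetric sum therefore returns $n_{\mathrm B} + 1/2$ rather than $n_{\mathrm B}$; the extra $1/2$ is exactly what the convergence factor removes. The function names and the parameter values are illustrative.

```python
import math

def symmetric_matsubara_sum(xi, T, n_max=200_000):
    """T * sum_{n=-n_max}^{n_max} 1/(-i*w_n + xi), bosonic w_n = 2*pi*n*T.

    The imaginary parts of the +n and -n terms cancel pairwise, which leaves
    T * sum_n xi / (w_n**2 + xi**2)."""
    total = 1.0 / xi  # n = 0 term
    for n in range(1, n_max + 1):
        w = 2.0 * math.pi * n * T
        total += 2.0 * xi / (w * w + xi * xi)
    return T * total

def bose(xi, T):
    """Bose distribution for xi = epsilon - mu > 0 (units with k_B = 1)."""
    return 1.0 / math.expm1(xi / T)

xi, T = 0.7, 0.5  # assumed sample values
s = symmetric_matsubara_sum(xi, T)
print("symmetric frequency sum :", s)
print("n_B(xi) + 1/2           :", bose(xi, T) + 0.5)
```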

Below a certain critical temperature $T_c$, even the maximum value of the chemical potential,
$\mu = 0$, will not suffice to keep the distribution function $n_{\mathrm B}(\epsilon_{a\neq 0})$ large enough to
accommodate all particles in the states of non-vanishing energy, i.e. $\sum_{a>0} n_{\mathrm B}(\epsilon_a)|_{\mu=0} \overset{T<T_c}{\equiv}
N_1 < N$. That is, below the critical temperature, the chemical potential stays constant at $\mu = 0$
(see the figure). As a result, a macroscopic number of particles, $N - N_1$, must accumulate
in the single-particle ground state: Bose-Einstein condensation.
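
The temperature dependence of the chemical potential described above can be traced numerically. The sketch below assumes a three-dimensional free-particle spectrum, units with $k_{\mathrm B} = 1$, and temperatures measured in units of $T_c$; under these assumptions the number equation for $T > T_c$ reduces to $g_{3/2}(z) = \zeta(3/2)\,(T_c/T)^{3/2}$ with fugacity $z = e^{\mu/T}$. The helper names `g32` and `mu_over_Tc` are illustrative, and the truncated series is only reliable for temperatures not too close to $T_c$.

```python
import math

ZETA_3_2 = 2.612375348685488  # Riemann zeta(3/2)

def g32(z, max_terms=20000):
    """Bose function g_{3/2}(z) = sum_{k>=1} z**k / k**1.5 for 0 <= z < 1."""
    total, zk = 0.0, 1.0
    for k in range(1, max_terms + 1):
        zk *= z
        total += zk / k ** 1.5
        if zk < 1e-17:
            break
    return total

def mu_over_Tc(t):
    """Chemical potential in units of T_c at reduced temperature t = T/T_c."""
    if t <= 1.0:
        return 0.0  # pinned at its maximum value below the transition
    target = ZETA_3_2 / t ** 1.5
    lo, hi = 0.0, 1.0  # bisection on the fugacity z
    for _ in range(50):
        mid = 0.5 * (lo + hi)
        if g32(mid) < target:
            lo = mid
        else:
            hi = mid
    return t * math.log(0.5 * (lo + hi))

for t in (0.5, 1.0, 1.2, 1.5, 2.0, 3.0):
    print(f"T/T_c = {t:3.1f}   mu/T_c = {mu_over_Tc(t):8.4f}")
```

On cooling, the printed values rise monotonically towards zero and stay there for $T \le T_c$, in line with the argument in the text.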

EXERCISE For a three-dimensional free particle spectrum, $\epsilon_{\mathbf{k}} = \hbar^2k^2/2m$, show that the critical
temperature is set by $T_c = c_0\hbar^2/ma^2$, where $a = \rho^{-1/3}$ is the average inter-particle spacing, and
$c_0$ is a constant of order unity. Show that for temperatures $T < T_c$, the density of particles in
the condensate ($\mathbf{k} = 0$) is given by $\rho_0(T) = \rho\,[1 - (T/T_c)^{3/2}]$.
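
For readers who wish to cross-check their result for the exercise numerically, the sketch below evaluates the constant $c_0$ using the standard ideal-gas result $T_c = (2\pi\hbar^2/m)\,(\rho/\zeta(3/2))^{2/3}$ (with $k_{\mathrm B} = 1$), which is quoted here as an assumption rather than derived. The function names are illustrative.

```python
import math

def zeta(s, n=1000):
    """Riemann zeta for s > 1: direct sum up to n-1 plus an Euler-Maclaurin tail."""
    head = sum(k ** -s for k in range(1, n))
    tail = n ** (1 - s) / (s - 1) + 0.5 * n ** -s + s * n ** (-s - 1) / 12.0
    return head + tail

# T_c = c0 * hbar^2 / (m a^2) with a = rho^(-1/3)
c0 = 2.0 * math.pi / zeta(1.5) ** (2.0 / 3.0)

def condensate_fraction(t):
    """rho_0 / rho at reduced temperature t = T/T_c for the ideal gas."""
    return max(0.0, 1.0 - t ** 1.5)

print("zeta(3/2) =", zeta(1.5))
print("c0        =", c0)
for t in (0.0, 0.25, 0.5, 0.75, 1.0):
    print(f"T/T_c = {t:4.2f}   rho_0/rho = {condensate_fraction(t):.4f}")
```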



Since its prediction in the early 1920s, the phenomenon of Bose-Einstein condensation has been
a standard component of undergraduate texts. However, it took some seven decades before the
condensation of bosonic particles was directly$^5$ observed in experiment. The reason for this
delay is that the critical condensation temperature of particles (atoms) that are comfortably
accessible to experiment is absurdly low.

5 Here, by “direct” we refer to the controlled preparation of a state of condensed massive bosonic particles. There
are numerous “indirect” manifestations of condensed states, e.g. the anomalous properties of the helium liquids
at low temperatures, or of Cooper-pair condensates in superconductors.

6 M. H. Anderson, J. R. Ensher, M. R. Matthews, C. E. Wieman, and E. A. Cornell, Observation of Bose-Einstein
condensation in a dilute atomic vapor, Science 269 (1995), 198-201.


Eric A. Cornell 1961- (left), Wolfgang Ketterle 1957- (center), and Carl E. Wieman 1951- (right)

Joint recipients of the 2001 Nobel Prize in Physics “for the achievement of Bose-Einstein condensation in
dilute gases of alkali atoms, and for early fundamental studies of the properties of the condensates."
(Images © The Nobel Foundation.)


Figure 6.2 Spectroscopic images of the Bose-Einstein condensation process for three values of
temperature (400 nK, 200 nK, and 50 nK from left to right). (Figure courtesy of the JILA Institute,
University of Colorado.)

INFO In 1995 the groups of Cornell and Wieman at Colorado University and, soon after, Ketterle
at MIT succeeded in cooling a system of rubidium atoms down to temperatures of 20 billionths(!)
of a kelvin.$^6$ To reach these temperatures, a gas of rubidium atoms was caught in a magnetic
trap, i.e. a configuration of magnetic field gradients that couple to the magnetic moments of the
atoms so as to keep the system spatially localized.

The gas of atoms was then brought to a temperature of $O(10^{-5})$ K - still much too hot to
condense - by “laser cooling” : crudely speaking, a technique where atoms, subjected to a suitably 
adjusted ray of monochromatic light, may transmit more of their kinetic energy to the photons 
than they get back. To lower the temperature still further, the principle of “evaporative cooling” 
was applied: by lowering the potential well of the trap, a fraction of the atoms, namely those 
with large kinetic energy, is allowed to escape. The remaining atoms have a low kinetic energy 
and, therefore, a low temperature. What sounds like a simple recipe actually represents a most 
delicate experimental procedure. (For example, if the trap potential is lowered too strongly, all 
atoms escape and there is nothing left to condense. If, on the other hand, trapping is too strong, 
the atoms remain too hot, etc.) However, after more than a decade of intensive experimental 
preparation, the required temperatures have been reached. 

Spectroscopic images of the Bose-Einstein condensation process are shown in Fig. 6.2 at three 
temperatures. The peak in the density distribution signals the onset of condensation. On low- 
ering the temperature, one may observe the transition to a condensed phase by monitoring the 
formation of a peak in the density distribution. The preparation of a Bose-Einstein condensed 
state of matter was recognized with the award of the 2001 Nobel prize in physics. Since 1995, 
research on atomic condensates has blossomed into a broad arena of research. Already, it is 
possible to prepare complex states of Bose condensed matter such as atomic vortices in rotat- 
ing Bose-Einstein condensates, condensates in different dimensionalities, or even an artificial 
crystalline state of matter (see Fig. 6.3). Regrettably, a detailed discussion of these interesting 
developments is beyond the scope of the present text. Those interested in learning more about 
this area are referred to the many reviews of the field. 

With this background, let us now try to understand how the phenomenon of Bose-Einstein
condensation can be implemented in the functional integral representation. Evidently, the
characteristics of the condensate will be described by the zero field component $\psi_0(\tau)$. The
problem with this zero mode is that, below the condensation transition, its action appears
to be unbounded: both the chemical potential and the eigenvalue are zero. This means that
the action of the zero Matsubara component $\psi_{0,0}$ vanishes. We will deal with this difficulty
in a pragmatic way. That is, we will treat $\psi_0(\tau)$ not as an integration variable but rather
as a time-independent Lagrange multiplier to be used to fix the number of particles below
the transition. More precisely, we introduce a reduced action of the form

$$S_0[\bar\psi,\psi] = -\bar\psi_0\,\beta\mu\,\psi_0 + \sum_{a\neq 0,\,n}\bar\psi_{an}\,(-i\omega_n + \epsilon_a - \mu)\,\psi_{an},$$

where we did not yet set $\mu = 0$ (since we still need $\mu$ as a differentiation variable). To
understand the rationale behind this simplification one may note that

$$N = -\partial_\mu F_0|_{\mu=0^-} = T\,\partial_\mu \ln Z_0|_{\mu=0^-} = \bar\psi_0\psi_0 - T\sum_{a\neq 0,\,n}\frac{1}{i\omega_n - \epsilon_a} = \bar\psi_0\psi_0 + N_1, \tag{6.8}$$

determines the number of particles. According to this expression, $\bar\psi_0\psi_0 = N_0$ sets the number
of particles in the condensate. Now, what enables us to regard $\psi_0$ as a time-independent
field? Remembering the construction of the path integral, we note that the introduction
of time-dependent fields, or “time slicing,” was necessitated by the fact that the operators
appearing in the Hamiltonian of a quantum theory do not, in general, commute. (Otherwise
we could have decoupled the expression $\operatorname{tr}(e^{-\beta(\hat H-\mu\hat N)(a^\dagger,a)}) \to \int d(\bar\psi,\psi)\, e^{-\beta(H-\mu N)(\bar\psi,\psi)}$
in terms of a single coherent state resolution, i.e. a “static” configuration.) Reading this
observation in reverse, we conclude that the dynamic content of the field integral represents
the quantum character of a theory. (Alluding to this fact, the temporal fluctuations of field
variables are often referred to as quantum fluctuations.) Conversely, a static approximation
in a field integral, $\psi(\tau) = \psi_0 = \mathrm{const.}$, amounts to replacing a quantum degree of freedom by
its classical approximation.

(To distinguish them from quantum fluctuations, fluctuations in the “classical” static sector of
the theory are called thermal fluctuations.) To justify the approximation of $a_0 \leftrightarrow \psi_0$ by a
classical object, notice that, upon condensation, $N_0 = \langle a_0^\dagger a_0\rangle$ will assume “macroscopically
large” values. On the other hand, the commutator $[a_0, a_0^\dagger] = 1$ continues to be of $O(1)$. It
thus seems to be legitimate to neglect all commutators of the zero operator $a_0$ in comparison
with its expectation value - a classical approximation.$^7$


Figure 6.3 The creation of an artificial mono-atomic crystal from a Bose-Einstein condensate.
A system of bosons in a condensed state - the state signaled by the pronounced coherent intensity
peaks in the left portion of the figure - is exposed to a “grid” formed by two crossing fields of
intense laser radiation. For strong enough field amplitudes, the electromagnetic radiation interferes,
presenting a periodic lattice of deep potential minima. These minima become occupied by
one atom each. The result is an artificial mono-atomic crystal, characterized by the absence of
coherent superposition of atomic wavefunction amplitudes, i.e. an equilibrated density profile (middle).
If the laser intensity is lowered, the atoms re-arrange to the characteristic density distribution
of the condensate (right). (Source: Courtesy of Max-Planck-Institut für Quantenoptik, Garching,
Germany.)

Now, we are still left with the problem that the $\psi_0$-integration appears to be undefined.
The way out is to remember that the partition function should extend over those states
that contain an average number $N$ of particles. That is, Eq. (6.8) has to be interpreted as
a relation that fixes the modulus $\bar\psi_0\psi_0$ so as to adjust the appropriate value of $N$.$^8$


The weakly interacting Bose gas 

Now, with this background, let us restore the interactions, focusing on a small but non-zero
coupling constant $g$. To keep the discussion concrete, we specialize to the case of
a free single-particle system, $\hat H_0 = \hat{\mathbf{p}}^2/2m$. (Notice that the ground state wavefunction
of this system is described by a spatially constant zero-momentum state.) By adiabatic
continuity we expect that much of the picture developed above will survive generalization
to non-zero interaction strengths. In particular, the ground state, which in the case under
consideration corresponds to a temporally and spatially constant mode $\psi_0$, will continue
to be macroscopically occupied. In these circumstances, the dominant contribution to the
action will again come from the classical $\psi_0$ sector:

$$T S[\bar\psi_0,\psi_0] = -\mu\,\bar\psi_0\psi_0 + \frac{g}{2L^d}(\bar\psi_0\psi_0)^2. \tag{6.9}$$

(Notice the similarity of the action to the integrand of the toy problem discussed in
Section 5.1.) Crucially, the stability of the action is now guaranteed by the interaction vertex,
no matter how small $g > 0$ is (see the schematic plot of the action in the figure on page
263). Accordingly, we will treat $\psi_0$ no longer as a fixed parameter but rather as an ordinary
integration variable. Integration over all field components will produce a partition function
$Z(\mu)$ that depends parametrically on the chemical potential. As usual in statistical physics,
the function can then be employed to fix the particle number. (Notice that, vis-a-vis aspects
of thermodynamics, the interacting system appears to behave more “naturally” than its
ideal, non-interacting approximation. This reflects a general feature of bosonic systems:
interactions “regularize” a number of pathological features of the ideal gas.)


7 Notice the similarity of that reasoning to the arguments employed in connection with the semi-classical treatment
of spin systems in the limit of large S (Section 2.2). Unfortunately, the actual state of affairs with the classical
treatment of the condensate is somewhat more complex than the simple argument above suggests. (For a good
discussion, see A. A. Abrikosov, L. P. Gorkov, and I. E. Dzyaloshinskii, Methods of Quantum Field Theory
in Statistical Physics [Dover Publications, 1975].) However, the net result of a more thorough analysis, i.e. an
integration over all dynamically fluctuating components $\psi_{0,n\neq 0}$, shows that the treatment of $\psi_0$ as classical
represents a legitimate approximation.

8 For a more rigorous discussion of the choice of the thermodynamic variables in the present context, we again
refer to Abrikosov et al., ibid.

Returning to the $\psi_0$-integration, we observe that, for low enough temperatures, the problem
is an ideal candidate for saddle-point analysis. Variation of the action with respect to
$\psi_0$ gives

$$\psi_0\left(-\mu + \frac{g}{L^d}\bar\psi_0\psi_0\right) = 0.$$

This equation is solved by any constant complex field configuration $\psi_0$ with modulus $|\psi_0| =
\sqrt{\mu L^d/g} \equiv \gamma$. In spite of its innocent appearance, this equation reveals much about the
nature of the system:

▷ For $\mu < 0$ (i.e. above the condensation threshold of the non-interacting system), the
equation exhibits only the trivial solution $\psi_0 = 0$. This means that no stable condensate
amplitude exists.

▷ Below the condensation threshold (i.e. for $\mu > 0$),$^9$ the equation is solved by any
configuration with $|\psi_0| = \gamma \equiv \sqrt{\mu L^d/g}$. (Notice that $\bar\psi_0\psi_0 \propto L^d$, reflecting the macroscopic
population of the ground state.)

▷ The equation couples only to the modulus of $\psi_0$. That is, the solution of the stationary
phase equation is continuously degenerate: each configuration $\psi_0 = \gamma\exp(i\phi)$, $\phi \in [0, 2\pi]$,
is a solution.
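
The three observations above can be checked directly on Eq. (6.9). The following sketch evaluates $TS[\bar\psi_0,\psi_0]$ for a few amplitudes; the parameter values and the function names `action_TS` and `saddle_modulus` are arbitrary choices made for illustration.

```python
import cmath
import math

def action_TS(psi0, mu, g, Ld):
    """T*S of Eq. (6.9) for a complex zero-mode amplitude psi0."""
    n0 = abs(psi0) ** 2
    return -mu * n0 + 0.5 * g / Ld * n0 ** 2

def saddle_modulus(mu, g, Ld):
    """Modulus solving psi0 * (-mu + (g/Ld)*|psi0|^2) = 0 (trivial solution for mu <= 0)."""
    return math.sqrt(mu * Ld / g) if mu > 0 else 0.0

mu, g, Ld = 0.3, 0.05, 1000.0  # assumed sample values
gamma = saddle_modulus(mu, g, Ld)

# The action depends only on the modulus: every phase gives the same value.
values = [action_TS(gamma * cmath.exp(1j * phi), mu, g, Ld) for phi in (0.0, 1.0, 2.5, 4.0)]
print("gamma            =", gamma)
print("T*S on the rim   =", values)
print("T*S at 0.9*gamma =", action_TS(0.9 * gamma, mu, g, Ld))
print("T*S at 1.1*gamma =", action_TS(1.1 * gamma, mu, g, Ld))
print("mu < 0 solution  =", saddle_modulus(-0.3, g, Ld))
```

For these values the action on the degenerate ring equals $-\mu^2 L^d/2g = -900$, and it is larger both inside and outside the ring.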

For our present discussion, the last of the three aspects mentioned above is the most
important. It raises the question of which configuration $\psi_0 = \gamma\exp(i\phi)$ is the “right” one.

Without loss of generality, we may choose $\psi_0 = \gamma \in \mathbb{R}$ as a reference configuration for
our theory. This choice amounts to selecting a particular minimum lying in the “Mexican
hat” profile of the action shown above. However, it is clear that an expansion of the action
around that minimum will be singular: fluctuations $\psi_0 \to \psi_0 + \delta\psi$ that do not leave the
azimuthally symmetric well of degenerate minima do not change the action and, therefore,
have vanishing expansion coefficients. As a result, in the present situation, we will not be
able to implement a simple scheme, i.e. “saddle-point plus quadratic fluctuations”; there is
nothing that constrains the deviations $\delta\psi$ to be small. In the next section, we discuss the
general principles behind this phenomenon. In section 6.3, we then continue to explore its
ramifications in the physics of the Bose system.


Spontaneous symmetry breaking 

The mechanism encountered here is one of spontaneous symmetry breaking. To understand
the general principle, consider an action $S[\psi]$ with a global continuous symmetry
under some transformation $g$ (not to be confused with the aforementioned coupling constant
of the Bose gas): specifically, the action remains invariant under a global transformation of
the fields such that $\forall i \in M$: $\psi_i \to g\psi_i$, where $M$ is the base manifold, i.e. $S[\psi] = S[g\psi]$. The
transformation is “continuous” in the sense that $g$ takes values in some manifold, typically
a group $G$.

9 Owing to the stabilization of the zero mode integration by the interaction constant, $\mu \le 0$ is no
longer a strict condition.

Examples: The action of a Heisenberg ferromagnet is invariant under rotation of all
spins simultaneously by the same amount, $\mathbf{S}_i \to g\mathbf{S}_i$. In this case, $g \in G = \mathrm{O}(3)$, the
three-dimensional group of rotations ($g$ not to be confused with the coupling constant of
the interaction). The action of the displacement fields $\mathbf{u}$ describing elastic deformations of
a solid (phonons) is invariant under simultaneous translation of all displacements $\mathbf{u}_i \to
\mathbf{u}_i + \mathbf{a}$, i.e. the symmetry manifold is the $d$-dimensional translation group $G = \mathbb{R}^d$. In the
example above, we encountered a U(1) symmetry under phase multiplication $\psi_0 \to e^{i\phi}\psi_0$.
This phase freedom expresses the global gauge symmetry of quantum mechanics under
transformation by a phase, a point we discuss in more detail below.

Now, given a theory with globally $G$ invariant action, two scenarios are conceivable: either the
ground states share the invariance properties of the action or they do not. The two alternatives
are illustrated in the figure for the example of the Bose system. For $\mu < 0$, the action $S[\bar\psi_0,\psi_0]$
has a single ground state at $\psi_0 = 0$. This state is trivially symmetric under the action of $G =
\mathrm{U}(1)$. However, for positive $\mu$, i.e. in the situation discussed above, there is an entire manifold
of degenerate ground states, defined through the relation $|\psi_0| = \gamma$. These ground states
transform into each other under the action of the gauge group. However, none of them is
individually invariant.

With the other examples mentioned above, the situation is similar. For symmetry groups
more complex than the one-dimensional manifold U(1), the ground states will, in general, be
invariant under transformation by the elements of a certain subgroup $H \subset G$ (that includes
the two extremes $H = \{1\}$ and $H = G$). For example, below the transition temperature, the
ground state of the Heisenberg magnet will be given by (domain-wise) aligned configurations
of spins. Assuming that the spins are oriented along the $z$-direction, the ground state is
invariant under the abelian subgroup $H \subset \mathrm{O}(3)$ containing all rotations around the $z$-axis.
However, invariance under the full rotation group is manifestly broken. Solids represent
states where the translation symmetry is fully broken, i.e. all atoms collectively occupy a
fixed pattern of positions in space, $H = \{1\}$, etc.

In spite of the undeniable existence of solids, magnets, and Bose-Einstein condensates of 
definite phase, the notion of a ground state that does not share the full symmetry of the 
theory may appear paradoxical, or at least “unnatural.” For example, even if any particular 
ground state of the “Mexican hat” potential shown in the figure above “breaks” the rota- 
tional symmetry, should not all these states enter the partition sum with equal statistical 
weight, such that the net outcome of the theory is again fully symmetric? 

To understand why symmetry breaking is a “natural” and observable phenomenon, it is
instructive to perform a gedankenexperiment. To this end, consider the partition function
of a classical$^{10}$ ferromagnet,

$$Z = \operatorname{tr}\left(e^{-\beta\left(H - \mathbf{h}\cdot\sum_i \mathbf{S}_i\right)}\right),$$

where H is the rotationally invariant part of the energy functional and h represents a weak 
external field. (Alternatively, we can think of h as an internal field, caused by a slight 
structural imperfection of the system.) In the limit of vanishing field strength, the theory 
becomes manifestly symmetric. Symbolically, 

$$\lim_{N\to\infty}\,\lim_{h\to 0} Z \longrightarrow \text{rot. sym.},$$

where the limit $N \to \infty$ serves as a mnemonic indicating that we consider systems of
macroscopic size. However, keeping in mind the fact that the model ought to describe a
physical magnetic system, the order of limits taken above appears questionable. Since the
external perturbation couples to a macroscopic number of spins, a more natural description
of an “almost” symmetric situation would be

$$\lim_{h\to 0}\,\lim_{N\to\infty} Z.$$

The point is that the two orders of limits lead to different results. In the second case, for any $h$,
the $N \to \infty$ system is described by an explicitly symmetry broken action. No matter how
small the magnetic field, the energetic cost to rotate $N \to \infty$ spins against the field is too
high, i.e. the ground state $|S\rangle$ below the transition temperature will be uniquely aligned,
$\mathbf{S}_i \parallel \mathbf{h}$. When we then send $h \to 0$ in a subsequent step, that particular state will remain
the observable reference state of the system. Although, formally, a spontaneous thermal
fluctuation rotating all spins by the same amount, $|S\rangle \to |gS\rangle$, would not cost energy, that
fluctuation can be excluded by entropic reasoning.$^{11}$ (By analogy, one rarely observes kettles
crashing into the kitchen wall as a consequence of a concerted thermal fluctuation of the
water molecules!)
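
The non-commutativity of the two limits can be illustrated with a deliberately crude toy model, introduced here only for illustration: assume that, below the transition, the $N$ classical unit spins are rigidly locked into a single macroscopic moment that can still rotate as a whole. The partition function is then $Z \propto \int d\Omega\, e^{\beta N h\cos\theta}$ and the magnetization per spin along the field is the Langevin function of $x = \beta N h$. The function names below are hypothetical.

```python
import math

def langevin(x):
    """<cos(theta)> of a classical moment in a field: coth(x) - 1/x."""
    if abs(x) < 1e-4:
        return x / 3.0  # leading term of the small-x expansion
    return 1.0 / math.tanh(x) - 1.0 / x

def magnetization_per_spin(N, h, T=1.0):
    """Toy model: N rigidly locked unit spins in a field h (units with k_B = 1)."""
    return langevin(N * h / T)

# h -> 0 first at fixed N: the symmetric result m = 0.
print("N = 10,    h = 1e-12:", magnetization_per_spin(10, 1e-12))
# N -> infinity first at fixed small h: the symmetry-broken result m = 1.
print("N = 1e23,  h = 1e-12:", magnetization_per_spin(10**23, 1e-12))
```

However small the field, a macroscopic $N$ drives $\beta N h \gg 1$ and the moment is pinned along $\mathbf{h}$; only if $h$ is sent to zero before $N$ grows does the rotationally symmetric answer survive.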

However, the appearance of non-trivial ground states is just one manifestation of
spontaneous symmetry breaking. Equally important, residual fluctuations around the ground
state lead to the formation of soft modes (sometimes known as massless modes), i.e.
field configurations $\phi_{\mathbf{q}}$ whose action $S[\phi]$ vanishes in the limit of long wavelengths, $\mathbf{q} \to 0$.
Specifically, the soft modes formed on top of a symmetry broken ground state are called
Goldstone modes. As a rule, the presence of soft modes in a continuum theory has important
phenomenological consequences. To understand this point, notice that the general
structure of a soft mode action is given by

$$S[\phi] = \sum_{\mathbf{q},i}\phi_{\mathbf{q}}\left[c_1|q_i| + c_2 q_i^2\right]\phi_{-\mathbf{q}} + O(\phi^4, q^3), \tag{6.10}$$

where $c_{1,2}$ are coefficients. As discussed in Section 5.1, the absence of a constant contribution
to the action (i.e. a contribution that does not vanish in the limit $q \to 0$) signals the existence
of long-range power-law correlations in the system. As we will see shortly, the vanishing of
the action in the long-wavelength limit $q \to 0$ further implies that the contribution of the
soft modes dominates practically all observable properties of the system.


10 The same argument can be formulated for the quantum magnet.

11 In Chapter 8, we show that this (overly) simple picture in fact breaks down in dimensions $d \le 2$.

EXERCISE Explore the structure of the propagator $G(\mathbf{q}) \equiv \langle\phi_{\mathbf{q}}\phi_{-\mathbf{q}}\rangle$ associated with $S[\phi]$ and
convince yourself that the arguments formulated for the specific case of the $\phi^4$-theory are of
general validity. To this end, notice that, for small $q$, $G(\mathbf{q}) \sim |\mathbf{q}|^{-n}$, where $n$ denotes the index
of the first non-vanishing coefficient $c_1, c_2, \ldots$, i.e. the propagator is dominated by the smallest
$q$-power appearing in the action. The power-law behavior of the correlation function implies a
homogeneity relation $G(\mathbf{q}/\lambda) = \lambda^n G(\mathbf{q})$. Show that this scaling relation implies that the Fourier
transform $G(r \equiv |\mathbf{r}|) = \langle\phi(\mathbf{r})\phi(0)\rangle$ obeys the “scaling law” $G(\lambda r) = \lambda^{-d+n}G(r)$. This, in turn,
implies that the real space correlation function also decays as a power law, namely $G(r) \sim
|\mathbf{r}|^{-d+n}$, i.e. in a “long-range” manner. Explore the breakdown of the argument for an action
with a finite mass term. Convince yourself that, in this case, the decay would be exponential,
i.e. “short-range.”


[Figure: three rows of spin arrows, labeled $\psi_0$ (uniform ground state), $g\psi_0$ (globally rotated state), and $g(\mathbf{r})\psi_0$ (slowly varying rotation).]

What, then, are the origin and nature of the soft Goldstone modes caused by the spontaneous
breakdown of a symmetry? To address this point let us consider the action of a symmetry
group element $g$ on a (symmetry broken) ground state $\psi_0$ (cf. the middle row of the figure).
By definition, $S[g\psi_0] = S[\psi_0]$ still assumes its extremal value. Assuming that $g$ is close
to the group identity, we may express $g = \exp[\sum_a \phi_a T_a]$, where the $\{T_a\}$ are generators
living in the Lie algebra of the group and $\phi_a$ are some expansion coefficients.$^{12}$ Expressing
fluctuations around $\psi_0$ in terms of the “coordinates” $\phi_a$, we conclude that the action
$S[\phi] = 0$. However, if we promote the global transformation to one with a weakly fluctuating
spatial profile, $g \to g(\mathbf{r})$, $\psi_0 \to g(\mathbf{r})\psi_0$ (bottom row of the figure), some price must be paid.
That is, for a spatially fluctuating coordinate profile $\{\phi_a(\mathbf{r})\}$, $S[\phi] \neq 0$, where the energy
cost depends inversely on the fluctuation rate $\lambda$ of the field $\phi$. The expansion of $S$ in terms
of gradients of $\phi$ is thus bound to lead to a soft mode action of the type Eq. (6.10).

In view of their physical significance, it is important to ask how many independent soft
modes exist. The answer can be straightforwardly given on the basis of the geometric picture
developed above. Suppose our symmetry group $G$ has dimension $r$, i.e. its Lie algebra is
spanned by $r$ linearly independent generators $T_a$, $a = 1, \ldots, r$. If the subgroup $H \subset G$ has
dimension $s < r$, $s$ of these generators can be chosen so as to leave the ground state invariant.
On the other hand, the remaining $p \equiv r - s$ generators inevitably create Goldstone modes.
In the language of group theory, these generators span the coset space $G/H$. For example,
for the ferromagnet, $H = \mathrm{O}(2)$ is the one-dimensional subgroup of rotations around the
quantization axis (e.g. the $z$-axis). Since the rotation group has dimension 3, there must be
two independent Goldstone modes. These can be generated by the action of the rotation, or
angular momentum generators $J_{x,y}$ acting on the $z$-aligned ground state. The coset space
$\mathrm{O}(3)/\mathrm{O}(2)$ can be shown to be isomorphic to the 2-sphere, i.e. the sphere traced out by the
spins as they fluctuate around the ground state.

12 Here we have used the fact that, for any reasonable model, $G$ will be a Lie group, i.e. a group with the structure
of a differentiable manifold.
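
The counting $p = r - s$ for the ferromagnet can be made concrete in a few lines. The sketch below is illustrative only: it assumes the ground state is represented by a unit vector along $z$ and that the three rotation generators are written as real antisymmetric $3 \times 3$ matrices. The helper `count_goldstone_modes` is a hypothetical name. The number of broken generators is obtained as the rank of the set of infinitesimal displacements $T_a \psi_0$.

```python
import numpy as np

# Generators of rotations about x, y, z acting on 3-component vectors
# (real antisymmetric matrices spanning the Lie algebra of the rotation group).
T = {
    "x": np.array([[0, 0, 0], [0, 0, -1], [0, 1, 0]], dtype=float),
    "y": np.array([[0, 0, 1], [0, 0, 0], [-1, 0, 0]], dtype=float),
    "z": np.array([[0, -1, 0], [1, 0, 0], [0, 0, 0]], dtype=float),
}

def count_goldstone_modes(generators, ground_state, tol=1e-12):
    """Return (r, s, p): dim G, number of unbroken generators, and p = r - s."""
    # Columns are the infinitesimal displacements T_a applied to the ground state.
    displacements = np.column_stack([g @ ground_state for g in generators])
    r = len(generators)
    p = int(np.linalg.matrix_rank(displacements, tol=tol))  # independent broken directions
    return r, r - p, p

n0 = np.array([0.0, 0.0, 1.0])  # magnetization along the z-axis
r, s, p = count_goldstone_modes(list(T.values()), n0)
print(f"dim G = {r}, dim H = {s}, Goldstone modes = {p}")
```

Here the rotation about $z$ leaves the ground state untouched, while the rotations about $x$ and $y$ tilt it in two independent directions, in line with the two Goldstone modes quoted above.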

Finally, you may ask why we bothered to formulate these concepts in the abstract language
of Lie groups and generators, etc. The reason is that the connection between the
coordinates parameterizing the Goldstone modes $\phi_i$, $i = 1, \ldots, p$, residual “massive modes”
$\kappa_j$, $j = 1, \ldots, n - p$, and the original coordinates $\psi_i$, $i = 1, \ldots, n$, of the problem, respectively,
is usually nonlinear and sometimes not even very transparent. With problems more
complex than the three prototypical examples mentioned above, it is usually profitable to
first develop a good understanding of the geometry of the problem before specific coordinate
systems are introduced.

After these general considerations it is time to get back to the Bose system and to explore 
the physical consequences of Goldstone mode formation on the level of a concrete example! 


Superfluidity 

The theory of the weakly interacting superfluid to be discussed below was originally conceived
by Bogoliubov, then in the language of second quantization. 13 We will reformulate
the theory in the language of the field integrals starting with the action of the weakly interacting
Bose gas (6.7). Focussing on temperatures below $T_c$ ($\mu > 0$), let us expand the theory
around the particular mean-field ground state $\bar\psi_0 = \psi_0 = (\mu L^d/g)^{1/2} = \gamma$. (Of course, any
other state lying in the “Mexican hat” minimum of the action would be just as good.) Notice
that the quantum ground state corresponding to the configuration $\psi_0$ is unconventional in
the sense that it cannot have a definite particle number. The reason is that, according to the
correspondence $\psi \leftrightarrow a$ between coherent states and operators, respectively, a non-vanishing
functional expectation value of $\psi_0$ is equivalent to a non-vanishing quantum expectation
value $\langle a_0 \rangle$. Assuming that, at low temperatures, the thermal average $\langle \ldots \rangle$ will project onto
the ground state $|\Omega\rangle$, we conclude that $\langle \Omega | a_0 | \Omega \rangle \neq 0$, i.e. $|\Omega\rangle$ cannot be a state with a
definite number of particles. 14


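[Figure: “Mexican hat” in the complex $\psi$ plane (axes $\mathrm{Re}\,\psi$, $\mathrm{Im}\,\psi$), showing the circle $S = \mathrm{extr.}$, the phase $\phi$, and the radial deviation $\delta\rho$.]

The symmetry group U(1) acts on this state by multiplication,
$\psi_0 \to e^{i\phi}\psi_0$ and $\bar\psi_0 \to e^{-i\phi}\bar\psi_0$. Knowing that the action of a weakly
modulated field $\phi(\mathbf{r}, \tau)$ will be massless, let us introduce coordinates

$$\psi(\mathbf{r},\tau) = \rho^{1/2}(\mathbf{r},\tau)\, e^{i\phi(\mathbf{r},\tau)}, \qquad \bar\psi(\mathbf{r},\tau) = \rho^{1/2}(\mathbf{r},\tau)\, e^{-i\phi(\mathbf{r},\tau)},$$

where $\rho(\mathbf{r},\tau) = \rho_0 + \delta\rho(\mathbf{r},\tau)$ and $\rho_0 = \bar\psi_0\psi_0/L^d$ is the condensate
density. Evidently, the variable $\delta\rho$ parameterizes deviations of the
field $\psi(\mathbf{r},\tau)$ from the extremum. These excursions are energetically costly, i.e. $\delta\rho$ will turn
out to be a massive mode. Also notice that the transformation of coordinates $(\bar\psi, \psi) \to (\rho, \phi)$,
viewed as a change of integration variables, has a Jacobian of unity.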


13 N. N. Bogoliubov, On the theory of superfluidity, J. Phys. (USSR) 11, 23-32 (1947) (reprinted in D. Pines,
The Many-Body Problem (Benjamin, 1961)).

14 However, as usual with grand canonical descriptions, in the thermodynamic limit the relative uncertainty in
the number of particles, $(\langle N^2 \rangle - \langle N \rangle^2)/\langle N \rangle^2$, will become vanishingly small.
INFO As we are dealing with a (functional) integral, there is a lot of freedom as to the choice
of integration parameters. (I.e., in contrast to the operator formulation, there is no a priori
constraint for a transform to be “canonical.”) However, physically meaningful changes of
representation will usually be canonical transformations, in the sense that the corresponding
transformations of operators would conserve the commutation relations. Indeed, as we have seen
in the info block starting on page 176, the operator transformation $a(\mathbf{r}) \equiv \hat\rho(\mathbf{r})^{1/2} e^{i\hat\phi(\mathbf{r})}$,
$a^\dagger(\mathbf{r}) \equiv e^{-i\hat\phi(\mathbf{r})} \hat\rho(\mathbf{r})^{1/2}$, fulfills this criterion.

We next substitute the density-phase relation into the action and expand to second order
around the reference mean-field. Ignoring gradients acting on the density field (in comparison
with the “potential” cost of these fluctuations), we obtain

$$S[\delta\rho, \phi] \approx \int d\tau \int d^d r \left[ i\,\delta\rho\,\partial_\tau\phi + \frac{\rho_0}{2m}(\nabla\phi)^2 + \frac{g\,\delta\rho^2}{2} \right]. \tag{6.11}$$

The first term of the action has the canonical structure “momentum $\times\ \partial_\tau$(coordinate)”
indicative of a canonically conjugate pair. The second term measures the energy cost of
spatially varying phase fluctuations. Notice that fluctuations with $\phi(\mathbf{r}, \tau) = \mathrm{const.}$ do not
incur an energy cost - $\phi$ is a Goldstone mode. Finally, the third term records the energy
cost of massive fluctuations from the potential minimum. Equation (6.11) represents the
Hamiltonian version of the action, i.e. an action comprising coordinates $\phi$ and momenta
$\delta\rho$. Gaussian integration over the field $\delta\rho$ leads us to the Lagrangian form of the action
(exercise):

$$S[\phi] \approx \frac{1}{2} \int d\tau \int d^d r \left[ \frac{1}{g}(\partial_\tau\phi)^2 + \frac{\rho_0}{m}(\nabla\phi)^2 \right]. \tag{6.12}$$

Comparison with Eq. (1.4) identifies this action as the familiar $d$-dimensional oscillator.
Drawing on the results of Chapter 1 (see, e.g., Eq. (1.29)), we find that the energy $\omega_{\mathbf{k}}$ carried
by elementary excitations of the system scales linearly with momentum, $\omega_{\mathbf{k}} = |\mathbf{k}| (g\rho_0/m)^{1/2}$.
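
To get a feeling for the linear dispersion, the short sketch below evaluates $\omega_{\mathbf{k}} = |\mathbf{k}|(g\rho_0/m)^{1/2}$ and compares it with the full Bogoliubov spectrum $\sqrt{\epsilon_k(\epsilon_k + 2g\rho_0)}$, $\epsilon_k = k^2/2m$, of the operator formulation. The latter is quoted here from the standard theory of the weakly interacting Bose gas rather than derived in this section, units with $\hbar = 1$ are assumed, and the function names and parameter values are purely illustrative.

```python
import numpy as np

def sound_velocity(g, rho0, m):
    """Velocity c = sqrt(g * rho0 / m) read off from omega_k = |k| c."""
    return np.sqrt(g * rho0 / m)

def phonon_dispersion(k, g, rho0, m):
    """Linear Goldstone-mode dispersion following from the action (6.12)."""
    return np.abs(k) * sound_velocity(g, rho0, m)

def bogoliubov_dispersion(k, g, rho0, m):
    """Full Bogoliubov spectrum sqrt(e_k (e_k + 2 g rho0)), e_k = k^2 / 2m.

    Quoted for comparison only (assumption: units with hbar = 1).
    """
    e_k = k**2 / (2.0 * m)
    return np.sqrt(e_k * (e_k + 2.0 * g * rho0))

g, rho0, m = 0.5, 2.0, 1.0  # illustrative values
for k in (0.01, 0.1, 1.0, 3.0):
    print(k, phonon_dispersion(k, g, rho0, m), bogoliubov_dispersion(k, g, rho0, m))
```

The two curves agree at small $|\mathbf{k}|$ and separate once $\epsilon_k$ becomes comparable to $g\rho_0$, which is one way of seeing that the action (6.12) is a long-wavelength description.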
Let us now discuss the physical ramifications of these results. The actions (6.11) and (6.12)
describe the phenomenon of superfluidity. To make the connection between the fundamental
degree of freedom of a superfluid system, the supercurrent, and the phase field explicit,
let us consider the quantum mechanical current operator

$$\hat{\mathbf{j}}(\mathbf{r},\tau) = \frac{i}{2m}\left[ (\nabla a^\dagger(\mathbf{r},\tau))\, a(\mathbf{r},\tau) - a^\dagger(\mathbf{r},\tau)\, \nabla a(\mathbf{r},\tau) \right]
\;\xrightarrow{\text{fun. int.}}\; \frac{i}{2m}\left[ (\nabla\bar\psi(\mathbf{r},\tau))\, \psi(\mathbf{r},\tau) - \bar\psi(\mathbf{r},\tau)\, \nabla\psi(\mathbf{r},\tau) \right] \approx \frac{\rho_0}{m} \nabla\phi(\mathbf{r},\tau), \tag{6.13}$$

where the arrow indicates the functional integral correspondence of the operator description
and we have neglected all contributions arising from spatial fluctuations of the density
profile. (Indeed, these - massive - fluctuations describe the “normal” contribution to the
current flow.)

INFO Superfluidity is one of the most counterintuitive and fascinating phenomena displayed by
condensed matter systems. Experimentally, the most straightforward access to superfluid states
of matter is provided by the helium liquids. Representative of many other effects displayed by
superfluid states of helium, we mention the capability of thin films to flow up the walls of a vessel
(if the reward is that on the outer side of the container a low-lying basin can be reached - the
fountain experiment) or to effortlessly propagate through porous media that no normal fluid may
penetrate. Readers interested in learning more about the phenomenology of superfluid states of
matter may refer to the seminal text by Pines and Nozières. 15

The gradient of the phase variable is therefore a measure of the (super)current flow in
the system. The behavior of that degree of freedom can be understood by inspection of
the stationary phase equations - a.k.a. the Hamilton or Lagrange equations of motion -
associated with the action (6.11) or (6.12). Turning to the Hamiltonian formulation, one
obtains (exercise)

$$i\partial_\tau\phi = -g\rho + \mu, \qquad i\partial_\tau\rho = -\frac{\rho_0}{m}\nabla^2\phi = -\nabla\cdot\mathbf{j}.$$

The second of these equations represents (the Euclidean time version of) a continuity equation.
A current flow with non-vanishing divergence is accompanied by dynamical distortions
in the density profile. The first equation tells us that the system adjusts to spatial fluctuations
of the density by a dynamical phase fluctuation. The most remarkable feature of
these equations is that they possess steady state solutions with non-vanishing current flow.
Setting $\partial_\tau\phi = \partial_\tau\rho = 0$, we obtain the conditions $\delta\rho = 0$ and $\nabla\cdot\mathbf{j} = 0$, i.e. below the
condensation temperature, a configuration with a uniform density profile can support a steady
state divergenceless (super)current. Notice that a “mass term” in the $\phi$-action would spoil
this property, i.e., within our present approach, the phenomenon of supercurrent flow is
intimately linked to the Goldstone mode character of the $\phi$ field.

EXERCISE Add a fictitious mass term to the $\phi$-action and explore its consequences. How do
the features discussed above present themselves in the Lagrange picture?

It is very instructive to interpret the phenomenology of supercurrent flow from a different, 
more microscopic perspective. Steady state current flow in normal environments is pre- 
vented by the mechanism of energy dissipation, i.e. particles constituting the current flow 
scatter off imperfections inside the system, thereby converting part of their energy into the 
creation of elementary excitations. (Macroscopically, the conversion of kinetic energy into 
the creation of excitations manifests itself as heat production.) Apparently, this mechanism 
is inactivated in superfluid states of matter, i.e. the current flow is dissipationless. 

How can the dissipative loss of energy be avoided? Trivially, no energy can be exchanged if 
there are no elementary excitations to create. In reality, this means that the excitations of the 
system are energetically so high-lying that the kinetic energy stored in the current-carrying 
particles is insufficient to create them. But this is not the situation that we encounter in the 
superfluid! As we saw above, there is no energy gap separating the quasi-particle excitations 
of the system from the ground state. Rather, the dispersion $\omega(\mathbf{k})$ vanishes linearly as $\mathbf{k} \to 0$.
However, there is an ingenious argument due to Landau showing that a linear excitation 
spectrum indeed suffices to stabilize dissipationless transport. 


15 D. Pines and P. Nozières, The Theory of Quantum Liquids: Superfluid Bose Liquids (Addison-Wesley, 1989).

Figure 6.4 (a) Flow of a fluid through a rough pipe. (b) The same viewed from the rest frame of
the fluid. (c) Dissipative creation of a (quasi-particle) excitation. (d) The same viewed from the
laboratory frame.

INFO Consider the flow of some fluid through a pipe (see Fig. 6.4(a)). To be concrete, let us
assume that the flow occurs at a uniform velocity $\mathbf{V}$. Taking the mass (of a certain portion of the
fluid) to be $M$, the current carries a total kinetic energy $E_1 = M\mathbf{V}^2/2$. Now, suppose we view
the situation from the point of view of the fluid, i.e. we perform a Galilean transformation into
its own rest frame (Fig. 6.4(b)). From the perspective of the fluid, the walls of the pipe appear
as though they were moving with velocity $-\mathbf{V}$. Now, suppose that frictional forces between
fluid and the wall lead to the creation of an elementary excitation of momentum $\mathbf{p}$ and energy
$\epsilon(\mathbf{p})$, i.e. the fluid is no longer at rest but carries kinetic energy (Fig. 6.4(c)). After a Galilean
transformation back to the laboratory frame (Fig. 6.4(d)), one finds that the energy of the fluid
after the creation of the excitation is given by (exercise)

$$E_2 = \frac{M\mathbf{V}^2}{2} + \mathbf{p}\cdot\mathbf{V} + \epsilon(\mathbf{p}).$$

Now, since all of the energy needed to manufacture the excitation must have been provided by
the liquid itself, energy conservation requires that $E_1 = E_2$, or $-\mathbf{p}\cdot\mathbf{V} = \epsilon(\mathbf{p})$. Since
$\mathbf{p}\cdot\mathbf{V} \ge -|\mathbf{p}||\mathbf{V}|$, this condition can only be met if $|\mathbf{p}||\mathbf{V}| \ge \epsilon(\mathbf{p})$. While systems with a “normal” gapless
dispersion, $\epsilon(\mathbf{p}) \sim \mathbf{p}^2$, are compatible with this energy-balance relation (i.e. no matter how
small $|\mathbf{V}|$, quasi-particles of low momentum can always be excited), both gapped dispersions,
$\epsilon(\mathbf{p}) \to \mathrm{const.}$ as $\mathbf{p} \to 0$, and linear dispersions are incompatible if $V$ becomes smaller than a certain
critical velocity $V_*$. Specifically for a linear dispersion $\epsilon(\mathbf{p}) = v|\mathbf{p}|$, the critical velocity is given
by $V_* = v$. For currents slower than that, the flow is necessarily dissipationless.
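
The energy-balance condition $|\mathbf{p}||\mathbf{V}| \ge \epsilon(\mathbf{p})$ amounts to a critical velocity $V_* = \min_{\mathbf{p}} \epsilon(\mathbf{p})/|\mathbf{p}|$. As a rough numerical illustration, the sketch below evaluates this minimum on a momentum grid for the three types of dispersion mentioned above. The function name `critical_velocity`, the grid, and the parameter values are illustrative assumptions, and the gapped example is taken to be $\epsilon(p) = \mathrm{gap} + p^2/2m$ for concreteness.

```python
import numpy as np

def critical_velocity(dispersion, p_min=1e-4, p_max=10.0, n=200_000):
    """Landau estimate V* = min over p of e(p)/|p|, evaluated on a momentum grid."""
    p = np.linspace(p_min, p_max, n)
    return np.min(dispersion(p) / p)

v, gap, mass = 1.5, 0.8, 1.0  # illustrative parameters

def quadratic(p):
    """'Normal' gapless dispersion e(p) = p^2 / 2m."""
    return p**2 / (2.0 * mass)

def linear(p):
    """Phonon-like dispersion e(p) = v |p|."""
    return v * p

def gapped(p):
    """Gapped dispersion with e(p) -> gap as p -> 0 (assumed form)."""
    return gap + p**2 / (2.0 * mass)

print("quadratic:", critical_velocity(quadratic))  # limited only by the grid cutoff p_min
print("linear:   ", critical_velocity(linear))
print("gapped:   ", critical_velocity(gapped))
```

For the quadratic dispersion the estimate shrinks with the smallest grid momentum, reflecting $V_* = 0$, while the linear and gapped cases give finite values ($v$ and $(2\,\mathrm{gap}/m)^{1/2}$ for the assumed forms).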

Let us conclude our preliminary discussion of the weakly interacting Bose gas with a very
important remark. Superficially, Eqs. (6.11) and (6.12) suggest that we have managed to
describe the long-range behavior of the condensed matter system in terms of a free Gaussian
theory. However, one must recall that $\phi$ is a phase field, defined only modulo $2\pi$. (In
Eqs. (6.11) and (6.12) this condition is understood implicitly. At this point, it is perhaps
worth reiterating that when dealing with Goldstone modes it is important to keep the
underlying geometry in mind and not focus too tightly on a specific coordinate representation.)
The fact that $\phi$ is defined only up to integer multiples of $2\pi$ manifests itself in the
formation of the most interesting excitations of the superfluid: vortices, i.e. phase configurations
$\phi(\mathbf{r}, \tau)$ that change by a multiple of $2\pi$ as one moves around a certain reference
coordinate, the vortex center. Existing in parallel with harmonic phonon-like excitations
discussed above, these excitations lead to a wealth of observable phenomena, to be discussed
in more detail in Chapter 8. However, for the moment let us turn to the discussion
of another prominent superfluid, the condensate of Cooper pairs, more generally known as
the superconductor.


6.4 Superconductivity 

The electrical resistivity of many metals and alloys drops abruptly to zero when the material
is cooled to a sufficiently low temperature. This phenomenon, which goes by the name of
superconductivity, was first observed by Kamerlingh Onnes in Leiden in 1911, three
years after he first liquefied helium.

Equally striking, a superconductor cooled below its transition temperature in a magnetic 
field expels all magnetic flux from its interior. (One of the more spectacular manifesta- 
tions of the field-aversion of superconductors is exemplified in the figure below: a magnet 
levitated by a superconductor due to the expulsion of magnetic flux.) This phenomenon 
of perfect diamagnetism is known as the Meissner effect and is characteristic of 
superconductivity. 

Indeed, the Meissner effect and dissipa- 
tionless transport are but two of a plethora 
of phenomena accompanying superconduc- 
tivity. 16 Along with the introduction of 
more advanced theoretical machinery, a 
variety of superconducting phenomena is discussed in the remainder of this text. The 
present, introductory section is devoted to the formulation of the theoretical foundations 
of the conventional “BCS” theory of superconductivity, cast into the language of the field 
integral. Although the presentation is self-contained, our focus is on the theoretical aspects. 
Depending on taste, some readers may find it useful to motivate their encounter with the 
formalism developed below by first familiarizing themselves with the basic phenomenology 
of the BCS superconductor. 



Kamerlingh Onnes 1853-1926
Nobel Laureate in Physics in 1913 “for his investigations on the properties of matter at low
temperatures which led, inter alia, to the production of liquid helium.” (Image © The Nobel
Foundation.)



16 In fact, it is not even appropriate to speak about the phenomenon of “superconductivity” as deriving from
the same microscopic origin: since the discovery of the class of high-temperature cuprate superconductors in
1986, it has become increasingly evident that the physical mechanisms responsible for high-temperature and
“conventional” superconductivity are likely to be strikingly different.
Superconductivity involves an ordered state of conduction electrons in a metal, caused by the
presence of a residual attractive interaction at the Fermi surface. The nature and origin of
the ordering were elucidated in a seminal work by Bardeen, Cooper, and Schrieffer - BCS
theory 17 - some 50 years after its discovery!
At low temperatures, an attractive pairwise interaction can induce an instability of the
electron gas towards the formation of bound pairs of time-reversed states $\mathbf{k}\uparrow$ and $-\mathbf{k}\downarrow$ in
the vicinity of the Fermi surface.

From where does an attractive interaction between charged particles appear? In conventional
(BCS) superconductors, attractive correlations between electrons are due to the
exchange of lattice vibrations, or phonons: The motion of an electron through a metal causes
a dynamic local distortion of the ionic crystal. Crucially, this process is governed by two
totally different time scales. For an electron, it takes a time $\sim E_F^{-1}$ to traverse the immediate
vicinity of a lattice ion and to trigger a distortion out of its equilibrium position into a
configuration that both particles find energetically beneficial (top right panel of the figure).
However, once the ion has been excited it needs a time of
$O(\omega_D^{-1} \gg E_F^{-1})$ to relax back into its equilibrium position
(middle left). Here, $\omega_D$ denotes the Debye frequency,
i.e. the characteristic scale for phonon excitations. This
means that, long after the first electron has passed, a
second electron may benefit from the distorted ion potential
(middle right). Only after the ion has been left alone
for a time $> \omega_D^{-1}$ does it relax back into its equilibrium
configuration (bottom left and right). The net effect of
this retardation mechanism is an attractive interaction
between the two electrons. Since the maximum energy
scale of ionic excitations is given by the Debye frequency,
the range of the interaction is limited to energies $\sim \omega_D$ around the Fermi surface. (For a
more quantitative formulation, see Problem 4.5.) As regards the high-temperature cuprate
superconductors, the particular mechanism of pair formation remains (at the time of writing)
controversial, although the consensus is that its origin is rooted in spin fluctuations.
Basic concepts of BCS theory


John Bardeen 1908-1991 (left), Leon N. Cooper 1930- (center), and John R. Schrieffer 1931- (right)
Nobel Laureates in Physics in 1972 for their theory of superconductivity. (Bardeen was also
awarded the 1956 Nobel Prize in Physics for his research on semiconductors and discovery of
the transistor effect.) (Images © The Nobel Foundation.)


17 J. Bardeen, L. N. Cooper, and J. R. Schrieffer, Microscopic theory of superconductivity, Phys. Rev. 106 (1957),
162-4; Theory of superconductivity, Phys. Rev. 108 (1957), 1175-204.
Comprising two fermions, the electron-electron bound states, known as Cooper pairs,
mimic the behavior of bosonic composite particles. 18 At low temperatures, these quasi-bosonic
degrees of freedom form a condensate which is responsible for the remarkable properties
of superconductors, such as perfect diamagnetism.

To appreciate the tendency to pair formation in the electron system, consider the diagram
shown in the figure below. The region of attractive correlation is indicated as a shaded ring
of width $\sim \omega_D/v_F$. Now, consider a two-electron state $|\mathbf{k}\uparrow, -\mathbf{k}\downarrow\rangle$ formed by two particles
of (near) opposite momentum and opposite spin. 19 Momentum conserving scattering of the
constituent particles may lead to the formation of a new state $|(\mathbf{k}+\mathbf{p})\uparrow, -(\mathbf{k}+\mathbf{p})\downarrow\rangle \equiv
|\mathbf{k}'\uparrow, -\mathbf{k}'\downarrow\rangle$ of the same, opposite-momentum structure. Crucially, the momentum transfer
$\mathbf{p}$ may trace out a large set of values of $O(k_F^{d-1}\omega_D/v_F)$
without violating the condition that the final states
be close to the Fermi momentum. (By contrast,
if the initial state had not been formed by particles
of opposite momentum, the phase space for
scattering would have been greatly diminished.)
Remembering our previous discussion of the RPA,
we recognize a familiar mechanism:
an a priori weak interaction may amplify its effect
by conspiring with a large phase space volume.

To explore this mechanism in quantitative terms,
we will adopt a simplified model defined by the
Hamiltonian

$$\hat H = \sum_{\mathbf{k}\sigma} \epsilon_{\mathbf{k}}\, n_{\mathbf{k}\sigma} - \frac{g}{L^d} \sum_{\mathbf{k},\mathbf{k}',\mathbf{q}} c^\dagger_{\mathbf{k}+\mathbf{q}\uparrow} c^\dagger_{-\mathbf{k}\downarrow} c_{-\mathbf{k}'+\mathbf{q}\downarrow} c_{\mathbf{k}'\uparrow}, \tag{6.14}$$


where $g$ represents a (positive) constant. The Hamiltonian $\hat H$ should be interpreted as an
effective Hamiltonian describing the physics of a thin shell of states of width $O(\omega_D)$ centered
around the Fermi surface (i.e. the region where a net attractive interaction prevails).
Although a more realistic model of attraction would involve a complicated momentum-dependent
interaction such as the one obtained from the detailed consideration of the
electron-phonon interaction (see Problem 4.5), the simple constant pairing interaction captures
the essential physics. 20 After the trio who first explored its phenomenology, the model
Hamiltonian (6.14) is commonly referred to as the BCS Hamiltonian.


18 Strictly speaking, this identification deserves some qualification. In the superconducting context, Cooper pairs
have a length scale (the coherence length to be introduced below) which typically exceeds the average particle
spacing of the electron gas (usually by as much as three orders of magnitude). In this sense, it can be misleading
to equate a pair with a single composite particle. The interpretation of the BCS state as a Bose-Einstein
condensate of Cooper pairs is developed more fully in the problem set in the discussion of the BEC-BCS
crossover - see Problem 6.7.

19 Note that there are a minority of superconducting materials - the spin triplet superconductors - in which
electrons of equal spin are paired.

20 More importantly, to simplify our discussion, we will take the electrons to be otherwise non-interacting. In fact,
the presence of a repulsive Coulomb interaction of the electrons plays a crucial role in controlling the properties
of the superconductor. For an in-depth discussion of the role played by repulsive interactions, see A. I. Larkin and
Now the preliminary discussion above does not explain why an attractive interaction is
so special. Nor does the discussion elucidate the phenomenological consequences of pair 
scattering at the Fermi surface. In the following we will address these issues from a number 
of different angles. The result will be a heuristic picture of the superconductor that will 
guide us in constructing the more rigorous field integral approach below. Mainly for illus- 
trative purposes, we begin our discussion with a brief perturbative analysis of Cooper pair 
scattering. Proceeding in close analogy to the previous discussion of the RPA, we discover 
the dramatic consequences of an attractive interaction on the ground state of the system. 
(However, this part of the discussion is an optional (if instructive) element of the develop- 
ment of the theory. Readers who did not yet navigate Section 5.3 may choose to skip this 
part of the discussion and proceed directly to Section 6.4 where the mean-field picture of 
the superconductor is developed.) 


Cooper instability 

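To explore the fate of a Cooper pair under multiple scattering, let us consider the four-point
correlation function

$$C(\mathbf{q}, \tau) = \frac{1}{L^{2d}} \sum_{\mathbf{k},\mathbf{k}'} \left\langle \bar\psi_{\mathbf{k}+\mathbf{q}\uparrow}(\tau)\, \bar\psi_{-\mathbf{k}\downarrow}(\tau)\, \psi_{\mathbf{k}'+\mathbf{q}\downarrow}(0)\, \psi_{-\mathbf{k}'\uparrow}(0) \right\rangle.$$

This describes the amplitude of Cooper pair propagation $|\mathbf{k}+\mathbf{q}\uparrow, -\mathbf{k}\downarrow\rangle \to |\mathbf{k}'+\mathbf{q}\uparrow, -\mathbf{k}'\downarrow\rangle$
in an imaginary time $\tau$ and averaged over all initial and final particle momenta. 21 As is
usual with problems whose solution must depend only on time differences, it is convenient
to switch to a frequency representation. With $C(q) \equiv C(\mathbf{q}, \omega_m) = T \int_0^\beta d\tau\, e^{-i\omega_m\tau} C(\mathbf{q}, \tau)$,
where $\omega_m$ denotes a bosonic Matsubara frequency, it is straightforward to verify that
$C(q) = \frac{T^2}{L^{2d}} \sum_{k,k'} \langle \bar\psi_{k+q\uparrow}\, \bar\psi_{-k\downarrow}\, \psi_{k'+q\downarrow}\, \psi_{-k'\uparrow} \rangle$.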

To calculate the correlation function, let us draw on the perturbative methods introduced 
in Section 5.3. As with the analysis of the RPA, the density of the electron gas will play the 
role of a large parameter, i.e. one must expand the correlation function in pair interaction 
vertices g and retain only those terms that appear with one free momentum summation 
per interaction. Summation over these contributions leads to the ladder diagram series 
shown in Fig. 6.5, where the momentum labels of the Green functions are hidden for clarity. 
According to the definition of the correlation function, the two Green functions entering the 
ladder carry momenta k+q and —k, respectively. Momentum conservation then implies that 
the Green functions defining each consecutive rung of the ladder also carry near opposite 
momenta p + q and —p, where p is a summation variable. 


A. A. Varlamov, Fluctuation phenomena in superconductors, in Handbook on Superconductivity: Conventional
and Unconventional Superconductors, ed. K.-H. Bennemann and J. B. Ketterson (Springer-Verlag, 2002).

21 As with applications of the path integral discussed previously, information about the real time dynamics of
the pair can be extracted from the analytical continuation $\tau \to it$. Further, notice that the “center of mass”
momentum $\mathbf{q}$ of a pair $|\mathbf{k}+\mathbf{q}\uparrow, -\mathbf{k}\downarrow\rangle$ can be interpreted as a variable Fourier conjugate to the center
of the pair. (Exercise: Show this by inverse Fourier transform.) An equivalent interpretation of the correlation
function $C$ is that it describes the wandering of the Cooper pair under the influence of scattering.

Figure 6.5 Two-particle propagator in the presence of an (attractive) interaction. The two Green 
function lines defining each rung of the ladder carry momenta p + q and —p, respectively, where p 
is a free summation variable. The vertex of the propagator, defined through the second line, obeys 
the Bethe-Salpeter equation defined on the bottom right. 


EXERCISE Convince yourself that the ladder diagrams shown in the figure are the only diagrams 
that contain one free momentum summation per interaction vertex. 


As with the previous discussion in Section 5.3, the central part of the correlation function
is described by a vertex $\Gamma$. The diagrammatic definition of that object is shown in the
bottom right part of Fig. 6.5. Translating from diagrammatic to an algebraic formulation,
one obtains the Cooper version of a Bethe-Salpeter equation $\Gamma_q = g + \frac{gT}{L^d} \sum_p G_{p+q} G_{-p} \Gamma_q$,
where we have anticipated that a solution independent of the intermediate momenta can
be found. Solving this equation for $\Gamma_q$, we arrive at an equation structurally similar to
Eq. (5.44):

$$\Gamma_q = \frac{g}{1 - \frac{gT}{L^d} \sum_p G_{p+q} G_{-p}}. \tag{6.15}$$


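Drawing on the results of Problem 4.5, the frequency part of the summation over $p$ gives

$$\frac{T}{L^d} \sum_p G_{p+q} G_{-p}
= \frac{1}{L^d} \sum_{\mathbf{p}} \frac{1 - n_F(\xi_{\mathbf{p}+\mathbf{q}}) - n_F(\xi_{-\mathbf{p}})}{i\omega_m + \xi_{\mathbf{p}+\mathbf{q}} + \xi_{-\mathbf{p}}}
= \frac{1}{L^d} \sum_{\mathbf{p}} \left( \frac{1}{2} - n_F(\xi_{\mathbf{p}}) \right) \left[ \frac{1}{i\omega_m + \xi_{\mathbf{p}+\mathbf{q}} + \xi_{-\mathbf{p}}} + (\mathbf{q} \leftrightarrow -\mathbf{q}) \right],$$

where, in the last step, we have made use of the symmetry of the energy arguments, $\xi_{\mathbf{p}} = \xi_{-\mathbf{p}}$.
The summation for non-zero $q$ is left as an instructive exercise in Fermi-surface integration.
However, for the sake of our present argument, it will be sufficient to perform the sum for
zero external momentum $q = (\mathbf{0}, 0)$ (i.e. we will probe the fate of spatially homogeneous
and static pair configurations). Using the identity $\frac{1}{L^d} \sum_{\mathbf{p}} F(\epsilon_{\mathbf{p}}) = \int d\epsilon\, \nu(\epsilon) F(\epsilon)$ to replace
the momentum sum by an energy integral, and remembering that the pairing interaction is
limited to a thin shell around the Fermi surface, we then obtain

$$\frac{T}{L^d} \sum_p G_p G_{-p} = \int_{-\omega_D}^{\omega_D} d\epsilon\, \nu(\epsilon)\, \frac{1 - 2n_F(\epsilon)}{2\epsilon} \simeq \nu \int_T^{\omega_D} \frac{d\epsilon}{\epsilon} = \nu \ln\left( \frac{\omega_D}{T} \right), \tag{6.16}$$

where we have used the fact that, at energies $\epsilon \sim T$, the $1/\epsilon$ singularity of the integrand is
cut off by the Fermi distribution function. Substitution of this result back into the expression
for the vertex leads to the result

$$\Gamma_{(\mathbf{0},0)} \simeq \frac{g}{1 - g\nu \ln\left( \frac{\omega_D}{T} \right)}.$$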

From this, one can read off essential elements of the transition to the superconducting phase. 

> We first note that the interaction constant appears in combination with the density of
states, i.e. even a weak interaction can lead to sizeable effects if the density of states is
large enough. From our previous qualitative discussion it should be clear that the scaling
factor $\nu$ simply measures the number of final states accessible to the scattering mechanism
depicted in the figure on page 266.

> The net strength of the Cooper pair correlation grows upon increasing the energetic
range $\omega_D$ of the attractive force or, equivalently, on lowering the temperature. Obviously,
something drastic happens when $g\nu \ln(\omega_D/T) = 1$, i.e. when

$$T = T_c \equiv \omega_D \exp\left[ -\frac{1}{g\nu} \right].$$

At this critical temperature, the vertex develops a singularity. Since the vertex and the
correlation function are related by multiplication by a number of (non-singular) Green
functions, the same is true for the correlation function itself.

> As we will soon see, $T_c$ marks the transition temperature to the superconducting state.
At and below $T_c$ a perturbative approach based on the Fermi sea of the non-interacting
system as a reference state breaks down. The Cooper instability signals that we will have
to look for an alternative ground state or “mean-field,” i.e. one that accounts for the
strong binding of Cooper pairs.
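
The logarithm in Eq. (6.16) and the resulting critical temperature can be explored numerically. The sketch below is a rough illustration under simplifying assumptions: a constant density of states $\nu$ across the shell, the identity $1 - 2n_F(\epsilon) = \tanh(\epsilon/2T)$, and arbitrary parameter values. The function names are hypothetical. Evaluating the energy integral exactly gives $\ln(\omega_D/T)$ plus a constant, $\ln(2e^{\gamma}/\pi) \approx 0.126$ with $\gamma$ the Euler constant, for $\omega_D \gg T$. This standard refinement lies beyond the logarithmic accuracy of Eq. (6.16) and only modifies the prefactor of $T_c$.

```python
import numpy as np

EULER_GAMMA = 0.5772156649015329

def pair_susceptibility(T, omega_D, n=200_000):
    """(1/nu) * int_{-omega_D}^{omega_D} d eps (1 - 2 n_F(eps)) / (2 eps), constant DoS.

    Uses 1 - 2 n_F(eps) = tanh(eps / 2T); the integrand is even, so a midpoint
    rule on (0, omega_D) is sufficient.
    """
    d_eps = omega_D / n
    eps = (np.arange(n) + 0.5) * d_eps
    return np.sum(np.tanh(eps / (2.0 * T)) / eps) * d_eps

def vertex(T, g, g_nu, omega_D):
    """Leading-log vertex g / (1 - g nu ln(omega_D / T))."""
    return g / (1.0 - g_nu * np.log(omega_D / T))

def critical_temperature(g_nu, omega_D):
    """T_c = omega_D exp(-1 / (g nu)), the temperature at which the vertex diverges."""
    return omega_D * np.exp(-1.0 / g_nu)

omega_D, g_nu = 1.0, 0.3  # illustrative values, energies in units of omega_D
T_c = critical_temperature(g_nu, omega_D)
print("T_c =", T_c)
for T in (0.5, 0.1, 0.01):
    print(T, pair_susceptibility(T, omega_D), np.log(omega_D / T))
```

Because $T_c$ depends exponentially on $1/(g\nu)$, modest changes in the coupling or the density of states shift the transition temperature by large factors.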

In the next section, we explore the nature of the superconducting state from a complemen- 
tary perspective. 


Mean-field theory of superconductivity

The discussion of the previous section suggests that, at the transition, the system develops
an instability towards pair binding, or “condensation.” In the next section we build on
this observation to construct a quantitative approach, based on a Hubbard-Stratonovich
decoupling in the Cooper channel. However, for the moment, let us stay on a more informal
level and assume that the ground state $|\Omega_s\rangle$ of the theory is characterized by the presence
of a macroscopic number of Cooper pairs. More specifically, let us assume that the operator
$c_{-\mathbf{k}\downarrow} c_{\mathbf{k}\uparrow}$ acquires a non-vanishing ground state expectation value,

$$\Delta = \frac{g}{L^d} \sum_{\mathbf{k}} \langle \Omega_s | c_{-\mathbf{k}\downarrow} c_{\mathbf{k}\uparrow} | \Omega_s \rangle, \qquad
\bar\Delta = \frac{g}{L^d} \sum_{\mathbf{k}} \langle \Omega_s | c^\dagger_{\mathbf{k}\uparrow} c^\dagger_{-\mathbf{k}\downarrow} | \Omega_s \rangle, \tag{6.17}$$

where we have included the coupling constant of the theory for later convenience. The
assumption that $\Delta$ assumes non-zero values (vanishes) below (above) the transition temperature
$T_c$ is tantamount to declaring $\Delta$ to be the order parameter of the superconducting
transition. However, at the present stage, this statement has the status of a mere presumption;
we will have to explore its validity below.

At any rate, the non-vanishing expectation value of $\Delta$ looks strange. It clearly implies 
that the fermion many-body state $|\Omega_s\rangle$ cannot have a definite number of particles (cf. the 
coherent states). However, a better way to think about the problem is to remember the 
bosonic nature of the two-fermion pair state $|k\uparrow, -k\downarrow\rangle$. From this perspective, $c^\dagger_{k\uparrow}c^\dagger_{-k\downarrow}$ 
appears as the operator that creates a bosonic excitation. Non-vanishing of its expectation 
value implies a condensation phenomenon akin to the condensates discussed in Section 6.3. 
Indeed, much of the remainder of this section is devoted to the (semi-phenomenological) 
construction of a “bosonic” mean-field picture of the superconductor. 

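To develop this description, let us substitute 

$$\sum_k c_{-k+q\downarrow}c_{k\uparrow} = \frac{L^d\Delta}{g} + \underbrace{\left[\sum_k c_{-k+q\downarrow}c_{k\uparrow} - \frac{L^d\Delta}{g}\right]}_{\text{small}}$$

into the microscopic Hamiltonian and retain only those terms that appear as bilinears in the 
electron operators. Adding the chemical potential, and setting $\xi_k = \epsilon_k - \mu$, the "mean-field" 
Hamiltonian takes the form 

$$\hat H - \mu\hat N \simeq \sum_k\left[\xi_k c^\dagger_{k\sigma}c_{k\sigma} - \left(\bar\Delta\, c_{-k\downarrow}c_{k\uparrow} + \Delta\, c^\dagger_{k\uparrow}c^\dagger_{-k\downarrow}\right)\right] + \frac{L^d|\Delta|^2}{g},$$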

known (in the Russian literature) as the Bogoliubov or Gor'kov Hamiltonian after its 
authors (while the terminology Bogoliubov-de Gennes Hamiltonian has become more 
widespread in the Anglo-Saxon literature, reflecting the promotion of the mean-field description 
by de Gennes). 

Indeed, although perfectly Hermitian, the Gor'kov Hamiltonian does not conserve particle number. 
Instead, pairs of particles are born out of, and annihilated into, the vacuum. 

To bring the mean-field Hamiltonian to a diagonal form, we proceed in a manner analogous to that 
of Section 2.2 (where a Hamiltonian of similar structure appeared, namely 
$a^\dagger a + aa + a^\dagger a^\dagger$). Specifically, let us recast the fermion operators 
in a two-component Nambu spinor representation 

$$\Psi^\dagger_k = \left(c^\dagger_{k\uparrow},\ c_{-k\downarrow}\right), \qquad \Psi_k = \begin{pmatrix} c_{k\uparrow} \\ c^\dagger_{-k\downarrow} \end{pmatrix},$$
comprising $\uparrow$-creation and $\downarrow$-annihilation operators in a single object. It is then 
straightforward to show that the Hamiltonian assumes the bilinear form 

$$\hat H - \mu\hat N = \sum_k \Psi^\dagger_k \begin{pmatrix} \xi_k & -\Delta \\ -\bar\Delta & -\xi_k \end{pmatrix} \Psi_k + \sum_k \xi_k + \frac{L^d|\Delta|^2}{g}.$$


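Now, being bilinear in the Nambu operators, the mean-field Hamiltonian can be brought to 
a diagonal form by employing the unitary transformation$^{22}$ 

$$\chi_k \equiv \begin{pmatrix} \alpha_{k\uparrow} \\ \alpha^\dagger_{-k\downarrow} \end{pmatrix} = \begin{pmatrix} \cos\theta_k & \sin\theta_k \\ \sin\theta_k & -\cos\theta_k \end{pmatrix} \begin{pmatrix} c_{k\uparrow} \\ c^\dagger_{-k\downarrow} \end{pmatrix} \equiv U_k\Psi_k$$

(under which the anti-commutation relations of the new electron operators $\alpha_{k\sigma}$ are 
maintained - exercise). Note that the operators $\alpha^\dagger_{k\uparrow}$ involve superpositions of $c^\dagger_{k\uparrow}$ and $c_{-k\downarrow}$, i.e. 
the quasi-particle states created by these operators contain linear combinations of particle 
and hole states. Choosing $\Delta$ to be real$^{23}$ and setting $\tan(2\theta_k) = -\Delta/\xi_k$, i.e. $\cos(2\theta_k) = 
\xi_k/\lambda_k$, $\sin(2\theta_k) = -\Delta/\lambda_k$, where 

$$\lambda_k = \left(\Delta^2 + \xi_k^2\right)^{1/2}, \qquad (6.18)$$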


the transformed Hamiltonian takes the form (exercise) 

$$\hat H - \mu\hat N = \sum_{k\sigma}\lambda_k\,\alpha^\dagger_{k\sigma}\alpha_{k\sigma} + \sum_k(\xi_k - \lambda_k) + \frac{\Delta^2L^d}{g}. \qquad (6.19)$$

This result shows that the elementary excitations, the Bogoliubov quasi-particles, created 
by $\alpha^\dagger_{k\sigma}$, have a minimum energy $\Delta$ known as the energy gap. The full dispersion 
$\pm\lambda_k$ is shown in the figure above. Due to the energy gap separating filled from empty 
quasi-particle states, elementary excitations are difficult to excite at low temperatures, implying 
a rigidity of the ground state. 
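The diagonalization is easy to check numerically for a single momentum. The sketch below assumes a real $\Delta$ and the rotation defined through $\cos(2\theta_k) = \xi_k/\lambda_k$, $\sin(2\theta_k) = -\Delta/\lambda_k$; the helper names are hypothetical. It rotates the $2\times2$ Nambu matrix with entries $(\xi_k, -\Delta; -\Delta, -\xi_k)$ and should return $\mathrm{diag}(\lambda_k, -\lambda_k)$ up to rounding.

```python
import math


def matmul2(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def bogoliubov_rotation(xi: float, delta: float):
    """Return (lambda_k, U_k) with cos(2*theta) = xi/lambda, sin(2*theta) = -delta/lambda."""
    lam = math.hypot(xi, delta)
    theta = 0.5 * math.atan2(-delta, xi)
    c, s = math.cos(theta), math.sin(theta)
    return lam, [[c, s], [s, -c]]


def rotated_hamiltonian(xi: float, delta: float):
    """U_k h_k U_k; U_k is real, symmetric and squares to one, so U_k^{-1} = U_k."""
    _, u = bogoliubov_rotation(xi, delta)
    h = [[xi, -delta], [-delta, -xi]]
    return matmul2(matmul2(u, h), u)


if __name__ == "__main__":
    for xi in (-0.7, 0.0, 0.4):
        lam, _ = bogoliubov_rotation(xi, 0.3)
        print(xi, lam, rotated_hamiltonian(xi, 0.3))
```

The lower diagonal entry $-\lambda_k$ belongs to the hole-like component of the Nambu spinor; rewriting it in terms of $\alpha^\dagger_{-k\downarrow}\alpha_{-k\downarrow}$ produces the positive excitation energy and the constant $\sum_k(\xi_k - \lambda_k)$ of Eq. (6.19).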

To determine the ground state wavefunction one simply has to identify the vacuum state 
of the algebra $\{\alpha_k, \alpha^\dagger_k\}$, i.e. the state that is annihilated by all the quasi-particle annihilation 
operators $\alpha_{k\sigma}$. This condition is met uniquely by the state 

$$|\Omega_s\rangle \equiv \prod_k \alpha_{k\uparrow}\alpha_{-k\downarrow}|\Omega\rangle \sim \prod_k\left(\cos\theta_k - \sin\theta_k\, c^\dagger_{k\uparrow}c^\dagger_{-k\downarrow}\right)|\Omega\rangle,$$

where $|\Omega\rangle$ represents the vacuum state of the fermion operator algebra $\{c_k, c^\dagger_k\}$, and $\sin\theta_k = 
\sqrt{(1 - \xi_k/\lambda_k)/2}$. Since the vacuum state of any algebra of canonically conjugate operators 
is unique, the state $|\Omega_s\rangle$ must, up to normalization, be the vacuum state. From the 
representation given above, it is straightforward to verify that the normalization is unity 
(exercise). 


22 It is instructive to compare with the previous analysis of Section 2.2 where the bosonic nature of the problem 
enforced diagonalization by a non-compact pseudo-unitary transformation. 

23 If $\Delta = |\Delta|e^{i\phi}$ is not real, it can always be made so by the global gauge transformation $c_a \to e^{i\phi/2}c_a$, $c^\dagger_a \to 
e^{-i\phi/2}c^\dagger_a$. Notice the similarity to the gauge freedom that led to Goldstone mode formation in the previous 
section! Indeed, we will see momentarily that the gauge structure of the superconductor has equally far-reaching 
consequences. 
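Finally, we need to solve Eq. (6.17) self-consistently for the input parameter $\Delta$: 

$$\Delta = \frac{g}{L^d}\sum_k\langle\Omega_s|c_{-k\downarrow}c_{k\uparrow}|\Omega_s\rangle = -\frac{g}{L^d}\sum_k\sin\theta_k\cos\theta_k = \frac{g}{2L^d}\sum_k\frac{\Delta}{(\Delta^2+\xi_k^2)^{1/2}} \simeq \frac{g\nu\Delta}{2}\int_{-\omega_D}^{\omega_D}\frac{d\xi}{(\Delta^2+\xi^2)^{1/2}} = g\nu\Delta\,\sinh^{-1}(\omega_D/\Delta), \qquad (6.20)$$

where we have assumed that the pairing interaction $g$ uniformly extends over an energy 
scale $\omega_D$ (over which the density of states $\nu$ is roughly constant). Rearranging this equation 
for $\Delta$, we obtain the important relation 

$$\Delta = \frac{\omega_D}{\sinh(1/g\nu)} \simeq 2\omega_D\exp\left[-\frac{1}{g\nu}\right], \qquad g\nu \ll 1.$$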

This is the second time that we have encountered the combination of energy scales on 
the right-hand side of the equation. Previously we identified $T_c = \omega_D\exp[-(g\nu)^{-1}]$ as the 
transition temperature at which the Cooper instability takes place. Our current discussion 
indicates that $T_c$ and the quasi-particle energy gap $\Delta$ at $T = 0$ coincide up to a numerical 
factor of order unity. In fact, that 
identification might have been anticipated from our discussion above. At temperatures $T < 
\Delta$, thermal fluctuations are not capable of exciting quasi-particle states above the ground 
state. One thus expects that $T_c \sim \Delta$ separates a low-temperature phase, characterized by the 
features of the anomalous pairing ground state, from a "Fermi-liquid-like" high-temperature 
phase where free quasi-particle excitations prevail. 
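As a quick numerical illustration of the zero-temperature gap, the sketch below evaluates the closed form $\Delta = \omega_D/\sinh(1/g\nu)$ obtained by inverting $1 = g\nu\sinh^{-1}(\omega_D/\Delta)$, and compares it with the weak-coupling estimate $2\omega_D e^{-1/g\nu}$. The function names and the chosen coupling strengths are illustrative assumptions.

```python
import math


def zero_temperature_gap(omega_d: float, g_nu: float) -> float:
    """Solution of 1 = g*nu*asinh(omega_D/Delta)."""
    return omega_d / math.sinh(1.0 / g_nu)


def weak_coupling_gap(omega_d: float, g_nu: float) -> float:
    """Leading behaviour 2*omega_D*exp(-1/(g*nu)) for g*nu << 1."""
    return 2.0 * omega_d * math.exp(-1.0 / g_nu)


def gap_equation_rhs(delta: float, omega_d: float, g_nu: float) -> float:
    """g*nu*asinh(omega_D/Delta); equals 1 at the self-consistent gap."""
    return g_nu * math.asinh(omega_d / delta)


if __name__ == "__main__":
    for g_nu in (0.2, 0.3, 0.5, 1.0):
        exact = zero_temperature_gap(1.0, g_nu)
        approx = weak_coupling_gap(1.0, g_nu)
        print(f"g*nu = {g_nu:4.2f}   exact = {exact:.6f}   weak coupling = {approx:.6f}")
```

The ratio of the two expressions is $1 - e^{-2/g\nu}$, so the exponential form is accurate only for $g\nu$ well below unity. That exponential is non-analytic in $g$, which is why no finite order of perturbation theory in the coupling can produce the gap.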

In the mean-field approximation, the ground state $|\Omega_s\rangle$ and its quasi-particle excitations 
formally diagonalize the BCS Hamiltonian. Before proceeding with the further development 
of the theory, let us pause to discuss a number of important properties of these states. 


Ground state 

In the limit $\Delta \to 0$, $\sin^2\theta_k \to \Theta(\mu - \epsilon_k)$, and the ground state collapses to the filled 
Fermi sea with chemical potential $\mu$. As $\Delta$ becomes non-zero, states in the vicinity of the 
Fermi surface rearrange themselves into a condensate of paired states. The latter involves the 
population of single-particle states with energy $\epsilon_k > \mu$. (This follows simply from the energy dependence 
of the weight function $\sin\theta_k$ entering the definition of the ground state - see the figure.) 
However, it is straightforward to show that, for any value $g > 0$, the total energy of the 
ground state, $E_{|\Omega_s\rangle} \equiv \langle\Omega_s|\hat H - \mu\hat N|\Omega_s\rangle = \sum_k(\xi_k - \lambda_k) + \Delta^2L^d/g$, is lower than the energy 
$E_0 \equiv 2\sum_{|k|<k_F}\xi_k$ of the Fermi sphere when $g = 0$: 

EXERCISE To show that $E_{|\Omega_s\rangle} < E_0$, it is convenient to represent the ground state energy of 
the Fermi sea as $E_0 = \lim_{\Delta\to0}E_{|\Omega_s\rangle}$. Use this representation (and the solution of the mean-field 
equation) to verify that the superconductor ground state energy lies below that of the 
uncorrelated Fermi sea. It is also instructive to ask for the minimum value $E_{|\Omega_s\rangle}$ may assume 
upon variation of $\Delta$ for fixed $g$. Show that the solution of the variational equation $\partial_\Delta E_{|\Omega_s\rangle} = 0$ 
leads back to the mean-field equation for $\Delta$ discussed above. 
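The exercise can be explored numerically. The sketch below assumes a constant density of states $\nu$ and pairing confined to the shell $|\xi| < \omega_D$, so that $(E_{|\Omega_s\rangle} - E_0)/L^d = 2\nu\int_0^{\omega_D}d\xi\,(\xi - \lambda(\xi)) + \Delta^2/g$, where $E_0 = \sum_k(\xi_k - |\xi_k|)$ has been subtracted. These modelling choices, the midpoint quadrature, and the function name are assumptions made for illustration.

```python
import math


def condensation_energy_density(delta: float, omega_d: float, g_nu: float,
                                nu: float = 1.0, steps: int = 20000) -> float:
    """(E_s(Delta) - E_0)/L^d for a trial gap Delta, with coupling g = g_nu/nu."""
    h = omega_d / steps
    integral = 0.0
    for i in range(steps):
        xi = (i + 0.5) * h                      # midpoint rule on [0, omega_D]
        integral += (xi - math.hypot(xi, delta)) * h
    return 2.0 * nu * integral + delta ** 2 * nu / g_nu


if __name__ == "__main__":
    omega_d, g_nu = 1.0, 0.3
    gap = omega_d / math.sinh(1.0 / g_nu)       # solution of the T = 0 gap equation
    for factor in (0.0, 0.5, 0.8, 1.0, 1.2, 1.5):
        energy = condensation_energy_density(factor * gap, omega_d, g_nu)
        print(f"Delta/Delta_gap = {factor:3.1f}   energy difference = {energy:+.6e}")
```

Under these assumptions the energy difference is negative at the self-consistent gap, is smallest there when $\Delta$ is varied at fixed $g$, and approaches $-\nu\Delta^2/2$ for $\Delta \ll \omega_D$.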


Excitations 

It is very important to distinguish between quasi-particle states and “excitations.” Quasi- 
particle states are the eigenstates of the BCS Hamiltonian. Their energy-momentum relation 
is shown in the figure on page 271. Notice that there is a positive- and a negative-energy 
branch of quasi-particles. In the limit $\Delta \to 0$, the quasi-particles evolve into ordinary 
electrons. By contrast, the energy of excitations (as created by the operators $\alpha^\dagger_{k\sigma}$) is always 
positive. An excitation can be either the creation of a quasi-particle at positive energy 
or the elimination of a quasi-particle (the creation of a quasi-hole) at negative energy. 
(In the ground state, all negative-energy quasi-particle states are filled.) As $\Delta \to 0$, the 
excitation operators evolve into the operator algebra introduced in Eq. (2.21). Notice that 
the total number of excitations is equal to the number of quasi-particle states. However, 
their density of states (non-vanishing for positive energies only) is twice as large. This is 
because the dispersion of excitations is obtained by superimposing the positive branch of the 
quasi-particle spectrum on the sign-inverted negative branch. For a particle-hole symmetric 
system, the quasi-particle spectrum is invariant under sign inversion, which implies that the 
two branches contribute equally to the density of excitations. 

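After these general remarks we turn to the specific discussion of the excitations of the BCS 
superconductor. According to Eq. (6.19), the excitation spectrum is gapped, i.e. it takes a 
minimum energy $\Delta$ to create an excitation above the BCS ground state; the formerly 
($g = 0$) continuous spectrum has acquired a gap. To better understand the profile of the 
spectrum, let us compute the density of excitations $\rho(\epsilon)$,$^{24}$ in the vicinity of the 
Fermi surface, $\epsilon \simeq \mu$: 

$$\rho(\epsilon) = \frac{1}{L^d}\sum_{k\sigma}\delta(\epsilon - \lambda_k) = \int d\xi\,\underbrace{\frac{1}{L^d}\sum_{k\sigma}\delta(\xi - \xi_k)}_{\nu(\xi)}\,\delta(\epsilon - \lambda(\xi)) \approx \nu\sum_{s=\pm1}\int d\xi\,\frac{\delta\left(\xi - s[\epsilon^2 - \Delta^2]^{1/2}\right)}{\left|\partial_\xi[\xi^2 + \Delta^2]^{1/2}\right|} = \frac{2\nu\,\Theta(\epsilon - \Delta)\,\epsilon}{(\epsilon^2 - \Delta^2)^{1/2}}.$$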


A schematic plot of the BCS quasi-particle density of states is shown in the figure above. 
It is apparent that the spectral weight of the quasi-particles has been transferred from 
the Fermi surface to the interval $[\Delta, \infty)$. The divergence at $\Delta$ signals that the majority of 
quasi-particle states populate the spectral region just above the gap. 
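The transfer of spectral weight can be checked by direct counting. The sketch below places levels on a uniform grid in $\xi$, carrying $\nu$ levels per unit energy, and counts how many of the energies $\lambda(\xi) = (\xi^2 + \Delta^2)^{1/2}$ lie below a given $E$. The grid size, the energy window, and the function name are illustrative assumptions. Integrating the density of excitations $2\nu\,\epsilon/(\epsilon^2 - \Delta^2)^{1/2}$ from $\Delta$ to $E$ gives $2\nu(E^2 - \Delta^2)^{1/2}$, which is the number the count should reproduce.

```python
import math


def count_excitations_below(energy: float, delta: float, nu: float = 1.0,
                            half_width: float = 10.0, levels: int = 200001) -> float:
    """Number of levels with sqrt(xi^2 + delta^2) < energy on a uniform xi grid.

    The grid covers [-half_width, half_width] and carries nu levels per unit xi.
    """
    spacing = 2.0 * half_width / (levels - 1)
    count = sum(1 for i in range(levels)
                if math.hypot(-half_width + i * spacing, delta) < energy)
    return count * spacing * nu


if __name__ == "__main__":
    delta = 0.1
    for energy in (0.05, 0.11, 0.5, 5.0):
        counted = count_excitations_below(energy, delta)
        expected = 2.0 * math.sqrt(max(energy ** 2 - delta ** 2, 0.0))
        print(f"E = {energy:5.2f}   counted = {counted:.4f}   2*nu*sqrt(E^2-Delta^2) = {expected:.4f}")
```

No levels are found below the gap, and for $E \gg \Delta$ the count approaches $2E\nu$. The states missing from the window $|\xi| < \Delta$ have accumulated just above the gap edge.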


24 To distinguish the density of states of the quasi-particles ($\nu$) from that of the excitations, we denote the latter 
by $\rho$. 

EXERCISE Integrate the result above to confirm that $\int_0^E d\epsilon\,\rho(\epsilon) \simeq 2E\nu$ for $E \gg \Delta$. This demonstrates 
that the excitation density of states is indeed twice as large as that of the quasi-particles. 

The phenomenological analysis above explains important aspects of the physics of the BCS 
superconductor: the instability towards condensation, and the presence of a gap for quasi- 
particle excitations above the ground state. Indeed, it would be tempting to take the latter 
phenomenon as an explanation for the absence of electric resistivity below the transition 
temperature, i.e. if no low-lying quasi-particle excitations are available, should not external 
current be able to flow through the system without energy dissipation? Yet that picture 
neglects the most important excitation of the system, i.e. the collective phase mode. Above 
we made the ad hoc decision to set the phase of the order parameter to zero. However, 
as with the superfluid, the phase represents a Goldstone mode and we must expect its 
presence to have important consequences. Indeed, it will turn out that the phase mode of the 
superconductor - in contrast to its counterpart in the “neutral” superfluid - is responsible 
for much of the electromagnetic phenomena displayed by the superconductor. 

INFO The discussion above may convey the impression that the amplitude of the supercon- 
ducting order parameter is spatially uniform, with a value set by the mean field equation. It is 
important to realize, however, that there are superconductor species for which $|\Delta|$ exhibits strong 
fluctuations in space. Since it is the amplitude of the order parameter that determines the size 
of the quasi-particle gap, fluctuations of $|\Delta|$ lead to important phenomenological consequences, 
especially if there are regions for which $|\Delta| \to 0$. 

Without question the most important class of superconductors with spatially varying order 
parameters are the d-wave superconductors. Firstly, practically all high temperature super- 
conductors belong to this family, i.e. d-wave superconductivity is a field of profound interest from 
both a fundamental and an applied point of view. Secondly, the physical properties of the d-wave 
superconductor differ drastically from those of conventional superconductors. The reason is that 
it is “gapless” in the sense that its Brillouin zone contains four distinct points at which the order 
parameter vanishes. These “hot spots” are exceptional in that quasi-particles of arbitrarily low 
energy coexist with the order parameter condensate. The list of many exotic phenomena induced 
by this coexistence - for a detailed discussion of which we refer to the literature - includes 
unconventional heat conduction and anomalous Josephson and Meissner effects. 

EXERCISE The order parameter of a lattice d-wave superconductor is given by $\Delta_k = 
\Delta_0(\cos(k_xa) - \cos(k_ya))$, where $k = (k_x, k_y)^T$ is a two-dimensional lattice momentum and $a$ 
the lattice spacing.$^{25}$ At two lines in momentum space, $k_x = \pm k_y$, the order parameter vanishes 
and low-energy quasi-particles persist. Assuming that the kinetic energy of the lattice problem 
is given by $t_k = -t(\cos(k_xa) + \cos(k_ya))$, compute the quasi-particle energies of the Bogoliubov 
Hamiltonian $\hat H = \sum_k\Psi^\dagger_k\left((t_k - \mu)\sigma_3 + \Delta_k\sigma_1\right)\Psi_k$. Show that, at four points in the Brillouin zone, 
$k = (\pm1, \pm1)\pi/2a$, the quasi-particle energy vanishes. Linearize the Hamiltonian in momentum 
space around one of these "hot spots," say $(+1, +1)\pi/2a$, to show that it assumes the form 

$$H_{++} = \begin{pmatrix} ta(k_x + k_y) & \Delta_0a(k_x - k_y) \\ \Delta_0a(k_x - k_y) & -ta(k_x + k_y) \end{pmatrix}.$$

Apply a unitary transformation to bring $H_{++}$ to the form of a two-dimensional Dirac Hamiltonian 
$H_{++} \to k_1\sigma_1 + k_2\sigma_2$, where $k_{1/2} = k_x \pm k_y$, and we have rescaled $k_1 \to (\Delta_0/t)k_1$ and set 
$v_F = 1$. Show that the quasi-particle density of states of this Hamiltonian is given by $\nu(\epsilon) \sim |\epsilon|$. 

25 The origin of the designation "d-wave" is that it transforms like an $l = 2$ angular momentum state (sign change 
under a 90° rotation) under the action of the rotation group. On the same basis, ordinary superconductors 
are sometimes called s-wave superconductors. Less frequently occurring are superconductors of p-wave or other 
angular momentum symmetry. 
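A short numerical check of the nodal structure in the exercise: the sketch below evaluates the quasi-particle energy $E_k = [(t_k - \mu)^2 + \Delta_k^2]^{1/2}$ of the $2\times2$ Bogoliubov matrix, assuming $\mu = 0$ (the value for which the normal-state Fermi surface passes through $(\pm1,\pm1)\pi/2a$). It compares $E_k$ near a node with the linearized, Dirac-like form. The parameter values and function names are hypothetical.

```python
import math


def dwave_energy(kx: float, ky: float, t: float = 1.0, delta0: float = 0.3,
                 a: float = 1.0, mu: float = 0.0) -> float:
    """Positive eigenvalue of (t_k - mu)*sigma_3 + Delta_k*sigma_1."""
    t_k = -t * (math.cos(kx * a) + math.cos(ky * a))
    delta_k = delta0 * (math.cos(kx * a) - math.cos(ky * a))
    return math.hypot(t_k - mu, delta_k)


def linearized_energy(qx: float, qy: float, t: float = 1.0, delta0: float = 0.3,
                      a: float = 1.0) -> float:
    """Energy of the linearized Hamiltonian; (qx, qy) is measured from the node."""
    return a * math.hypot(t * (qx + qy), delta0 * (qx - qy))


if __name__ == "__main__":
    node = math.pi / 2.0
    for sx in (+1, -1):
        for sy in (+1, -1):
            print((sx, sy), dwave_energy(sx * node, sy * node))
    qx, qy = 1e-3, 2e-3
    print(dwave_energy(node + qx, node + qy), linearized_energy(qx, qy))
```

The energy grows linearly with distance from the node, with different velocities along and across the Fermi surface ($ta$ versus $\Delta_0a$). This anisotropic cone underlies the linear density of states $\nu(\epsilon) \sim |\epsilon|$ asked for at the end of the exercise.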


From a modern perspective, a comprehensive picture of the superconductor, encompassing 
the consequences of Goldstone mode formation, and the BCS structures discussed above, 
is most efficiently developed within the field integral approach. It is to this framework that 
we now turn. 


Superconductivity from the field integral 

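To investigate the BCS transition within the framework of the coherent state path integral, 
we start out from a coordinate representation of the BCS Hamiltonian (6.14), 

$$\hat H_{\rm BCS} = \int d^dr\, c^\dagger_\sigma(\mathbf r)\left[\frac{1}{2m}(-i\nabla - e\mathbf A)^2 + e\phi - \mu\right]c_\sigma(\mathbf r) - g\int d^dr\, c^\dagger_\uparrow(\mathbf r)c^\dagger_\downarrow(\mathbf r)c_\downarrow(\mathbf r)c_\uparrow(\mathbf r),$$

where, once again, summation over repeated spin indices is implied. Anticipating the emergence 
of non-trivial electromagnetic phenomena, we have coupled the single-particle Hamiltonian 
to a vector potential $\mathbf A$ and a scalar potential $\phi$. The origin and physical consequences 
of these fields are discussed somewhat later. Expressed in the form of the coherent 
state path integral, the corresponding quantum partition function takes the form $Z = 
\int D(\bar\psi, \psi)\, e^{-S[\bar\psi,\psi]}$,$^{26}$ 

$$S[\bar\psi, \psi] = \int_0^\beta d\tau\int d^dr\left[\bar\psi_\sigma\left(\partial_\tau + ie\phi + \frac{1}{2m}(-i\nabla - e\mathbf A)^2 - \mu\right)\psi_\sigma - g\,\bar\psi_\uparrow\bar\psi_\downarrow\psi_\downarrow\psi_\uparrow\right], \qquad (6.22)$$

where $\psi_\sigma(\mathbf r, \tau)$ denote Grassmann fields. 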

INFO In the field theoretical literature, the substitution 

$$\partial_\tau \to \partial_\tau + ie\phi, \qquad -i\nabla \to -i\nabla - e\mathbf A, \qquad (6.23)$$

is sometimes called a minimal coupling of an electromagnetic field. It is "minimal" in the 
sense that only orbital and potential coupling of the field are taken into account. (For example, 
the field-spin interaction is neglected.) At the same time, the potential-vector-potential 
coupling is complete enough to endow the theory with a local gauge invariance under U(1) 
transformations 

$$\psi \to e^{i\theta}\psi, \qquad \bar\psi \to e^{-i\theta}\bar\psi, \qquad \phi \to \phi - e^{-1}\partial_\tau\theta, \qquad \mathbf A \to \mathbf A + e^{-1}\nabla\theta, \qquad (6.24)$$

where $\theta = \theta(\tau, \mathbf x)$ is an arbitrary space/time-dependent phase configuration. The minimal coupling 
introduces the general quantum electrodynamical gauge principle into the theory. This 
should be compared with the discussion of the neutral superfluid where only invariance under 
global U(1) transformations was required. 

26 Notice that the Euclidean character of the imaginary time theory is responsible for the additional factor of 
"i," i.e. as our $\phi$ transforms like a time-derivative under Wick rotation, it is given by $(-i)\times$ the conventional 
scalar potential. 

In the following, we will often use units wherein fi — 1. 


As usual the quartic interaction of the fields prevents the partition function from being 
evaluated explicitly. Moreover, anticipating the existence of a transition of the electron gas 
to a condensed phase in which electrons in the vicinity of the Fermi surface are paired, we 
can expect that a perturbative expansion in the coupling constant g will be inadequate. 
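Motivated by the mean-field theory discussed above, we will instead introduce a bosonic 
field $\Delta$, which will have the physical significance of the order parameter, to decouple the 
interaction. The decoupling is arranged using a Hubbard-Stratonovich transformation in 
the Cooper channel (cf. the discussion on page 244) 

$$\exp\left[g\int d\tau\, d^dr\,\bar\psi_\uparrow\bar\psi_\downarrow\psi_\downarrow\psi_\uparrow\right] = \int D(\bar\Delta, \Delta)\exp\left\{-\int d\tau\, d^dr\left[\frac{1}{g}|\Delta|^2 - \left(\bar\Delta\,\psi_\downarrow\psi_\uparrow + \Delta\,\bar\psi_\uparrow\bar\psi_\downarrow\right)\right]\right\},$$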

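where $\Delta(\mathbf r, \tau)$ represents a dynamically fluctuating complex field. Reflecting the behavior 
of the bilinear $\psi_\downarrow\psi_\uparrow$, it obeys the periodic boundary condition, $\Delta(0) = \Delta(\beta)$, i.e. $\Delta$ is 
a bosonic field variable. Were we to take $\Delta$ to be homogeneous in space and time, the 
quantum Hamiltonian corresponding to the action would coincide with the mean-field 
Hamiltonian considered in the previous section. Following that analysis (but not making 
the a priori assumption $\Delta(\mathbf r, \tau) = $ const.) we turn to the Nambu spinor representation 

$$\bar\Psi = \left(\bar\psi_\uparrow,\ \psi_\downarrow\right), \qquad \Psi = \begin{pmatrix}\psi_\uparrow \\ \bar\psi_\downarrow\end{pmatrix},$$

comprising particle and hole degrees of freedom in a single object. Expressed in terms of 
the Nambu spinor, the partition function takes the form 

$$Z = \int D(\bar\psi, \psi)\int D(\bar\Delta, \Delta)\exp\left\{-\int d\tau\, d^dr\left[\frac{1}{g}|\Delta|^2 - \bar\Psi\,\hat{\mathcal G}^{-1}\Psi\right]\right\},$$

where 

$$\hat{\mathcal G}^{-1} = \begin{pmatrix}[\hat G_0^{(p)}]^{-1} & \Delta \\ \bar\Delta & [\hat G_0^{(h)}]^{-1}\end{pmatrix} \qquad (6.25)$$

is known as the Gor'kov Green function, and $[\hat G_0^{(p)}]^{-1} = -\partial_\tau - ie\phi - \frac{1}{2m}(-i\nabla - e\mathbf A)^2 + \mu$, 
$[\hat G_0^{(h)}]^{-1} = -\partial_\tau + ie\phi + \frac{1}{2m}(+i\nabla - e\mathbf A)^2 - \mu$ represent the non-interacting Green functions 
of the particle and hole respectively. 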


INFO Computing the Nambu representation of the action for a general single-particle Hamiltonian 
$\hat H$, one finds that $[\hat G_0^{(p)}]^{-1} = -\partial_\tau - \hat H + \mu$, $[\hat G_0^{(h)}]^{-1} = -\partial_\tau + \hat H^T - \mu$. (With $\hat H = \frac{1}{2m}(\hat{\mathbf p} - e\mathbf A)^2 + ie\phi$, the 
expression above is identified as a special case of the more general form.) This representation is 
actually very revealing: it tells us that the Green function of a hole is obtained from that of the 
electron by a sign change $\hat H \to -\hat H$ (the energy of a hole is the negative of the corresponding 
particle energy) followed by transposition $-\hat H \to -\hat H^T$, i.e. a quantum time reversal operation 
(a hole can be imagined as a particle propagating backwards in time). Noting that $\hat{\mathbf p}^T = -\hat{\mathbf p}$ 
and $\hat{\mathbf x}^T = \hat{\mathbf x}$, the pair of Green functions can equivalently be represented through 

$$[\hat G_0^{(p)}]^{-1} = -\partial_\tau - \hat H(\mathbf x, \mathbf p) + \mu, \qquad [\hat G_0^{(h)}]^{-1} = -\partial_\tau + \hat H(\mathbf x, -\mathbf p) - \mu. \qquad (6.26)$$

Bilinear in $\Psi$, the Gaussian integration over the Grassmann fields can now be performed 
straightforwardly, and yields the formal expression (cf. the analogous formula (6.4) for the 
normal electron system) 

$$Z = \int D(\bar\Delta, \Delta)\exp\left[-\frac{1}{g}\int d\tau\, d^dr\,|\Delta|^2 + \ln\det\hat{\mathcal G}^{-1}\right]. \qquad (6.27)$$

By introducing a Hubbard-Stratonovich decoupling of the local interaction, we have suc- 
ceeded in expressing the quantum partition function as a path integral over an auxiliary 
bosonic field A. Further progress is possible only within some approximation. 


Mean-field theory 

To identify a suitable "platform" around which the action can be expanded perturbatively, 
we begin by identifying the mean-field configurations of $\Delta$. A variation of the 
action with respect to $\Delta$ generates the mean-field equation (for the differentiation of the 
"tr ln" with respect to $\Delta$ we refer to the analogous calculation in Eq. (6.5)), $g^{-1}\bar\Delta(\tau, \mathbf x) - 
\mathrm{tr}\,[\hat{\mathcal G}(\tau, \mathbf x; \tau, \mathbf x)E^{ph}_{12}] = 0$, where $E^{ph}_{ij}$ is a $2\times2$ matrix in Nambu space that takes the value 
of unity at position $(i,j)$ and is zero otherwise. Assuming that configurations extremizing 
the action will be spatially and temporally homogeneous, i.e. $\Delta(\tau, \mathbf x) = \Delta_0 = $ const., and, 
temporarily ignoring the dependence of the Green function on the field $(\phi, \mathbf A)$, the equation 
simplifies to 

$$\begin{aligned}
\frac{1}{g}\bar\Delta_0 &= \left[\begin{pmatrix} -\partial_\tau + \frac{\nabla^2}{2m} + \mu & \Delta_0 \\ \bar\Delta_0 & -\partial_\tau - \frac{\nabla^2}{2m} - \mu \end{pmatrix}^{-1}\right]_{21}(\tau, \mathbf x; \tau, \mathbf x) \\
&= \frac{T}{L^d}\sum_{p,n}\left[\begin{pmatrix} i\omega_n - \xi_p & \Delta_0 \\ \bar\Delta_0 & i\omega_n + \xi_p \end{pmatrix}^{-1}\right]_{21} = \frac{T}{L^d}\sum_{p,n}\frac{\bar\Delta_0}{\omega_n^2 + \xi_p^2 + |\Delta_0|^2},
\end{aligned}$$

where, in the second line, we have switched to a frequency-momentum representation and, 
as usual, $\xi_p = p^2/2m - \mu$. Rearranging the equation, we arrive at $\frac{1}{g} = \frac{T}{L^d}\sum_{p,n}\frac{1}{\omega_n^2 + \lambda_p^2}$, 
where $\lambda_p = (\xi_p^2 + |\Delta_0|^2)^{1/2}$ (cf. Eq. (6.18)). The Matsubara summation can be performed 
by means of the summation techniques discussed on page 170 (see also the problem set of 
Chapter 4), after which one obtains 

$$\frac{1}{g} = \frac{1}{L^d}\sum_p\frac{1 - 2n_F(\lambda_p)}{2\lambda_p} = \int_{-\omega_D}^{\omega_D}d\xi\,\nu(\xi)\,\frac{1 - 2n_F(\lambda(\xi))}{2\lambda(\xi)},$$
where we have taken into account the fact that the range of the attractive interaction is 
limited by $\omega_D$. Noting that the integrand is even in $\xi$, and making use of the identity 
$1 - 2n_F(\epsilon) = \tanh(\epsilon/2T)$, we arrive at the celebrated BCS gap equation 

$$\frac{1}{g\nu} = \int_0^{\omega_D}d\xi\,\frac{\tanh(\lambda(\xi)/2T)}{\lambda(\xi)}, \qquad (6.28)$$


where, as usual, we have assumed that the density of states $\nu(\xi) \simeq \nu$ is specified by its value 
at the Fermi surface. For temperatures $T \ll \Delta_0$, we may approximate $\tanh(\lambda/2T) \simeq 1$ and 
we arrive back at the $T = 0$ gap equation (6.20) analyzed above. However, here we wish to be 
more ambitious and explore the fate of the gap as the temperature is increased. Intuitively, 
one would expect that, for large temperatures, thermal fluctuations will eventually wash out 
the gap. On the other hand, we know empirically that the onset of superconductivity has the 
character of a second order phase transition (see Chapter 8). Since the gap parameter $\Delta_0$ 
has the status of the order parameter of the transition - an identification to be substantiated 
shortly - we must expect that the vanishing of $\Delta_0(T)$ occurs in a singular manner (in 
analogy to, e.g., the magnetization of a ferromagnet at the Curie temperature). Indeed, 
it turns out (see Problem 6.7) that the order parameter vanishes abruptly at the critical 
temperature of the BCS transition, 

$$T_c = \text{const.}\times\omega_D\exp\left[-\frac{1}{g\nu}\right], \qquad (6.29)$$

where the numerical constant is $O(1)$. Notice that this result is consistent with our perturbative 
analysis above. For temperatures slightly smaller than $T_c$ (see Problem 6.7), the order parameter, 

$$\Delta_0 = \text{const.}\times\sqrt{T_c(T_c - T)}, \qquad (6.30)$$

scales as the square root of $(T_c - T)/T_c$, i.e. the vanishing occurs with a diverging derivative, 
as is typical for second-order phase transitions. The interpolated temperature profile of 
the order parameter is shown in Fig. 6.6. Notice that (again, up to numerical factors) the 
critical temperature $T_c$ coincides with the zero-temperature value of the gap $\Delta_0(0)$. The 
square-root profile of the gap function has been accurately confirmed by experiment (see 
Fig. 6.6). 
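The temperature dependence of the gap can be traced numerically from the gap equation (6.28). The sketch below is a minimal illustration that assumes a constant density of states and uses a midpoint quadrature and bisection. The helper names, grid sizes, and the coupling $g\nu = 0.3$ are arbitrary choices. For orientation, the standard weak-coupling results are $T_c \simeq 1.13\,\omega_D e^{-1/g\nu}$ and $2\Delta_0(0)/T_c \simeq 3.53$.

```python
import math


def gap_integral(delta: float, temp: float, omega_d: float = 1.0, steps: int = 4000) -> float:
    """Midpoint estimate of the integral of tanh(lambda/2T)/lambda over xi in [0, omega_D]."""
    h = omega_d / steps
    total = 0.0
    for i in range(steps):
        lam = math.hypot((i + 0.5) * h, delta)
        total += math.tanh(lam / (2.0 * temp)) / lam * h
    return total


def bisect(func, lo: float, hi: float, iterations: int = 50) -> float:
    """Root of a decreasing function with func(lo) > 0 > func(hi)."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if func(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def critical_temperature(g_nu: float, omega_d: float = 1.0) -> float:
    """Temperature at which Eq. (6.28) is solved by Delta -> 0."""
    return bisect(lambda t: gap_integral(0.0, t, omega_d) - 1.0 / g_nu,
                  1e-4 * omega_d, omega_d)


def gap(temp: float, g_nu: float, omega_d: float = 1.0) -> float:
    """Self-consistent gap at temperature temp; zero above the transition."""
    def mismatch(d):
        return gap_integral(d, temp, omega_d) - 1.0 / g_nu
    if mismatch(1e-8 * omega_d) <= 0.0:
        return 0.0
    return bisect(mismatch, 1e-8 * omega_d, omega_d)


if __name__ == "__main__":
    g_nu = 0.3
    tc = critical_temperature(g_nu)
    print("T_c / (omega_D exp(-1/g nu)) =", tc / math.exp(-1.0 / g_nu))
    for fraction in (0.02, 0.5, 0.9, 0.99, 1.1):
        print(f"T/T_c = {fraction:4.2f}   Delta = {gap(fraction * tc, g_nu):.5f}")
```

Close to $T_c$ the computed gap should follow the square-root law (6.30), and above $T_c$ only the trivial solution $\Delta_0 = 0$ remains.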

Having explored the large-scale profile of the gap function, we next turn our attention to 
the vicinity of the superconductor transition, i.e. to temperature regions $\delta T = T_c - T \ll T_c$. 


Ginzburg-Landau theory 

In the vicinity of the phase transition, the gap parameter $\Delta$ is small (in comparison with the 
temperature). This presents the opportunity to perturbatively expand the action (6.27) in 
$\Delta$. We will see that the expansion reveals much about the character of the superconducting 
transition and the nature of the collective excitations. Further, the expansion will make the 
connection to the neutral superfluid (as well as important differences) explicit. 

To keep the structure of the theory as transparent as possible, we will continue to ignore 
the coupling to the external field. Our task thus reduces to computing the expansion of 
$\mathrm{tr}\ln\hat{\mathcal G}^{-1}$ in powers of $\Delta$. To facilitate the expansion, let us formally define $\hat{\mathcal G}_0^{-1} \equiv \hat{\mathcal G}^{-1}|_{\Delta=0}$ 
and set $\hat\Delta \equiv \begin{pmatrix} 0 & \Delta \\ \bar\Delta & 0 \end{pmatrix}$, so that 

$$\mathrm{tr}\ln\hat{\mathcal G}^{-1} = \mathrm{tr}\ln\left[\hat{\mathcal G}_0^{-1}\left(1 + \hat{\mathcal G}_0\hat\Delta\right)\right] = \underbrace{\mathrm{tr}\ln\hat{\mathcal G}_0^{-1}}_{\text{const.}} + \underbrace{\mathrm{tr}\ln\left[1 + \hat{\mathcal G}_0\hat\Delta\right]}_{-\sum_{n=1}^{\infty}\frac{1}{2n}\mathrm{tr}(\hat{\mathcal G}_0\hat\Delta)^{2n}}.$$

Here we have used the relation $\mathrm{tr}\ln[\hat A\hat B] = \mathrm{tr}\ln\hat A + \mathrm{tr}\ln\hat B$.$^{27}$ Further, note that only even 
contributions in $\hat\Delta$ survive. The constant contribution $\mathrm{tr}\ln\hat{\mathcal G}_0^{-1}$ recovers the free energy of 
the non-interacting electron gas and, for present purposes, provides an inessential contribution 
to the action. 

Figure 6.6 Measurements of ultrasonic attenuation in a superconductor provide access to the ratio 
$\Delta(T)/\Delta(0)$. The data here show the comparison of a tin alloy with the predicted square-root 
dependence of the mean-field theory. (Reprinted with permission from R. W. Morse and M. V. Bohm, 
Superconducting energy gap from ultrasonic attenuation measurements, Phys. Rev. 108 (1957), 
1094-6. Copyright (1957) by the American Physical Society.) 

To give this formal expansion some meaning, let us consider the second-order term in 
more detail. By substituting the explicit form of $\hat{\mathcal G}_0^{-1}$ it is straightforward to verify that 

$$-\frac{1}{2}\mathrm{tr}(\hat{\mathcal G}_0\hat\Delta)^2 = -\mathrm{tr}\left([\hat{\mathcal G}_0]_{11}\Delta[\hat{\mathcal G}_0]_{22}\bar\Delta\right) = -\sum_q\underbrace{\frac{T}{L^d}\sum_p[\hat{\mathcal G}_{0,p}]_{11}[\hat{\mathcal G}_{0,p-q}]_{22}}_{-\frac{T}{L^d}\sum_pG_pG_{-p+q}}\bar\Delta(q)\Delta(q),$$

where we have made use of the representation of the composite Green function $\hat{\mathcal G}_0$ in terms 
of the single-particle Green function $G^{(p)}_p = G_p$ and the hole Green function $G^{(h)}_p = -G^{(p)}_{-p} = 
-G_{-p}$ (cf. Eq. (6.26)). On combining with the first term in Eq. (6.27), we arrive at the 
quadratic action for the order parameter field, 

$$S^{(2)}[\Delta, \bar\Delta] = \sum_q\Gamma_q^{-1}|\Delta(q)|^2, \qquad \Gamma_q^{-1} = \frac{1}{g} - \frac{T}{L^d}\sum_pG_pG_{-p+q}. \qquad (6.31)$$

27 Notice that the relation $\mathrm{tr}\ln[\hat A\hat B] = \mathrm{tr}\ln\hat A + \mathrm{tr}\ln\hat B$ applies to non-commutative matrices. 

This is our second encounter with the vertex function $\Gamma^{-1}$: in our perturbative analysis 
of the Cooper channel (cf. Eq. (6.15)) we had identified the same expression. To appreciate 
the connection we should let the dust of our current technical operations settle and 
revisit the general philosophy of the Hubbard-Stratonovich scheme. The field $\Delta$ was introduced 
to decouple an attractive interaction in the Cooper channel. By analogy with the 
field $\phi$ used in the development of the RPA approximation to the direct channel, the action 
of the field $\Delta \sim \bar\psi_\uparrow\bar\psi_\downarrow$ can be interpreted as the "propagator" of the composite object 
$\bar\psi_\uparrow\bar\psi_\downarrow$, i.e. a quadratic contraction $\langle\bar\Delta\Delta\rangle$ describes propagation in the Cooper channel 
$\sim\langle\bar\psi_\uparrow\bar\psi_\downarrow\psi_\downarrow\psi_\uparrow\rangle$, as described by a four-point correlation function. This connection is made 
explicit by comparison of the quadratic action with the direct calculation of the Cooper 
four-point function above. 

However, in contrast to our discussion in Section 6.4 (where all we could do was to 
diagnose an instability as $\Gamma^{-1}_{q=0} \to 0$), we are now in a position to comprehensively explore 
the consequences of the symmetry breaking. Indeed, $\Gamma^{-1}_{q=0} \to 0$ corresponds to a sign change 
of the quadratic action of the constant order parameter mode $\Delta(q = 0)$. In the vicinity of 
this point, the constant contribution to the action must scale as $\sim(T - T_c)$, from which one 
may conclude that the action assumes the form 

$$S^{(2)}[\Delta, \bar\Delta] = \int d\tau\, d^dr\,\frac{r(T)}{2}|\Delta|^2 + O(\partial\Delta, \partial_\tau\Delta),$$

where $r(T) \sim T - T_c$ and $O(\partial\Delta, \partial_\tau\Delta)$ denotes temporal and spatial gradients whose role 
will be discussed shortly. 

EXERCISE Use Eq. (6.16) and the expansion 

$$n_F(\epsilon, T) - n_F(\epsilon, T_c) \simeq (T - T_c)\,\partial_T\big|_{T=T_c}n_F(\epsilon, T) = -\partial_\epsilon n_F(\epsilon, T_c)\,(T - T_c)\,\frac{\epsilon}{T_c},$$

to show that $r(T) = \nu t$ where $t = \frac{T - T_c}{T_c}$ defines the reduced temperature. 
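The linear vanishing of the quadratic coefficient can be seen numerically. The sketch below assumes that, after the Matsubara sum, the $q = 0$ pair bubble reduces to $\frac{T}{L^d}\sum_pG_pG_{-p} = \nu\int_0^{\omega_D}d\xi\,\tanh(\xi/2T)/\xi$ for a constant density of states. This reduction, the quadrature, the bracketing interval (adequate for $g\nu = 0.3$), and the function names are assumptions of the sketch.

```python
import math


def pair_bubble(temp: float, omega_d: float = 1.0, steps: int = 4000) -> float:
    """Midpoint estimate of the integral of tanh(xi/2T)/xi over xi in [0, omega_D]."""
    h = omega_d / steps
    return sum(math.tanh((i + 0.5) * h / (2.0 * temp)) / ((i + 0.5) * h)
               for i in range(steps)) * h


def inverse_vertex(temp: float, g_nu: float, nu: float = 1.0, omega_d: float = 1.0) -> float:
    """Gamma^{-1} at q = 0, i.e. 1/g - nu*pair_bubble(T), with g = g_nu/nu."""
    return nu * (1.0 / g_nu - pair_bubble(temp, omega_d))


def find_tc(g_nu: float, omega_d: float = 1.0, iterations: int = 50) -> float:
    """Bisection for the zero of the inverse vertex (negative below T_c, positive above)."""
    lo, hi = 1e-3 * omega_d, omega_d
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if inverse_vertex(mid, g_nu, omega_d=omega_d) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    g_nu = 0.3
    tc = find_tc(g_nu)
    for t in (-0.05, -0.02, 0.02, 0.05):
        value = inverse_vertex(tc * (1.0 + t), g_nu)
        print(f"t = {t:+.2f}   inverse vertex = {value:+.5f}   ratio to nu*t = {value / t:.4f}")
```

Within this sketch the $q = 0$ coefficient behaves as $\nu\ln(T/T_c) \simeq \nu t$ near the transition. It changes sign at $T_c$ with slope $\nu/T_c$, in line with the scaling $r(T) \sim T - T_c$ stated above.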


For temperatures below $T_c$, the quadratic action becomes unstable and - in direct analogy 
with our previous discussion of the superfluid condensate action - we have to turn to 
the fourth-order contribution, $S^{(4)}$, to ensure stability of the functional integral (see the 
figure below where the upper/lower surface corresponds to temperatures above/below the 
transition). At orders $n > 2$ of the expansion, spatial and temporal gradients can be safely 
neglected (due to the smallness of $\Delta \ll T$, they will certainly be smaller than the gradient 
contributions to $S^{(2)}$). For $\Delta = $ const., it is straightforward to verify that 

$$S^{(2n)} = \frac{1}{2n}\mathrm{tr}(\hat{\mathcal G}_0\hat\Delta)^{2n} = \frac{(-)^n}{n}\sum_p(G_pG_{-p})^n|\Delta|^{2n} = \text{const.}\times\nu|\Delta|^{2n}\sum_{\omega_n}\int_{-\omega_D}^{\omega_D}\frac{d\xi}{(\omega_n^2 + \xi^2)^n} = \text{const.}\times\nu|\Delta|^{2n}\sum_{\omega_n}\frac{1}{\omega_n^{2n-1}} = \text{const.}\times\nu T\left(\frac{|\Delta|}{T}\right)^{2n},$$

where "const." denotes only numerical constant factors. Once again, in the second 
equality, we have expressed the Gor'kov Green function through the respective particle and 
hole Green functions, and in the fourth equality we have noticed that, for $\omega_D \gg T$, the 
integral over the energy variable is dominated by the infrared divergence at small $\xi$, i.e. 
$\int_0^{\omega_D}d\xi\,(\omega_n^2 + \xi^2)^{-n} \simeq \int_0^{\infty}d\xi\,(\omega_n^2 + \xi^2)^{-n} \sim \omega_n^{1-2n}$. This estimate 
tells us that the contributions of higher order to the expansion are (i) positive and 
(ii) small in the parameter $|\Delta|/T \ll 1$. This being so, it is sufficient to retain only the 
fourth-order term (to counterbalance the unstable second-order term). We thus arrive at 
the effective action for the order parameter, 

$$S[\Delta, \bar\Delta] = \int d\tau\, d^dr\left[\frac{r}{2}\bar\Delta\Delta + u(\bar\Delta\Delta)^2 + O(\partial\Delta, \partial_\tau\Delta, |\Delta|^6)\right], \qquad (6.32)$$

valid in the vicinity of the transition.$^{28}$ In particular, one may note (see the figure above) 
that the dependence of the action on (a constant) $\Delta$ mimics closely that of the condensate 
amplitude. 



INFO It is straightforward to include finite spatial gradients in the derivation of the quadratic 
action $S^{(2)}$ (for details see Problem 6.7). The resulting action for static but spatially fluctuating 
configurations $\Delta(\mathbf r)$ takes the form 

$$S_{\rm GL}[\Delta, \bar\Delta] = \beta\int d^dr\left[\frac{r}{2}|\Delta|^2 + \frac{c}{2}|\partial\Delta|^2 + u|\Delta|^4\right], \qquad (6.33)$$

where $c \sim \rho_0(v_F/T)^2$. This result is known as the (classical) Ginzburg-Landau action of 
the superconductor. It is termed "classical" because (cf. our remarks on page 255) temporal 
fluctuations of $\Delta$ have been ignored. Notice that the form of the action might have been anticipated 
on symmetry grounds alone. Indeed, Eq. (6.33) was proposed by Ginzburg and Landau as an 
effective action for superconductivity some years before the advent of the microscopic theory.$^{29}$ 
A generalization of the action to include temporal fluctuations leads to the time-dependent 
Ginzburg-Landau theory, to be discussed below. 

28 Here, the coupling constant $u \sim \rho T^{-3}$ shows only a weak temperature dependence in the vicinity of the 
transition. 

29 V. L. Ginzburg and L. D. Landau, On the theory of superconductivity, Zh. Eksp. Teor. Fiz. 20 (1950), 1064-82. 

Equation (6.32) makes the connection between the superconductor and superfluid explicit 
(cf. the condensate action (6.9)). Above $T_c$, $r > 0$ and the unique mean-field configuration 
extremizing the action (6.32) is given by $\Delta = 0$. However, below the critical temperature, 
$r < 0$ and a configuration with non-vanishing Cooper pair amplitude $\Delta_0$ is energetically 
favorable: 

$$\left.\frac{\delta S[\Delta,\bar\Delta]}{\delta\Delta}\right|_{\Delta=\Delta_0} = 0 \;\Rightarrow\; \bar\Delta_0\left(\frac{r}{2} + 2u|\Delta_0|^2\right) = 0 \;\Rightarrow\; |\Delta_0| = \sqrt{\frac{-r}{4u}} \sim \sqrt{T_c(T_c - T)},$$

cf. the previous estimate Eq. (6.30). As with the superfluid, the mean-field equation 
determines only the modulus of the order parameter while the phase remains unspecified. Below 
the transition, the symmetry of the action will be broken by the formation of a ground state 
with fixed global phase, e.g. $\Delta_0 \in \mathbb{R}$. This entails the formation of a phase-like Goldstone 
mode $\theta$, where configurations $\Delta = e^{2i\theta}\Delta_0$ explore deviations from the reference ground 
state. 30 Pursuing further the parallels with the superfluid, it would be tempting to conjecture 
that these phase fluctuations have a linear dispersion, i.e. that the system supports 
dissipationless supercurrents of charged particles: superconductivity. 
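As a quick numerical illustration of the mean-field condition, the following sketch compares the analytic minimiser $|\Delta_0| = \sqrt{-r/4u}$ with a brute-force scan of the homogeneous density $f = \frac{r}{2}|\Delta|^2 + u|\Delta|^4$. The function names, the grid scan and the sample values of $r$ and $u$ are illustrative assumptions, not part of the original derivation.

```python
import math


def gl_density(delta, r, u):
    """Homogeneous Ginzburg-Landau density f = (r/2)|Delta|^2 + u|Delta|^4."""
    return 0.5 * r * delta ** 2 + u * delta ** 4


def mean_field_modulus(r, u):
    """Analytic minimiser: |Delta_0| = sqrt(-r / 4u) for r < 0, and 0 otherwise."""
    return math.sqrt(-r / (4.0 * u)) if r < 0 else 0.0


def scan_minimum(r, u, delta_max=3.0, steps=60_000):
    """Brute-force scan of f on a uniform grid (hypothetical helper, illustration only)."""
    best_delta, best_f = 0.0, gl_density(0.0, r, u)
    for k in range(1, steps + 1):
        delta = delta_max * k / steps
        f = gl_density(delta, r, u)
        if f < best_f:
            best_delta, best_f = delta, f
    return best_delta


if __name__ == "__main__":
    u = 1.0
    for r in (0.5, -0.5, -2.0):  # r > 0: above T_c; r < 0: below T_c
        print(f"r={r:+.1f}  analytic={mean_field_modulus(r, u):.4f}  "
              f"scan={scan_minimum(r, u):.4f}")
```

For $r > 0$ the scan stays at $\Delta = 0$, while for $r < 0$ it should land on the analytic value up to the grid spacing. The phase of $\Delta_0$ does not enter $f$ at all, in line with the statement that only the modulus is fixed.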

However, at this point, we have overstretched the analogies to our previous discussion. In 
fact, the argument above ignores the fact that the symmetry broken by the ground state of 
the superfluid was a global phase U(1). However, as explained on page 276, the microscopic 
action of the superconductor possesses a more structured local gauge U(1) symmetry. As 
we will discuss presently, this difference implies drastic phenomenological consequences. 




Action of the Goldstone mode 

The ramifications of the local gauge symmetry can only be explored in conjunction with 
the electromagnetic field $(\phi, \mathbf{A})$. We must, therefore, return to the ancestor action (6.27) 
where $\mathcal{G}$ now represents the full Gor'kov Green function (6.25). For the present, there is no 
need to specify the origin of the electromagnetic field; it might represent an external 
experimental probe, or the background electromagnetic field controlled by the vacuum action, 
i.e. $S_{\rm e.m.} = \frac{1}{4}\int d\tau\, d^dr\, F_{\mu\nu}F_{\mu\nu}$, where $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$ is the electromagnetic field 
tensor. 31 However, throughout, we will assume that the field is weak enough not to destroy 
the superconductivity, i.e. the mean modulus of the order parameter is still given by the 
value $\Delta_0$, as described by the analysis of the previous section. 

How, then, might an action describing the interplay of the phase degree of freedom and 
the electromagnetic field look? Below we will derive such an action explicitly by starting 
from the prototype Eq. (6.27). However, for the moment, let us stay on a less rigorous level 


30 The motivation for transforming by $2\theta$ is that, under a gauge transformation $\psi \to e^{i\theta}\psi$, the composite field 
$\Delta \sim \psi\psi$ should acquire two phase factors. However, the introduction of that multiplicity factor is, of course, 
just a matter of convention; it can always be removed by a rescaling of the field $\theta$. 

Notice that we are working within the framework of imaginary (Euclidean) time field theory, i.e. the definition 
of the field strength tensor does not involve the Minkowski metric (cf. the discussion on page 107). 
and try to determine the structure of the action by symmetry reasoning. In doing so, we 
will be guided by a number of principles: 

> The phase $\theta$ is a Goldstone mode, i.e. the action cannot contain terms that do not vanish 
in the limit $\theta \to$ const. 

> We will assume that gradients acting on the phase $\theta$ (but not necessarily the magnitude 
of the phase) and the electromagnetic potentials are small. That is, we will be content 
with determining the structure of the action to lowest order in these quantities. 

> By symmetry, the action must not contain terms with an odd number of derivatives or 
mixed gradients of the type $\partial_\tau\theta\,\nabla\theta$. Respecting the character of the microscopic model, 
the action must be rotationally invariant. 

> The action must be invariant under the local gauge transformation Eq. (6.24). 


It may be confirmed that the first three criteria would be satisfied by the trial action 

$$S[\theta] = \int d\tau\, d^dr\left[c_1(\partial_\tau\theta)^2 + c_2(\nabla\theta)^2\right],$$

where $c_1$ and $c_2$ are constants. However, such an action is clearly not invariant under a 
gauge shift of the phase, $\theta(\tau,\mathbf{r}) \to \theta(\tau,\mathbf{r}) + \varphi(\tau,\mathbf{r})$. It can, however, be endowed with that 
quality by minimal substitution of the electromagnetic potential, i.e. 32 

$$S[\theta, A] = \int d\tau\, d^dr\left[c_1(\partial_\tau\theta + \phi)^2 + c_2(\nabla\theta - \mathbf{A})^2\right]. \tag{6.34}$$


To second order in gradients, this action uniquely describes the energy cost associated 
with phase fluctuations. When combined with the action $S_{\rm e.m.}$ controlling fluctuations of 
the field $(\phi, \mathbf{A})$, it should provide a general description of the low-energy electromagnetic 
properties of the superconductor. Notice, however, that the present line of argument does 
not fix the coupling constants $c_{1,2}$. In particular, one cannot exclude the possibility that 
$c_1$ or $c_2$ vanishes (as would be the case, for example, in a non-superconducting system, 
cf. the problem set). To determine the values of $c_{1,2}$, we need either to derive the action 
microscopically or to invoke further phenomenological input (see the Info block below). 
Either way one obtains $c_2 = n_s/2m$ and $c_1 = \nu$, where we have defined $n_s$ as the density of 
the Cooper pair condensate. (For a precise definition, see below.) In the following section 
we use the action (6.34) as a starting point to discuss the characteristic and remarkable 
electromagnetic phenomena displayed by superconductors. 
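The role of minimal substitution can be checked at a single space-time point with a few lines of Python. The sketch below works in one spatial dimension and assumes that the local gauge transformation acts as $\theta \to \theta + \chi$, $\phi \to \phi - \partial_\tau\chi$, $A \to A + \partial_x\chi$, which is the form under which the combinations $\partial_\tau\theta + \phi$ and $\nabla\theta - \mathbf{A}$ are unchanged. The gauge function $\chi$, the sample field values and the constants $c_{1,2}$ are arbitrary illustrative choices.

```python
import math


def density(dtheta_dtau, dtheta_dx, phi, a, c1=1.3, c2=0.7):
    """Integrand of the trial action: c1 (d_tau theta + phi)^2 + c2 (d_x theta - A)^2."""
    return c1 * (dtheta_dtau + phi) ** 2 + c2 * (dtheta_dx - a) ** 2


def chi_gradients(tau, x):
    """Derivatives (d_tau chi, d_x chi) of the sample gauge function chi = sin(tau) cos(2x)."""
    return math.cos(tau) * math.cos(2 * x), -2.0 * math.sin(tau) * math.sin(2 * x)


def compare(tau, x, dtheta_dtau=0.3, dtheta_dx=-1.1, phi=0.8, a=0.25):
    """Return (change of gauged density, change of ungauged density) under the shift."""
    dchi_t, dchi_x = chi_gradients(tau, x)
    # Action with potentials: theta, phi and A are all transformed.
    gauged_before = density(dtheta_dtau, dtheta_dx, phi, a)
    gauged_after = density(dtheta_dtau + dchi_t, dtheta_dx + dchi_x,
                           phi - dchi_t, a + dchi_x)
    # Action without potentials: only the phase can be shifted.
    bare_before = density(dtheta_dtau, dtheta_dx, 0.0, 0.0)
    bare_after = density(dtheta_dtau + dchi_t, dtheta_dx + dchi_x, 0.0, 0.0)
    return gauged_after - gauged_before, bare_after - bare_before


if __name__ == "__main__":
    gauged_change, bare_change = compare(0.4, 0.9)
    print("change with potentials   :", gauged_change)
    print("change without potentials:", bare_change)
```

The density containing the potentials is unchanged up to rounding, whereas the purely $\theta$-dependent trial action changes by a finite amount.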


INFO Beginning with coefficient $c_2$, let us briefly discuss the phenomenological derivation 
of the coupling constants. The starting point is the observation that the functional derivative, 
$\langle\delta S/\delta\mathbf{A}(\tau,\mathbf{r})\rangle = \langle\mathbf{j}(\tau,\mathbf{r})\rangle$, generates the expectation value of the current density operator. This relation 
follows quite generally from the fact that a vector potential couples to the action of a system of 
charged particles $i = 1, \ldots, N$ through the relation 

$$S_A \equiv \int d\tau \sum_i \dot{\mathbf{r}}_i\cdot\mathbf{A}(\mathbf{r}_i) = \int d\tau\, d^dr \sum_i \delta(\mathbf{r} - \mathbf{r}_i)\,\dot{\mathbf{r}}_i\cdot\mathbf{A}(\tau,\mathbf{r}).$$


To keep the notation simple, we will henceforth set $e = 1$. 
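However, $\mathbf{j} = \delta S_A/\delta\mathbf{A}(\tau,\mathbf{r}) = \sum_i \delta(\mathbf{r} - \mathbf{r}_i(\tau))\,\dot{\mathbf{r}}_i(\tau)$ is just the definition of the total particle current 
density. Indeed, it is straightforward to verify that, on the microscopic level (cf., e.g., Eq. (6.22)), 

$$\frac{\delta S[A]}{\delta\mathbf{A}(\tau,\mathbf{r})} = -\frac{1}{2m}\left(\bar\psi_\sigma\left[(-i\nabla - \mathbf{A})\psi_\sigma\right] + \left[(i\nabla - \mathbf{A})\bar\psi_\sigma\right]\psi_\sigma\right) \equiv \mathbf{j},$$

where $\mathbf{j}$ is the quantum current density operator. Staying for a moment on the microscopic side 
of the theory, let us assume that a certain fraction of the formerly uncorrelated electronic states 
participate in the condensate, i.e. one may write $\mathbf{j} = \mathbf{j}_n + \mathbf{j}_s$, where $\mathbf{j}_n$, the current carried by 
the normal states of the system, will not be of further concern to us, while $\mathbf{j}_s$ is the 
"supercurrent" carried by the condensate. Let us further assume that those states $\psi_s$ participating in the 
condensate carry a collective phase $\theta$ with a non-vanishing average, i.e. $\psi_s = e^{i\theta}\tilde\psi_s$, where the 
states $\tilde\psi_s$ do not carry any structured phase information (the residual phase carried by the local 
amplitude $\tilde\psi_s$ tends, on average, to zero). Then, concentrating on the phase information carried 
by the condensate and neglecting density fluctuations, 

$$\langle\mathbf{j}_s\rangle \simeq -\frac{n_s}{m}\langle\nabla\theta - \mathbf{A}\rangle,$$

where $n_s \equiv \bar{\tilde\psi}_s\tilde\psi_s$ is the density of the condensate. 

Now, let us evaluate the fundamental relation $\langle\delta S/\delta\mathbf{A}\rangle = \langle\mathbf{j}\rangle$ on our trial action with its 
undetermined coupling constants: 

$$\langle\mathbf{j}(\tau,\mathbf{r})\rangle = \left\langle\frac{\delta S[\theta,A]}{\delta\mathbf{A}(\tau,\mathbf{r})}\right\rangle = -2c_2\,\langle\nabla\theta(\tau,\mathbf{r}) - \mathbf{A}(\tau,\mathbf{r})\rangle.$$
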
Comparison with the phenomenological estimate for the expectation value of the (super)current 
operator above leads to the identification $c_2 = n_s/2m$. Turning to the coupling constant $c_1$, let 
us assume that the electron system has been subjected to a weak external potential perturbation 
$\phi(\tau,\mathbf{r})$. Assuming that the potential fluctuates slowly enough to allow an adiabatic adjustment 
of the electron density (i.e. that it acts as a local modulation of the chemical potential), the 
particle density of the system would vary as 

$$\delta n(\tau,\mathbf{r}) = \delta n(\mu + \phi(\tau,\mathbf{r})) \approx \frac{\partial n}{\partial\mu}\,\phi(\tau,\mathbf{r}) \approx \nu\,\phi(\tau,\mathbf{r}),$$

where we have approximately 33 identified $\partial_\mu n$ with the single-particle density of states $\nu$. The 
potential energy corresponding to the charge modulation is given by $\int d^dr\,\phi(\tau,\mathbf{r})\,\delta n(\tau,\mathbf{r}) = \partial_\mu n\int d^dr\,\phi^2(\tau,\mathbf{r})$. 
Comparing this expression with our trial action - which contains the time-integrated 
potential - we conclude that $c_1 = \partial_\mu n = \nu$. 

Despite the integrity of the phenomenological arguments given above, some may feel ill at ease 
with all the liberal distribution of “assumes.” To complement this discussion, let us now recover 
the phase action using an explicit microscopic derivation. The construction detailed below 
represents a typical yet, until now, our most advanced “case study” of a low-energy quantum field 
theory. Although formulated for the specific example of the BCS superconductor, many of its 
structural sub-units appear in other applications in basically the same form. This “universality” 
is our prime motivation for presenting the lengthy construction of the low-energy phase action of 


33 For systems with strong inter-particle correlations, the thermodynamic density of states $\partial_\mu n$ 
may deviate significantly from the single-particle density of states $\nu = \frac{1}{L^d}\sum_a \delta(\epsilon_a - \mu)$. 
the superconductor in some detail. Our starting point is the Gor'kov Green function appearing 
under the "tr ln" of the microscopic action (6.27), 

$$\mathcal{G}^{-1} = \begin{pmatrix} -\partial_\tau - i\phi - \frac{1}{2m}(-i\nabla - \mathbf{A})^2 + \mu & \Delta_0 e^{2i\theta} \\ \Delta_0 e^{-2i\theta} & -\partial_\tau + i\phi + \frac{1}{2m}(-i\nabla + \mathbf{A})^2 - \mu \end{pmatrix},$$

coupled to the full electromagnetic potential. To simplify the analysis, we have set the modulus 
of the order parameter to its constant mean-field value $\Delta_0$, i.e. concentrating on the Goldstone 
mode, we will neglect massive fluctuations $\Delta = \Delta_0 + \delta\Delta$ around the extremum of the free energy. 

We next make use of the gauge freedom inherent in the theory to remove the phase dependence 
of the order parameter field. To do so, we introduce the unitary matrix $\hat U \equiv \mathrm{diag}(e^{-i\theta}, e^{i\theta})$, and 
transform the Green function as 

$$\mathcal{G}^{-1} \to \hat U\mathcal{G}^{-1}\hat U^\dagger = \begin{pmatrix} -\partial_\tau - i\tilde\phi - \frac{1}{2m}(-i\nabla - \tilde{\mathbf{A}})^2 + \mu & \Delta_0 \\ \Delta_0 & -\partial_\tau + i\tilde\phi + \frac{1}{2m}(-i\nabla + \tilde{\mathbf{A}})^2 - \mu \end{pmatrix},$$

where the transformed electromagnetic potential is given by (exercise) $\tilde\phi = \phi + \partial_\tau\theta$, $\tilde{\mathbf{A}} = \mathbf{A} - \nabla\theta$. 
(Reading the transformation in reverse, we conclude that - an important physical fact that should 
be remembered - the superconductor order parameter field is a gauge non-invariant quantity. 
Under gauge transformations it transforms as $\Delta \to e^{2i\theta}\Delta$, as suggested by the definition of 
$\Delta \sim \psi\psi$ as a pairing field. This fact implies that the order parameter itself cannot be an 
experimentally accessible observable. 34 ) Owing to the unitary invariance of the trace, $\mathrm{tr}\ln\mathcal{G}^{-1} = 
\mathrm{tr}\ln(\hat U\mathcal{G}^{-1}\hat U^\dagger)$, the gauge transformed and the original Green function equivalently 
represent the theory. 

Save for the neglect of massive fluctuations $\delta\Delta$, our treatment of the theory thus far has been 
exact. However, to make further progress, we must resort to some approximations: assuming 
that both the electromagnetic potential and spatio-temporal fluctuations of the phase mode are 
small, let us expand the action in powers of $(\tilde\phi, \tilde{\mathbf{A}})$. In the literature, expansions of this type are 
known as gradient expansions, i.e. we are performing an expansion where the gradients $\partial_\tau\theta$, 
$\nabla\theta$ and not the phase degree of freedom $\theta$ are assumed to be small. (Owing to its Goldstone 
mode character, $\theta$ can slide freely over the entire interval $[0, 2\pi]$.) 

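To facilitate the expansion, it will be useful to represent the $2\times 2$ matrix structure of the 
Green function through a Pauli matrix expansion: 

$$\begin{aligned}\mathcal{G}^{-1} &= -\sigma_0\partial_\tau - \sigma_3\left(i\tilde\phi + \frac{1}{2m}(-i\nabla - \tilde{\mathbf{A}}\sigma_3)^2 - \mu\right) + \sigma_1\Delta_0 \\ &= \underbrace{-\sigma_0\partial_\tau - \sigma_3\left(-\frac{\nabla^2}{2m} - \mu\right) + \sigma_1\Delta_0}_{\mathcal{G}_0^{-1}}\;\underbrace{-\,i\sigma_3\tilde\phi - \frac{i}{2m}\sigma_0[\nabla,\tilde{\mathbf{A}}]_+}_{-X_1}\;\underbrace{-\,\sigma_3\frac{1}{2m}\tilde{\mathbf{A}}^2}_{-X_2},\end{aligned}$$

where we have defined $\sigma_0 \equiv 1$ as the unit matrix. Expressed in terms of these quantities, the 
expansion of the action to second order in the field $\tilde A$ takes the form 

$$\begin{aligned}S[\tilde A] &= -\mathrm{tr}\ln\left(\mathcal{G}_0^{-1} - X_1 - X_2\right) = \mathrm{const.} - \mathrm{tr}\ln\left(1 - \mathcal{G}_0[X_1 + X_2]\right) \\ &= \mathrm{const.} + \mathrm{tr}\left(\mathcal{G}_0X_1\right) + \mathrm{tr}\left(\mathcal{G}_0X_2 + \frac{1}{2}\mathcal{G}_0X_1\mathcal{G}_0X_1\right) + \cdots,\end{aligned} \tag{6.35}$$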


This follows from the fundamental doctrine of electrodynamics that gauge transformations must not cause 
observable effects. 
where we have used the fact that $X_{1,2}$ are of first and second order in the field, respectively. 
(Structures of this type appear frequently in the construction of low-energy quantum field theories 
of many-body systems, i.e., after the introduction of some auxiliary field $\phi$ through a suitably 
devised Hubbard-Stratonovich transformation, the microscopic Bose/Fermi degrees of freedom 
of the theory can be integrated out and one arrives at an action $\pm\mathrm{tr}\ln(\mathcal{G}_0^{-1} + X[\phi])$, where $\mathcal{G}_0$ 
is the non-interacting Green function of the problem and $X[\phi]$ is an operator depending on the 
new field. An expansion of the logarithm to first and second orders in $X$ then leads to structures 
similar to those given above.) 

Written more explicitly, the first-order action $S^{(1)}$ takes the form (exercise) 

$$S^{(1)}[\tilde A] = \frac{T}{L^d}\sum_p \mathrm{tr}\left[\mathcal{G}_{0,p}X_1(p,p)\right] = \frac{T}{L^d}\sum_p \mathrm{tr}\left[\mathcal{G}_{0,p}\left(i\sigma_3\tilde\phi_0 - \frac{1}{m}\sigma_0\,\mathbf{p}\cdot\tilde{\mathbf{A}}_0\right)\right],$$

where the subscripts 0 refer to the zero-momentum components of the fields $\tilde\phi$ and $\tilde{\mathbf{A}}$. Since the 
Green function $\mathcal{G}_0$ is even in the momentum, the second contribution $\propto \mathbf{p}$ vanishes by symmetry. 
Further $(\partial_\tau\theta)_0 = 0\cdot\theta_0 = 0$, i.e. $\tilde\phi_0 = \phi_0$ and $S^{(1)}[\tilde A] = \frac{iT}{L^d}\sum_p\left([\mathcal{G}_{0,p}]_{11} - [\mathcal{G}_{0,p}]_{22}\right)\phi_0$, where the 
indices refer to particle-hole space. To understand the meaning of this expression, notice that 
$[\mathcal{G}_{0,p}]_{11} = -\langle\psi_{\uparrow,p}\bar\psi_{\uparrow,p}\rangle_0 = \langle\bar\psi_{\uparrow,p}\psi_{\uparrow,p}\rangle_0$ gives the expectation value of the spin-up electron density operator 
against a fixed order-parameter background. Similarly, $-[\mathcal{G}_{0,p}]_{22} = -\langle\psi_{\downarrow,p}\bar\psi_{\downarrow,p}\rangle_0 = 
+\langle\bar\psi_{\downarrow,p}\psi_{\downarrow,p}\rangle_0$ gives the spin-down density. Summation over frequencies and momenta recovers 
the full electron density: $\frac{T}{L^d}\sum_p\left([\mathcal{G}_{0,p}]_{11} - [\mathcal{G}_{0,p}]_{22}\right) = \frac{N}{L^d}$, or 

$$S^{(1)}[\tilde A] = \frac{iN}{L^d}\phi_0 = \frac{iN}{L^d}\int d\tau\, d^dr\,\phi(\tau,\mathbf{r}).$$

Thus, the first contribution to our action simply describes the electrostatic coupling of the scalar 
potential to the total charge of the electron system. However, as with the Coulomb potential 
discussed earlier, the "correct" interpretation of this expression should rather suggest $S^{(1)} = 0$. 
That is, the total electrostatic interaction of the potential with the electron system must be - 
by the overall charge neutrality of the system - compensated by an equally strong interaction 
with the positive counter charge of the ions (usually excluded for notational convenience). 

We thus turn to the discussion of the second-order contribution to the action $S^{(2)}$. The term 
containing $X_2$ is reminiscent in structure of the $S^{(1)}$ contribution discussed before. Thus, 
replacing $X_1$ by $X_2$, we may immediately infer that 

$$\mathrm{tr}(\mathcal{G}_0X_2) = \frac{n}{2m}\int d\tau\int d^dr\,\tilde{\mathbf{A}}^2(\tau,\mathbf{r}), \tag{6.36}$$

where we have defined $n \equiv N/L^d$ as the total particle density. This contribution, known as the 
diamagnetic term, derives from the familiar diamagnetic contribution $\frac{1}{2m}\mathbf{A}^2$ to the electron 
Hamiltonian. If it were only the diamagnetic contribution, an external field would lead to an 
increase of the energy. However, to obtain the complete picture, we need to include the magnetic 
field dependence of the operator $X_1$. 

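Substituting for $X_1$, and noting that cross-terms $\sim \tilde\phi\,\mathbf{p}\cdot\tilde{\mathbf{A}}$, being odd in momenta, vanish on 
integration, one obtains 

$$\frac{1}{2}\mathrm{tr}\left(\mathcal{G}_0X_1\mathcal{G}_0X_1\right) = \frac{T}{2L^d}\sum_{p,q}\mathrm{tr}\left(-\mathcal{G}_{0,p}\sigma_3\tilde\phi_q\,\mathcal{G}_{0,p}\sigma_3\tilde\phi_{-q} + \frac{1}{m^2}\,\mathcal{G}_{0,p}\sigma_0\,\mathbf{p}\cdot\tilde{\mathbf{A}}_q\,\mathcal{G}_{0,p}\sigma_0\,\mathbf{p}\cdot\tilde{\mathbf{A}}_{-q}\right).$$
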



Here, noting that we are already working at the second order of the expansion, the residual 
dependence of the Green functions $\mathcal{G}_0$ on the small momentum variable $q$ has been neglected. 35 
Alluding to its origin, i.e. the paramagnetic operator $\sim [\mathbf{p}, \mathbf{A}]_+$ of the electron Hamiltonian, 
the magnetic contribution to this expression is called the paramagnetic term. Paramagnetic 
contributions to the action describe a lowering of the energy in response to external magnetic 
fields, i.e. the diamagnetic and the paramagnetic term act in competition. 

To proceed, it is convenient to change from an explicit matrix representation of the Gor'kov 
Green function to an expansion in terms of Pauli matrices: 36 

$$\mathcal{G}_{0,p} = \left[i\sigma_0\omega_n - \sigma_3\xi_p + \sigma_1\Delta_0\right]^{-1} = \frac{1}{\omega_n^2 + \xi_p^2 + \Delta_0^2}\left[-i\sigma_0\omega_n - \sigma_3\xi_p + \sigma_1\Delta_0\right]. \tag{6.37}$$
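The inversion formula for the Gor'kov Green function is easy to confirm numerically. The sketch below builds the $2\times 2$ matrix $i\sigma_0\omega_n - \sigma_3\xi_p + \sigma_1\Delta_0$ and the proposed closed-form inverse as plain nested lists and checks that their product is the unit matrix. The helper names and the sample values of $\omega_n$, $\xi_p$ and $\Delta_0$ are illustrative assumptions.

```python
def gorkov_inverse(omega, xi, delta0):
    """2x2 matrix i*omega*sigma_0 - xi*sigma_3 + delta0*sigma_1."""
    return [[1j * omega - xi, delta0],
            [delta0, 1j * omega + xi]]


def gorkov_closed_form(omega, xi, delta0):
    """(-i*omega*sigma_0 - xi*sigma_3 + delta0*sigma_1) / (omega^2 + xi^2 + delta0^2)."""
    norm = omega ** 2 + xi ** 2 + delta0 ** 2
    return [[(-1j * omega - xi) / norm, delta0 / norm],
            [delta0 / norm, (-1j * omega + xi) / norm]]


def matmul2(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def max_deviation_from_identity(omega, xi, delta0):
    """Largest entry of |G^{-1} G - 1|; should vanish up to rounding."""
    product = matmul2(gorkov_inverse(omega, xi, delta0),
                      gorkov_closed_form(omega, xi, delta0))
    return max(abs(product[i][j] - (1.0 if i == j else 0.0))
               for i in range(2) for j in range(2))


if __name__ == "__main__":
    print(max_deviation_from_identity(0.7, -0.3, 1.2))
```

The check works because the off-diagonal Pauli matrices anticommute with $\sigma_3$, so the product collapses to $(\omega_n^2 + \xi_p^2 + \Delta_0^2)\,\sigma_0$.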


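On substituting into the equation above, noting that, for any rotationally invariant function 
$F(\mathbf{p}^2)$ (exercise), $\sum_p(\mathbf{p}\cdot\mathbf{v})(\mathbf{p}\cdot\mathbf{v}')F(\mathbf{p}^2) = \frac{\mathbf{v}\cdot\mathbf{v}'}{d}\sum_p\mathbf{p}^2F(\mathbf{p}^2)$, one obtains 

$$\frac{1}{2}\mathrm{tr}\left(\mathcal{G}_0X_1\mathcal{G}_0X_1\right) = -\frac{T}{L^d}\sum_{p,q}\frac{1}{(\omega_n^2 + \lambda_p^2)^2}\left[\tilde\phi_q\tilde\phi_{-q}\left(-\omega_n^2 + \lambda_p^2 - 2\Delta_0^2\right) - \frac{\mathbf{p}^2}{d\,m^2}\,\tilde{\mathbf{A}}_q\cdot\tilde{\mathbf{A}}_{-q}\left(-\omega_n^2 + \lambda_p^2\right)\right],$$

where, as before, $\lambda_p^2 = \xi_p^2 + \Delta_0^2$. We now substitute this result together with the diamagnetic 
contribution Eq. (6.36) back into the expansion (6.35), partially transform back to real space, 
$\sum_q f_qf_{-q} = \int d\tau\, d^dr\, f^2(\tau,\mathbf{r})$, and arrive at the action 

$$S[\tilde A] = \int d\tau\, d^dr\left\{\underbrace{-\frac{T}{L^d}\sum_p\frac{-\omega_n^2 + \lambda_p^2 - 2\Delta_0^2}{(\omega_n^2 + \lambda_p^2)^2}}_{c_1}\,\tilde\phi^2(\tau,\mathbf{r}) + \underbrace{\left[\frac{n}{2m} + \frac{1}{d\,m^2}\frac{T}{L^d}\sum_p\frac{\mathbf{p}^2\left(-\omega_n^2 + \lambda_p^2\right)}{(\omega_n^2 + \lambda_p^2)^2}\right]}_{c_2}\,\tilde{\mathbf{A}}^2(\tau,\mathbf{r})\right\}.$$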


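This intermediate result identifies the coupling constants $c_{1,2}$. The last step of the derivation, 
i.e. the sum over the "fast" momenta $p$, is now a relatively straightforward exercise. Beginning 
with the frequency summations, one may note that the denominator has two isolated poles of 
second order at $\omega_n = \pm i\lambda_p$. Applying the standard summation rules it is then straightforward 
to verify that (exercise) 

$$T\sum_n\frac{-\omega_n^2 + \lambda_p^2 - 2\Delta_0^2}{(\omega_n^2 + \lambda_p^2)^2} = -\frac{\Delta_0^2}{2\lambda_p^3}\left[1 - 2n_F(\lambda_p)\right] - \left(1 - \frac{\Delta_0^2}{\lambda_p^2}\right)\beta\,n_F(\lambda_p)\left[1 - n_F(\lambda_p)\right] \approx -\frac{\Delta_0^2}{2\lambda_p^3},$$

$$T\sum_n\frac{-\omega_n^2 + \lambda_p^2}{(\omega_n^2 + \lambda_p^2)^2} = -\beta\,n_F(\lambda_p)\left[1 - n_F(\lambda_p)\right],$$

where the approximation in the first line holds for $T \ll \Delta_0$, when $n_F(\lambda_p) \to 0$. 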
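The two fermionic frequency sums can be checked by direct summation. The sketch below assumes the closed forms $T\sum_n(-\omega_n^2 + \lambda^2)/(\omega_n^2 + \lambda^2)^2 = -\beta n_F(\lambda)[1 - n_F(\lambda)]$ and $T\sum_n(-\omega_n^2 + \lambda^2 - 2\Delta_0^2)/(\omega_n^2 + \lambda^2)^2 = -\frac{\Delta_0^2}{2\lambda^3}[1 - 2n_F(\lambda)] - (1 - \Delta_0^2/\lambda^2)\,\beta n_F(\lambda)[1 - n_F(\lambda)]$, both of which follow from the standard result $T\sum_n(\omega_n^2 + \lambda^2)^{-1} = \tanh(\lambda/2T)/2\lambda$ by differentiation with respect to $\lambda$. The truncation `n_max` and the sample parameters are illustrative choices.

```python
import math


def fermi(energy, T):
    """Fermi function n_F(energy) at temperature T (k_B = 1)."""
    return 1.0 / (math.exp(energy / T) + 1.0)


def matsubara_sum(numerator, lam, T, n_max=200_000):
    """T * sum_n numerator(w_n) / (w_n^2 + lam^2)^2 with w_n = (2n+1) pi T.

    The numerator is assumed even in w_n, so only n >= 0 is summed and the
    result doubled. The truncation error falls off like 1/n_max.
    """
    total = 0.0
    for n in range(n_max):
        w = (2 * n + 1) * math.pi * T
        total += numerator(w) / (w * w + lam * lam) ** 2
    return 2.0 * T * total


def check_sums(xi=0.4, delta0=1.0, T=0.7):
    """Return ((lhs, rhs) for the magnetic sum, (lhs, rhs) for the potential sum)."""
    lam = math.sqrt(xi ** 2 + delta0 ** 2)
    nf = fermi(lam, T)
    peak = nf * (1.0 - nf) / T  # beta * n_F * (1 - n_F)
    lhs_a = matsubara_sum(lambda w: -w * w + lam ** 2, lam, T)
    rhs_a = -peak
    lhs_p = matsubara_sum(lambda w: -w * w + lam ** 2 - 2.0 * delta0 ** 2, lam, T)
    rhs_p = (-delta0 ** 2 * (1.0 - 2.0 * nf) / (2.0 * lam ** 3)
             - (1.0 - delta0 ** 2 / lam ** 2) * peak)
    return (lhs_a, rhs_a), (lhs_p, rhs_p)


if __name__ == "__main__":
    for lhs, rhs in check_sums():
        print(f"numerical sum = {lhs:+.6f}   closed form = {rhs:+.6f}")
```

Because the summands decay only like $1/\omega_n^2$, the direct sum converges slowly; a contour-integral evaluation would be the method of choice in practice, and the brute-force sum is used here only as an independent check.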


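i.e. we have set $\sum_{pq}\left(\mathcal{G}_p\tilde\phi_q\mathcal{G}_{p+q}\tilde\phi_{-q}\right) \approx \sum_{pq}\left(\mathcal{G}_p\tilde\phi_q\mathcal{G}_p\tilde\phi_{-q}\right)$. 

In working with matrix operators it is useful to keep in mind the matrix identity 
$[v_0\sigma_0 + \mathbf{v}\cdot\boldsymbol\sigma]^{-1} = \frac{1}{v_0^2 - \mathbf{v}^2}[v_0\sigma_0 - \mathbf{v}\cdot\boldsymbol\sigma]$, where $v = (v_0, \mathbf{v})$ is a four-component vector of coefficients. Other 
useful identities include ($i, j = 1, 2, 3$; $\mu = 0, 1, 2, 3$) $\sigma_i^2 = 1$, and for $i \neq j$: $[\sigma_i, \sigma_j]_+ = 0$, $\sigma_i\sigma_j = i\epsilon_{ijk}\sigma_k$; further $\mathrm{tr}\,\sigma_\mu = 2\delta_{\mu,0}$. 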
Figure 6.7 Plot of the function $\beta\,n_F(\sqrt{\xi^2 + \Delta_0^2})\left[1 - n_F(\sqrt{\xi^2 + \Delta_0^2})\right]$ as a function of the 
dimensionless scales $T/\Delta_0$ and $\xi/\Delta_0$. For $T/\Delta_0 \to 0$, the function vanishes ($\to$ perfect diamagnetic 
response). For $T/\Delta_0 \gg 1$, the function traces out a peak of width $\propto T$ and total weight $\int d\xi\,\beta\,n_F(1 - n_F) = 1$ 
($\to$ cancellation of diamagnetic and paramagnetic response). At intermediate temperatures, 
$\int d\xi\,\beta\,n_F(1 - n_F) < 1$, resulting in a partial survival of diamagnetism. 


Thus, one obtains the coupling constant of the potential contribution, 

$$c_1 = \frac{1}{2L^d}\sum_p\frac{\Delta_0^2}{\lambda_p^3} \simeq \nu,$$

in accord with the previous estimate. With the magnetic contribution, the situation is more 
interesting. Converting the momentum sum to an energy integral, one obtains 

$$c_2 = \frac{n}{2m} - \frac{\nu\mu}{dm}\int d\xi\,\beta\,n_F(\lambda)\left[1 - n_F(\lambda)\right],$$

where we have noted that the integrand is strongly peaked at the Fermi surface, i.e. that the 
factor $\mathbf{p}^2 \approx 2m\mu$ can be removed from under the integral. 

This expression illustrates the competition between the diamagnetic and paramagnetic 
contributions in the magnetic response of the system. At low temperatures, $T \ll \Delta_0$, the positivity of 
$\lambda_p = (\Delta_0^2 + \xi_p^2)^{1/2} \geq \Delta_0$ implies that $n_F(\lambda_p) \approx 0$, i.e. approximate vanishing of the integral (see 
Fig. 6.7). Under these conditions, $c_2 \approx \frac{n}{2m}$ is weighted by the total density of the electron 
gas, and the response of the system is governed by the diamagnetic term alone. Indeed, the 
diamagnetic response is known to be a hallmark of superconductivity; the superconductor tends 
to expel magnetic fields, a phenomenon that culminates in the Meissner effect to be discussed 
shortly. 
By contrast, for high temperatures, $T \gg \Delta_0$, the integral extends over energy domains much 
larger than $\Delta_0$ and we can approximate 

$$\int d\xi\,\beta\,n_F(\lambda)\left[1 - n_F(\lambda)\right] \approx \int_{-\infty}^{\infty} d\xi\,\beta\,n_F(\xi)\left[1 - n_F(\xi)\right] = -\int_{-\infty}^{\infty} d\xi\,\partial_\xi n_F(\xi) = n_F(-\infty) - n_F(\infty) = 1,$$

to obtain 37 $c_2 \approx \frac{n}{2m} - \frac{\nu\mu}{dm} = 0$. The near 38 cancellation of dia- and paramagnetic contributions 
is typical for the response of normal conducting systems to external magnetic fields. 

At intermediate temperatures, the integral over the Fermi functions leads to a partial 
cancellation of the diamagnetic response. It is common to express that fact through the notation 
$c_2 = \frac{n_s}{2m}$, where the parameter 

$$n_s \equiv n - \frac{2\nu\mu}{d}\int d\xi\,\beta\,n_F(\lambda)\left[1 - n_F(\lambda)\right],$$

is known as the superfluid density. Historically, the concept of a "superfluid density" was 
introduced prior to the BCS theory, when a phenomenological model known as the two-fluid 
model presented the "state-of-the-art" understanding of superconductivity. (Remember that 
the experimental discovery of superconductivity preceded its microscopic description by more 
than four decades!) The basic picture underlying this approach was that, below the transition, 
a fraction of the electron system condenses into a dissipationless superfluid of density $n_s$, while 
the rest of the electrons remain in the state of a "normal" Fermi liquid of density $n_n = n - n_s$. 
This simple model provided a phenomenological explanation of a large number of properties 
characteristic of superconductivity, prior to the development of the microscopic BCS theory. 
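To get a feeling for the temperature dependence of the superfluid density, the integral over the peak function $\beta n_F(\lambda)[1 - n_F(\lambda)]$, $\lambda = \sqrt{\xi^2 + \Delta_0^2}$, can be evaluated numerically. The sketch below makes two simplifying assumptions that are not part of the derivation above: the gap $\Delta_0$ is held fixed rather than determined self-consistently as a function of $T$, and the free-electron relation $n = 2\nu\mu/d$ is used so that $n_s/n = 1 - \int d\xi\,\beta n_F(1 - n_F)$. Function names and the integration cutoff are illustrative.

```python
import math


def peak_function(xi, T, delta0):
    """beta * n_F(lam) * (1 - n_F(lam)) with lam = sqrt(xi^2 + delta0^2),
    written as 1 / (4 T cosh^2(lam / 2T)) to avoid overflow in exp."""
    x = math.sqrt(xi * xi + delta0 * delta0) / (2.0 * T)
    if x > 300.0:  # cosh^2 would overflow; the integrand is negligible here
        return 0.0
    return 1.0 / (4.0 * T * math.cosh(x) ** 2)


def paramagnetic_weight(T, delta0=1.0, steps=20_000):
    """Trapezoidal estimate of the xi-integral on the window [-40 T, 40 T]."""
    xi_max = 40.0 * T
    h = 2.0 * xi_max / steps
    total = 0.5 * (peak_function(-xi_max, T, delta0) + peak_function(xi_max, T, delta0))
    for k in range(1, steps):
        total += peak_function(-xi_max + k * h, T, delta0)
    return total * h


def superfluid_fraction(T, delta0=1.0):
    """n_s / n = 1 - integral, assuming n = 2 nu mu / d (free electron gas)."""
    return 1.0 - paramagnetic_weight(T, delta0)


if __name__ == "__main__":
    for T in (0.05, 0.3, 1.0, 10.0):  # temperatures in units of delta0
        print(f"T/Delta0 = {T:5.2f}   n_s/n = {superfluid_fraction(T):.6f}")
```

At $T \ll \Delta_0$ the fraction approaches one (diamagnetic term alone), while for $T \gg \Delta_0$ it drops towards zero (near cancellation of the diamagnetic and paramagnetic terms), reproducing the two limits discussed in the text.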

However, notwithstanding its success and its appealing simplicity, the two-fluid notion of a 
complete condensation $n_s \xrightarrow{T\to 0} n$ at low temperatures cannot be maintained. Indeed, we have 
seen that BCS superconductivity is a Fermi surface phenomenon, i.e. the bulk of the electrons 
are oblivious to the existence of an attraction mechanism at energies $\mu \pm \omega_D$ and, therefore, will 
not enter a condensed state. Instead, our microscopic analysis produces a picture more subtle 
than the mere superposition of two fluids: as we saw above, the diamagnetic (paramagnetic) 
contribution to the response is provided by all quasi-particles (at the Fermi surface). In a normal 
metal, or, equivalently, a superconductor at $T \gg \Delta_0$, quasi-particle excitations at the Fermi 
surface conspire to cancel the diamagnetic contribution of all other quasi-particles. However, 
at $T \ll \Delta_0$ the existence of a quasi-particle energy gap at the Fermi surface blocks that 
compensation mechanism and a net diamagnetic signal remains. The far-reaching phenomenological 
consequences of the sustained diamagnetic contribution are discussed in the next section. 


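The last equality follows straightforwardly from the two definitions 

$$\begin{pmatrix}\nu \\ n\end{pmatrix} = \frac{2}{L^d}\sum_p\begin{pmatrix}\delta(\mu - \epsilon_p) \\ \Theta(\mu - \epsilon_p)\end{pmatrix} = \frac{2}{(2\pi)^d}\int d^dp\begin{pmatrix}\delta(\mu - \epsilon_p) \\ \Theta(\mu - \epsilon_p)\end{pmatrix},$$

where $\epsilon_p = \mathbf{p}^2/2m$ denotes the free dispersion. 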


Going beyond second lowest order of perturbation theory in $\mathbf{A}$, a careful analysis of the coupling of a (small) 
magnetic field to the orbital degrees of freedom of the Fermi gas shows that the cancellation of diamagnetic 
and paramagnetic contributions is not perfect. The total response of the system is described by a weak 
diamagnetic contribution, $\chi_d$, a phenomenon known as Landau diamagnetism. The diamagnetic orbital response is 
overcompensated by Pauli paramagnetism, i.e. the three times larger paramagnetic response of the 
Zeeman-coupled electron spin, $\chi_p = -3\chi_d$. For large magnetic fields, the situation changes totally, and more pronounced 
effects such as Shubnikov-de Haas oscillations or even the quantum Hall effect are observed. 
Validity of the gradient expansion: Before leaving this section let us discuss one last 
technical point. Above we have - without much of a justification - expanded the phase action 
up to leading order in the gradients $\partial_\tau\theta$ and $\nabla\theta$. Indeed, why is such a truncation permissible? 
This question arises whenever low-energy effective theories are derived from a microscopic parent 
theory by expanding in slow fluctuations, and it is worthwhile to address it in a general setting. 
Let us suppose we had performed some kind of Hubbard-Stratonovich transformation to describe 
a system of interest in terms of an action $S[\phi]$. Let us further assume that the action is invariant 
under a shift of the field by a constant, $\phi(x) \to \phi(x) + \phi_0$, i.e. that the action depends only on 
gradients $\nabla\phi$. (To keep the notation simple, we do not explicitly distinguish between spatial and 
temporal gradients.) An expansion of the action in the field gradients then leads to a series of 
the formal momentum-space structure 

$$S \sim N\sum_q\left[(l_0q)^2\phi_q\phi_{-q} + (l_0q)^4\phi_q\phi_{-q} + \cdots\right],$$

where $N$ represents the large parameter of the theory, 39 $l_0$ is some microscopic reference scale 
of [length] needed to make the action dimensionless, and the ellipses stand for terms of higher 
order in $q$ and/or $\phi_q$. Now, using the fact that only field configurations with $S \sim 1$ significantly 
contribute to the field integral, we obtain the estimate $\phi_q \sim 1/(\sqrt{N}\,l_0q)$ from the leading-order term 
of the action. This means that terms of higher order in the field variable, $N(l_0q\phi_q)^{n>2} \sim N^{1-n/2}$, 
are small in inverse powers of the large parameter of the theory and can be neglected. Similarly, 
terms like $N(l_0q)^{n>2}\phi_q\phi_{-q} \sim (l_0q)^{n-2}$. As long as we are interested in large-scale fluctuations 
on scales $q^{-1} \gg l_0$, these terms, too, can be neglected. 
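The power counting can be made concrete with a few numbers. The sketch below inserts the typical amplitude $\phi_q \sim 1/(\sqrt{N}\,l_0q)$ into generic terms of the form $N(l_0q)^a\phi_q^b$ and prints their size. The function name and the sample values $N = 10^4$, $l_0q = 0.1$ are illustrative assumptions.

```python
def term_size(N, l0q, field_power, gradient_power):
    """Size of N * (l0 q)^gradient_power * phi_q^field_power, with the typical
    amplitude phi_q ~ 1 / (sqrt(N) * l0 q) fixed by the leading term being O(1)."""
    phi_q = 1.0 / (N ** 0.5 * l0q)
    return N * l0q ** gradient_power * phi_q ** field_power


if __name__ == "__main__":
    N, l0q = 1.0e4, 0.1
    print("leading (l0 q)^2 phi^2 :", term_size(N, l0q, 2, 2))
    print("quartic (l0 q phi)^4   :", term_size(N, l0q, 4, 4))
    print("higher  (l0 q)^4 phi^2 :", term_size(N, l0q, 2, 4))
```

With these values the leading term is of order one, the quartic field term is suppressed by $1/N$, and the higher-gradient term by $(l_0q)^2$, which shows that the two suppressions are controlled by two different small parameters.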

Notice that our justification for neglecting terms of higher order relies on two independent 
parameters: large $N$ and the smallness of the scaling factor $ql_0$. If $N = 1$ but still $ql_0 \ll 1$, 
terms involving two gradients but large powers of the field $\sim q^2\phi_q^{n>2}$ are no longer negligible. 
Conversely, if $N \gg 1$ but one is interested in scales $ql_0 \simeq 1$, terms of second order in the field 
weighted by a large number of gradients $\sim q^{n>2}\phi_q^2$ must be taken into account. An incorrect 
treatment of this point has been the source of numerous errors in the published literature! 


Meissner effect and Anderson-Higgs mechanism 

If you ask a person on the street to give a one-line definition of superconductivity, the 
answer will probably be that superconductors are materials showing no electrical resistance. 
However, to a physicist, that definition should not be altogether satisfactory. It highlights 
only one of many remarkable features of superconductors and does not have any predictive 
power. A better - if for most people incomprehensible - attempt at a definition would be 
to say that superconductivity arises when the quantum phases of a macroscopically large 
number of charged particles get locked into a collective degree of freedom. Indeed, that 
is exactly what the action (6.34) tells us: fluctuations of the phase of the condensate are 
penalized by a cost that scales with the (superfluid) density of the electron gas. This is to 
be contrasted to the situation in a normal metal where the action - the cancellation of the 


In theories containing a large parameter N, that parameter mostly appears as a constant multiplying all, or 
at least several, operators of the action. For example, in the fermionic problems discussed above, where N was 
proportional to the density of states at the Fermi surface, the action contained a trace over all momentum 
states. The summation over these states then led to an overall factor N multiplying the action. 
diamagnetic and the paramagnetic contribution! - usually does not contain the gradients 
of phase-like degrees of freedom. Indeed, the macroscopic phase rigidity of the BCS ground 
state wavefunction suffices to explain a large number of non-trivial phenomena related to 
superconductivity. To see this, let us consider a simplified version of the action (6.34). 
Specifically, let us assume (i) that the temperature is high enough to exclude quantum 
fluctuations of the phase, $\partial_\tau\theta = 0$, and (ii) that there are no electric fields acting on our 
superconductor, $\phi = 0$, $\partial_\tau\mathbf{A} = 0$. 

INFO Why do we relate the presence of quantum fluctuations to temperature? Let us 
reiterate (cf. the remarks made on page 255) that a temporally constant field variable acts like a 
classical degree of freedom. To understand why large temperatures inhibit temporal fluctuations 
superimposed on the classical sector, let us take the phase action of the superconductor as an 
example and write 

$$S[\theta] = \nu\int d\tau\int d^dr\,(\partial_\tau\theta)^2 + S_{\rm cl}[\theta],$$

where the first term determines the temporal fluctuation behavior of the phase field, while $S_{\rm cl}$ is 
the "classical" contribution to the action, i.e. the contribution independent of time derivatives. 
Switching to a frequency representation, 

$$S[\theta] = \nu\sum_{\omega_m}\int d^dr\,\omega_m^2\,\theta_m\theta_{-m} + S_{\rm cl}[\theta],$$

from which one can infer that quantum fluctuations, i.e. modes $\theta_{m\neq 0}$, become inessential at 
large temperatures. More specifically, modes with non-vanishing Matsubara frequency can be 
neglected when the quantum fluctuation energy (density) $\nu\omega_m^2 \propto T^2$ exceeds the characteristic 
energy scales appearing in $S_{\rm cl}[\theta]$. 

To understand heuristically the temperature scaling of the fluctuation energy, remember that 
the Bose field $\theta(\tau)$ obeys periodic boundary conditions $\theta(0) = \theta(\beta)$. Inspection of Fig. 6.8 then 
shows that increasing the temperature, i.e. squeezing of the imaginary time interval $[0, \beta]$, leads 
to a linear increase of the gradients $\partial_\tau\theta \propto T$. Accordingly, the squared gradient appearing in the 
action increases quadratically with $T$. This mechanism confirms the intuitive expectation that 
quantum fluctuations - i.e. fundamentally a low-energy phenomenon - should be damped out at 
increasing temperature. 
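The quadratic temperature scaling of the quantum fluctuation cost follows directly from the bosonic Matsubara frequencies $\omega_m = 2\pi mT$. The following minimal sketch, with illustrative function names and the coupling $\nu$ set to one, evaluates the weight $\nu\omega_1^2$ of the lowest non-static mode at a few temperatures.

```python
import math


def bosonic_matsubara(m, T):
    """Bosonic Matsubara frequency omega_m = 2 pi m T (k_B = hbar = 1)."""
    return 2.0 * math.pi * m * T


def lowest_quantum_cost(T, nu=1.0):
    """nu * omega_1^2: weight of the first non-static mode in the phase action."""
    return nu * bosonic_matsubara(1, T) ** 2


if __name__ == "__main__":
    for T in (0.5, 1.0, 2.0, 4.0):
        print(f"T = {T:3.1f}   nu * omega_1^2 = {lowest_quantum_cost(T):8.2f}")
```

Doubling the temperature quadruples the cost of the lowest quantum mode, while the static mode $m = 0$ is unaffected and carries the classical sector.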


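Figure 6.8 Qualitative picture behind the quadratic $\sim T^2$ energy increase of quantum fluctuations. 
As the temperature $T$ is increased, the imaginary time interval $[0, \beta = T^{-1}]$ gets squeezed. The 
same happens to the temporal profiles of (quantum) fluctuating modes $\theta(\tau)$. Consequently, 
gradients $\partial_\tau\theta \propto T$ increase linearly with temperature, and the energy density $\sim (\partial_\tau\theta)^2 \propto T^2$ grows 
quadratically. 


Under these conditions, the action simplifies to 

$$S[\mathbf{A},\theta] = \frac{\beta}{2}\int d^dr\left[\frac{n_s}{m}(\nabla\theta - \mathbf{A})^2 + (\nabla\wedge\mathbf{A})^2\right],$$

where we have explicitly included the action $\frac{1}{4}\int d\tau\int d^dr\,F_{\mu\nu}F_{\mu\nu} \xrightarrow{\phi = 0,\ \mathbf{A}\ \mathrm{static}} \frac{\beta}{2}\int d^dr\,(\nabla\wedge\mathbf{A})^2 = \frac{\beta}{2}\int d^dr\,\mathbf{B}^2$ 
due to fluctuations of the magnetic field. As pointed out above, the action 
is invariant under the gauge transformation $\mathbf{A} \to \mathbf{A} + \nabla\phi$, $\theta \to \theta + \phi$. One thus expects 
that integration over all realizations of $\theta$ - a feasible task since the action is quadratic - 
will produce a purely $\mathbf{A}$-dependent, and gauge invariant, action $S[\mathbf{A}]$. The integration over 
$\theta$ is most transparently formulated in momentum space where the action assumes the form 

$$\begin{aligned}S[\mathbf{A},\theta] &= \frac{\beta}{2}\sum_q\left(\frac{n_s}{m}(i\mathbf{q}\theta_q - \mathbf{A}_q)\cdot(-i\mathbf{q}\theta_{-q} - \mathbf{A}_{-q}) + (\mathbf{q}\wedge\mathbf{A}_q)\cdot(\mathbf{q}\wedge\mathbf{A}_{-q})\right) \\ &= \frac{\beta}{2}\sum_q\left(\frac{n_s}{m}\left[\theta_q\mathbf{q}^2\theta_{-q} - 2i\theta_q\,\mathbf{q}\cdot\mathbf{A}_{-q} + \mathbf{A}_q\cdot\mathbf{A}_{-q}\right] + (\mathbf{q}\wedge\mathbf{A}_q)\cdot(\mathbf{q}\wedge\mathbf{A}_{-q})\right).\end{aligned}$$

The integration over the field components $\theta_q$ is now straightforward and leads to an effective 
action for $\mathbf{A}$: $e^{-S[\mathbf{A}]} \equiv \int D\theta\,e^{-S[\mathbf{A},\theta]}$, where 

$$S[\mathbf{A}] = \frac{\beta}{2}\sum_q\left(\frac{n_s}{m}\left[\mathbf{A}_q\cdot\mathbf{A}_{-q} - \frac{(\mathbf{q}\cdot\mathbf{A}_q)(\mathbf{q}\cdot\mathbf{A}_{-q})}{\mathbf{q}^2}\right] + (\mathbf{q}\wedge\mathbf{A}_q)\cdot(\mathbf{q}\wedge\mathbf{A}_{-q})\right).$$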

To bring this result into a more transparent form, let us split the vector potential into a 
longitudinal and a transverse component: 

$$\mathbf{A}_q = \underbrace{\mathbf{A}_q - \frac{\mathbf{q}(\mathbf{q}\cdot\mathbf{A}_q)}{\mathbf{q}^2}}_{\mathbf{A}_q^\perp} + \underbrace{\frac{\mathbf{q}(\mathbf{q}\cdot\mathbf{A}_q)}{\mathbf{q}^2}}_{\mathbf{A}_q^\parallel}. \tag{6.38}$$

To motivate this decomposition, notice the following: 

> The transverse component alone determines physical quantities, i.e. the magnetic field. 
(This follows from the relation $\mathbf{B}_q = i\mathbf{q}\wedge\mathbf{A}_q$ and $\mathbf{q}\wedge\mathbf{q} = 0$.) 

> The transverse component is gauge invariant under transformations $\mathbf{A}_q \to \mathbf{A}_q + i\mathbf{q}\phi_q$ 
(since $(\mathbf{q}\phi_q)^\perp = 0$). In the language of longitudinal-transverse components, the Coulomb 
gauge corresponds to a configuration where $\mathbf{A}_q^\parallel = 0$. 

> The terminology "longitudinal component" emphasizes the fact that $\mathbf{F}_q^\parallel$ is the 
projection of a vector field $\mathbf{F}_q$ onto the argument vector $\mathbf{q}$. Correspondingly, the "transverse 
component" is the orthogonal complement of the longitudinal component. 

Applying some elementary rules of vector algebra, it is straightforward to verify that the 
effective action can be represented in the simple form 

$$S[\mathbf{A}] = \frac{\beta}{2}\sum_q\left(\frac{n_s}{m} + \mathbf{q}^2\right)\mathbf{A}_q^\perp\cdot\mathbf{A}_{-q}^\perp. \tag{6.39}$$
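The "elementary rules of vector algebra" can be verified numerically for random vectors. The sketch below compares, mode by mode, the summand $\frac{n_s}{m}\left[\mathbf{A}_q\cdot\mathbf{A}_{-q} - (\mathbf{q}\cdot\mathbf{A}_q)(\mathbf{q}\cdot\mathbf{A}_{-q})/\mathbf{q}^2\right] + (\mathbf{q}\wedge\mathbf{A}_q)\cdot(\mathbf{q}\wedge\mathbf{A}_{-q})$ obtained after integrating out $\theta$ with the compact form $\left(\frac{n_s}{m} + \mathbf{q}^2\right)\mathbf{A}_q^\perp\cdot\mathbf{A}_{-q}^\perp$. It assumes a real field $\mathbf{A}(\mathbf{r})$, so that $\mathbf{A}_{-q} = \mathbf{A}_q^*$; the helper names, the value of $n_s/m$ and the random sampling are illustrative.

```python
import random


def dot(a, b):
    """Bilinear (non-conjugating) dot product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def conj(v):
    return [complex(z).conjugate() for z in v]


def transverse(q, A):
    """A_perp = A - q (q.A) / q^2 for a real wavevector q."""
    q2, qA = dot(q, q), dot(q, A)
    return [a - qi * qA / q2 for a, qi in zip(A, q)]


def summand_before(q, A, ns_over_m):
    """(n_s/m) [A_q.A_-q - (q.A_q)(q.A_-q)/q^2] + (q x A_q).(q x A_-q)."""
    Am = conj(A)  # A_{-q} = conj(A_q) for a real-valued field A(r)
    return (ns_over_m * (dot(A, Am) - dot(q, A) * dot(q, Am) / dot(q, q))
            + dot(cross(q, A), cross(q, Am)))


def summand_after(q, A, ns_over_m):
    """(n_s/m + q^2) A_perp(q) . A_perp(-q), the summand of the compact form."""
    At = transverse(q, A)
    return (ns_over_m + dot(q, q)) * dot(At, conj(At))


def max_mismatch(trials=100, seed=0):
    """Largest difference between the two forms over random (q, A_q)."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(trials):
        q = [rng.uniform(-2, 2) for _ in range(3)]
        A = [complex(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(3)]
        worst = max(worst, abs(summand_before(q, A, 0.8) - summand_after(q, A, 0.8)))
    return worst


if __name__ == "__main__":
    print("largest mismatch:", max_mismatch())
```

The agreement rests on two identities: $\mathbf{A}\cdot\mathbf{A}^* - |\mathbf{q}\cdot\mathbf{A}|^2/\mathbf{q}^2 = |\mathbf{A}^\perp|^2$ and $|\mathbf{q}\wedge\mathbf{A}|^2 = \mathbf{q}^2|\mathbf{A}^\perp|^2$. The first term is the mass contribution $\propto n_s/m$, which survives in the limit $\mathbf{q} \to 0$.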

At this stage, it is useful to pause and review what has been achieved: (i) starting from 
a composite action containing the Goldstone mode $\theta$ and the gauge field $\mathbf{A}$, we have arrived 
at an action for the gauge field alone. In a sense, the Goldstone mode has been absorbed 
into (the gauge degrees of freedom of) $\mathbf{A}$. However, (ii) the coupling to the Goldstone mode 
has not left the gauge field unaffected. Indeed, $S[\mathbf{A}]$ has acquired a mass term proportional 
to the superfluid density, i.e., unlike the vacuum, the action of long-range field fluctuations 
$\mathbf{A}_{q\to 0}$ no longer vanishes. That modification has serious phenomenological consequences to 
be discussed shortly. (iii) The action is manifestly gauge invariant. 

INFO The analysis above shows that the spontaneous breaking of a global U(1) symmetry
and that of a local U(1) gauge symmetry lead to very different results. In the former case (→ neutral
superfluids), the soft action $S[\theta]$ of a phase-like Goldstone mode $\theta$ describes various long-range
phenomena, such as supercurrent formation, etc. In the latter case (→ superconductors or,
more generally, charged superfluids), the system is described by a composite action $S[\mathbf{A},\theta]$.
The ubiquitous gauge symmetry can then be employed to absorb the Goldstone mode into the
gauge field, $S[\mathbf{A},\theta] \to S[\mathbf{A}]$. The most important effect of the coupling
$\mathbf{A} \leftrightarrow \theta$ is that, after integration over the latter, the former acquires a mass
term. One may say that “the photon
(vector potential) field has consumed the Goldstone mode to become massive.” That principal
mechanism was understood in 1964 by Higgs, wherefore it is called the Higgs mechanism
or, crediting Anderson’s pioneering discussion of gauge symmetry breaking in the context of
superconductivity, the Anderson-Higgs mechanism.

Mass generation due to spontaneous gauge symmetry breaking is a very general phenomenon,
i.e. not limited to the relatively simple context of the collective phase of the superconductor.
The Higgs mechanism found perhaps its most significant application in 1967 when Weinberg and
Salam embedded it into their unified theory of electromagnetic and weak interactions in particle
physics. Although this is not a book about elementary particles, the role of the Higgs principle
in the theory of electroweak interactions is of such fundamental importance to our understanding
of the microscopic world that it is irresistible to briefly discuss its implications.$^{40}$

The standard model of high-energy physics describes the microscopic world in terms of
a few generations of leptonic (electrons, $e$, electron neutrino, $\nu_e$, muon, $\mu$, etc.) and hadronic
(the quarks, $u$, $d$, $s$, $c$, $t$, $b$) elementary particles that interact through the quanta of certain
gauge fields. In its original formulation, the model had one severe problem, that is, it did not
know how to attribute mass to these particles. However, this stands in stark contrast to any
kind of experimental observation. In particular, the quanta of the gauge fields of the weak and
strong interactions are known to be extremely heavy, with rest masses of $O(10^2\,\mathrm{GeV}/c^2)$. In
view of the much lighter masses of typical composite hadrons - the proton weighs 938 MeV/$c^2$ -
the mass of the gauge quanta can certainly not be explained in terms of some fictitious fine
structure mechanisms superimposed on the core of the standard model, i.e. a major modification


40 For a pedagogical and much less superficial discussion we refer, e.g., to L. H. Ryder, Quantum Field Theory
(Cambridge University Press, 1996).
was needed. It is now widely believed that the “true” principle behind mass generation lies in
the (Anderson-) Higgs mechanism, i.e. the spontaneous breakdown of a gauge symmetry. 

To sketch the principal idea of Weinberg and Salam, let us concentrate on a leptonic subsector
of the theory, e.g. consider the two-component object

$$\Psi \equiv \begin{pmatrix} \nu_e \\ e \end{pmatrix},$$

comprising an electron neutrino and a (left-handed$^{41}$) electron. Involving a charged particle,
the Lagrangian controlling the dynamics of $\Psi$ will surely possess a local U(1) gauge symmetry.
However, on top of that, Weinberg and Salam proposed a much more far-reaching symmetry
structure. Without going into detail we just mention that, building on principles proposed earlier
(1954) by Yang and Mills, transformations

$$\Psi(x) \to U(x)\Psi(x), \qquad U(x) \in \mathrm{SU}(2), \qquad x \in \mathbb{R}^4,$$

locally mixing the two components $\nu_e$ and $e$ of the “isospinor” $\Psi$ were introduced as a symmetry
of the model, i.e. in analogy to the local U(1) gauge symmetry of quantum mechanics, it was
postulated that the action $S[\Psi]$ possesses a local gauge symmetry under the group of SU(2)
transformations. In combination with the standard U(1), the theory had thus been endowed
with a composite U(1) × SU(2) gauge structure. Physically, declaring a symmetry between the
electron - interacting through electromagnetic forces - and the neutrino - weak interactions - was
tantamount to a fusion of these types of interaction, i.e. the proposal of a theory of electroweak
interactions.

How can a theory defined through an action $S[\Psi] = \int d^{d+1}x\, \mathcal{L}(\Psi, \partial_\mu\Psi)$, containing the
isospinor $\Psi$ and its derivatives, be made invariant under non-abelian SU(2) gauge transformations?
Referring for a more systematic discussion to Ryder,$^{42}$ let us briefly sketch the principal
idea of non-abelian gauge theory. We first notice that a fermion bilinear $\sim \bar\Psi\partial_\mu\Psi$ is generally
not gauge invariant. Under a mapping $\Psi \to U\Psi$ it transforms to $\bar\Psi(\partial_\mu + U^{-1}\partial_\mu U)\Psi$. E.g.
for $U = e^{i\phi} \in \mathrm{U}(1)$ a “standard” gauge transformation, the extra term $U^{-1}\partial_\mu U = i\partial_\mu\phi \in i\mathbb{R}$
would be the ordinary “derivative of a function,” familiar from the gauge structure of quantum
mechanics. More generally, for a non-abelian gauge transformation by an element $U \in G$ of a
general group (e.g. $G = \mathrm{SU}(2)$), the gauge term $U^{-1}\partial_\mu U \in \mathfrak{g}$ is an element of the Lie algebra $\mathfrak{g}$
of the group,$^{43}$ i.e. the action picks up a matrix-valued extra contribution.

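To make the theory invariant, we have to introduce a gauge field, i.e. we generalize from $\partial_\mu$
to a covariant derivative $\partial_\mu - iW_\mu$ where $W_\mu \in \mathfrak{g}$. For example, for $G = \mathrm{U}(1)$, $W_\mu \equiv A_\mu \in \mathbb{R}$ is
the ordinary gauge field of quantum mechanics; for $G = \mathrm{SU}(2)$, $W_\mu = \sum_{a=1}^{3}\alpha^a_\mu(x)\sigma^a \in \mathfrak{su}(2)$,
etc. Under a gauge transformation $\Psi \to U\Psi$, the field transforms as
$W_\mu \to U W_\mu U^{-1} - i(\partial_\mu U)U^{-1}$,
i.e. in a way that makes the covariant bilinear, $\bar\Psi(\partial_\mu - iW_\mu)\Psi$, invariant. For $G = \mathrm{U}(1)$ and
$U = e^{i\phi}$, $A_\mu \to U A_\mu U^{-1} - i(\partial_\mu U)U^{-1} = A_\mu + \partial_\mu\phi$ reduces to its familiar form. However, in the non-abelian case,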


41 When viewed as a relativistic particle, the electron field has components of left and right chirality, but we shall
not need to discuss that aspect any further.

42 Ryder, Quantum Field Theory.

43 The fact that $U^{-1}\partial_\mu U \in \mathfrak{g}$ takes values in the Lie algebra of the gauge group can be proven by geometric
considerations (for which we refer to textbooks of group theory). By way of example, consider an SU(2)-valued
field $U(x) = \exp(i\sum_{a=1}^{3}\alpha^a(x)\sigma^a)$. It is straightforward to convince oneself that $U^{-1}\partial_\mu U$ is (up to an
overall factor of $i$) a linear combination of Pauli matrices with real-valued coefficients, i.e. it takes values in
the Lie algebra su(2).
the full structure on the right-hand side is needed to obtain invariance. The full action of the
gauge theory then takes the form

$$S[\Psi, W] = \int d^{d+1}x\, \mathcal{L}(\Psi, (\partial_\mu - iW_\mu)\Psi) + S[W],$$

where the Lagrangian density contains the minimally coupled gauge field, and the action $S[W]$
describes the fluctuation behavior of $W_\mu$.$^{44}$ Within a fully quantum mechanical setting, both the
“matter field” $\Psi$ and the gauge field $W_\mu$ are quantized. The field quanta of the non-abelian gauge
field are described as vector bosons, where the attribute “vector” (somewhat misleadingly)
refers to the higher-dimensional geometry of the field, and “boson” emphasizes the generally
bosonic statistics of a quantum gauge field.
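
The footnote remark that $U^{-1}\partial_\mu U$ takes values in the Lie algebra su(2) can be probed numerically. The sketch below is only an illustration under stated assumptions: the parameter functions `alpha(x)` are hypothetical smooth functions of a single coordinate, the derivative is approximated by central differences, and all function names are invented for this example. It checks that $-iU^{-1}\partial_x U$ is traceless with real Pauli coefficients, i.e. that $U^{-1}\partial_x U$ is $i$ times a real combination of Pauli matrices.

```python
import numpy as np

# Pauli matrices sigma^1, sigma^2, sigma^3
SIGMA = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)

def alpha(x):
    """Hypothetical smooth parameter functions alpha^a(x), a = 1, 2, 3."""
    return np.array([np.sin(x), 0.5 * x**2, np.cos(2.0 * x)])

def U(x):
    """U(x) = exp(i alpha.sigma) = cos|alpha| 1 + i sin|alpha| (alpha.sigma)/|alpha|."""
    a = alpha(x)
    norm = np.linalg.norm(a)
    a_dot_sigma = np.einsum("a,aij->ij", a, SIGMA)
    return np.cos(norm) * np.eye(2) + 1j * np.sin(norm) * a_dot_sigma / norm

def gauge_term(x, h=1e-6):
    """-i U^{-1} dU/dx, with the derivative taken by central differences."""
    dU = (U(x + h) - U(x - h)) / (2.0 * h)
    return -1j * np.linalg.inv(U(x)) @ dU

def pauli_coefficients(M):
    """Coefficients c_a of a traceless 2x2 matrix M = sum_a c_a sigma^a."""
    return np.array([0.5 * np.trace(M @ s) for s in SIGMA])

W = gauge_term(0.3)
c = pauli_coefficients(W)
print("trace:", abs(np.trace(W)))
print("largest imaginary part of a Pauli coefficient:", np.max(np.abs(c.imag)))
```

Both printed numbers should vanish up to finite-difference error, reflecting the fact that a unitary matrix field of unit determinant has an anti-Hermitian, traceless logarithmic derivative.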

We now have everything in place to turn back to the particular context of the electroweak
interaction. Within the framework of the gauge theory, interactions between the particles $e$
and $\nu_e$ are mediated by the gauge field (as with a U(1) theory where interactions between
the electrons can be described in terms of a fluctuating U(1) vector potential, cf. Section 6.2).
Experimental analysis of typical weak interaction processes, such as the elastic collision,
$e + \nu_e \to e + \nu_e$, indicates that the weak interaction forces are extraordinarily short-range, with a decay
profile $\sim \exp(-90\,\mathrm{GeV}\, r)$. However, according to the pure gauge theory, the propagator of $W_\mu$
should be long-range, $\sim r^{-1}$. This is the most severe manifestation of the mass problem of the
electroweak theory. In order to be consistent with experiment, a mechanism is needed that makes
the gauge field (very) massive.

Here is where the Higgs mechanism enters the stage. To solve the mass problem, Weinberg
and Salam postulated the existence of a scalar (more precisely, a two-component “iso-scalar”)
bosonic particle, the Higgs boson $\phi$. The action of the Higgs particle - again a postulate - is
of generalized $\phi^4$ type, i.e.

$$S[\phi, W_\mu] = \int d^{d+1}x\left[\frac{1}{2}\left[(\partial_\mu - iW_\mu)\phi\right]^\dagger(\partial_\mu - iW_\mu)\phi - \frac{m^2}{2}\phi^\dagger\phi + \frac{g}{2}(\phi^\dagger\phi)^2\right],$$

where the minimal coupling to the gauge field provides the theory with local gauge invariance.
The action of the Higgs has been deliberately designed so as to generate spontaneous symmetry
breaking, i.e. the solution of the mean-field equations is given by $|\phi| = (m^2/2g)^{1/2}$, with
undetermined phase. In direct analogy to our discussion of the superconductor above, an integration
over the phase degree of freedom (i.e. the Goldstone mode) then generates a mass term for the
gauge fields. In summary, Weinberg and Salam proposed an explanation of the short-range nature
of the weak interaction through the presence of an extra particle, i.e. a particle that does not
belong to the standard hadronic or leptonic generations of the standard model.

Since then, the hunt for the Higgs particle has been one of the big challenges for particle
physics. In 1983 vector bosons of the predicted mass were for the first time observed in
experiment$^{45}$ (see Fig. 6.9), i.e. the existence of a massive gauge structure is now beyond
question. However, the detection of the Higgs turned out to be a more difficult task. In 2000
the ALEPH experiment installed at the LEP (Large Electron Positron Collider) at CERN reported
a “shadowy” Higgs signal in the $e^-e^+$ scattering cross-section at the predicted energy interval.
However, at that time, the designated lifetime of LEP had already expired and one month after

44 Typically, $S[W]$ will be given by a generalization of the field strength tensor $\frac{1}{4}F_{\mu\nu}F^{\mu\nu}$ of the abelian
theory, i.e. the curvature tensor on $G$.

45 G. Arnison et al., Experimental observation of isolated large transverse energy electrons with associated missing
energy at $\sqrt{s} = 540$ GeV, Phys. Lett. B 122 (1983), 103-16.
Figure 6.9 Computer-generated visualization of a scattering process recorded in the ALEPH experiment
at CERN. Hidden somewhere in the “jet” of particles generated as a result of the collision of the
scattering particles should, hopefully, be the Higgs.


the detection of the first suspicious signals the machine was indeed shut down. Unfortunately, 
after the closedown of LEP only a few accelerator projects offer the perspective to participate in 
the hunt for the Higgs. Presently, with the “Superconducting Super Collider” (SSC) disapproved 
by the American Congress, only the Stanford Linear Accelerator reaches the relevant energy 
scales. The situation may change when the Large Hadron Collider (LHC) at CERN commences 
its work. 

In the meantime the collected data recorded at LEP over the past years have been subjected 
to critical review. Frustratingly, it turned out that the data reported in 2000 did not pass the test 
of a careful re-examination, i.e. presently (2009) there is no direct evidence for a Higgs particle. 
In view of the fact that the Higgs generates the mass not only of the vector bosons but of all 
particles known to the standard model, much indeed hinges on the question of its existence. 
(Some people even call the Higgs the “God particle.”) If it did not exist, our understanding of 
the microscopic world would be turned upside down. 


To conclude our discussion of BCS superconductivity, let us explore the phenomenological
consequences of mass accumulation due to the Higgs mechanism. To this end, let us vary
the action (6.39) with respect to $\mathbf{A}$ (keeping in mind the transversality condition $\mathbf{q}\cdot\mathbf{A}_q = 0$,
we henceforth drop the superscript “$\perp$”), to obtain $\left(\frac{n_s}{m} + q^2\right)\mathbf{A}_q = 0$, or

$$\left(\frac{n_s}{m} - \nabla^2\right)\mathbf{A}(\mathbf{r}) = 0. \qquad (6.40)$$
Figure 6.10 On the Meissner effect: inside a superconductor (the shaded area), magnetic fields
decay exponentially. Microscopically, an external field existing outside a superconductor-vacuum 
interface induces diamagnetic surface currents inside the superconductor. These currents generate 
a counter-field that diminishes the external field. 


Remembering that $\mathbf{B} = \nabla\wedge\mathbf{A}$, multiplication of this equation by $\nabla\wedge$ produces the first
London equation

$$\left(\frac{n_s}{m} - \nabla^2\right)\mathbf{B}(\mathbf{r}) = 0. \qquad (6.41)$$


For $n_s \neq 0$, this equation does not have a non-vanishing constant solution, i.e. we conclude
that


A bulk superconductor cannot accommodate a magnetic field. 


This phenomenon is known as the Meissner effect. To understand what happens at the
interface between the vacuum threaded by a constant magnetic field $B_0$ and a superconductor,
one can solve the London equation to obtain $B(x) \sim B_0\exp(-x/\lambda)$, where

$$\lambda = \sqrt{\frac{m}{n_s}}$$

is known as the penetration depth and $x$ is the direction perpendicular to the interface
(see Fig. 6.10). The physical mechanism behind the Meissner phenomenon is as follows:
above we saw that the magnetic response of a superconductor is fully diamagnetic. That
is, in response to an external field, diamagnetic screening currents will be generated. The
magnetic field generated by these currents counteracts the unwanted external field. To see
this explicitly, we obtain the current density induced by the field by differentiating the first
term of the action$^{46}$ with respect to $\mathbf{A}$:


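$$\mathbf{j}(\mathbf{r}) = \frac{\delta}{\delta\mathbf{A}(\mathbf{r})}\int d^d r\, \frac{n_s}{2m}\mathbf{A}^2 = \frac{n_s}{m}\mathbf{A}(\mathbf{r}), \qquad (6.42)$$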


46 Generally, the electrical current density induced by a field is obtained (cf. the remarks made on page 255)
by differentiating the field/matter part of the action $S[\mathbf{A}]$ with respect to the vector potential, i.e. the purely
field-dependent part of the action does not contribute to the current density.
i.e. the current density is directly proportional to $\mathbf{A}$. This is the second London equation.
Since the vector potential and the magnetic field show the same decay profile (Eqs. (6.40) and
(6.41)), the current density also decays exponentially inside the superconductor. However,
in doing so, it annihilates the external field.
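
The exponential screening implied by the London equation (6.41) can be illustrated with a small finite-difference calculation. The following sketch is a minimal illustration under stated assumptions: the geometry is a one-dimensional half-space, the far boundary at $x = L \gg \lambda$ is set to $B = 0$, units are chosen such that $m = n_s = 1$, and the function names and grid parameters are hypothetical.

```python
import numpy as np

def penetration_depth(n_s, m=1.0):
    """lambda = sqrt(m / n_s), as follows from (n_s/m - d^2/dx^2) B = 0."""
    return np.sqrt(m / n_s)

def solve_london(B0, n_s, m=1.0, L=20.0, N=1000):
    """Solve (n_s/m - d^2/dx^2) B(x) = 0 on [0, L] with B(0) = B0, B(L) = 0."""
    x = np.linspace(0.0, L, N + 1)
    h = x[1] - x[0]
    k2 = n_s / m
    # Tridiagonal system for the interior values B_1 ... B_{N-1}
    main = (k2 + 2.0 / h**2) * np.ones(N - 1)
    off = (-1.0 / h**2) * np.ones(N - 2)
    M = np.diag(main) + np.diag(off, 1) + np.diag(off, -1)
    rhs = np.zeros(N - 1)
    rhs[0] = B0 / h**2  # contribution of the boundary value B(0) = B0
    B = np.empty(N + 1)
    B[0], B[-1] = B0, 0.0
    B[1:-1] = np.linalg.solve(M, rhs)
    return x, B

x, B = solve_london(B0=1.0, n_s=1.0)
lam = penetration_depth(n_s=1.0)
max_dev = np.max(np.abs(B - np.exp(-x / lam)))
print(f"lambda = {lam:.3f}, max deviation from exp(-x/lambda) = {max_dev:.2e}")
```

By the second London equation (6.42), the screening current follows the same profile as the vector potential, so the same routine can be read as a model of the surface current layer.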

INFO To heuristically understand the incompatibility of magnetic fields and superconductive
pairing on a still more microscopic level, consider the real space representation of a
Cooper pair state, $\langle\mathbf{r},\mathbf{r}|\mathbf{k}\!\uparrow,-\mathbf{k}\!\downarrow\rangle \sim e^{-i\mathbf{k}\cdot\mathbf{r}}e^{+i\mathbf{k}\cdot\mathbf{r}} = \mathrm{const}$. The cancellation of the phases results
from the fact that two electrons propagating with opposite momenta acquire opposite quantum
phases. Thus, the pair state is a slowly fluctuating, and therefore stable, object. However, in the
presence of a magnetic field, the phase factors have to be generalized to

$$\langle\mathbf{r},\mathbf{r}|\mathbf{k}\!\uparrow,-\mathbf{k}\!\downarrow\rangle \sim e^{-i\int d\mathbf{r}\cdot(\mathbf{k} - e\mathbf{A})}\, e^{-i\int d\mathbf{r}\cdot(-\mathbf{k} - e\mathbf{A})} \sim e^{2ie\mathbf{A}\cdot\mathbf{r}},$$

where we assumed that the vector potential varies only slowly across our observation region of
$O(|\mathbf{r}|)$, i.e. the Cooper pair amplitude becomes an “incoherent” phase-dependent object. (Exercise:
Employ the WKB approximation to convince yourself of the validity of this statement.) On
the microscopic level, the lack of stability of the field-dependent Cooper amplitude is responsible
for the aversion of the superconductor to magnetic fields.

It is interesting to explore how a strong magnetic field eventually makes its way into the
superconductor. To understand the competition between superconductive ordering and magnetic
field energy, we need to go back to the Ginzburg-Landau action (6.33), i.e. to a description
that involves both phase and amplitude of the order parameter (the latter detecting the presence
or absence of a stable condensate). However, at the time when we derived that action, no
attention had been paid to the electromagnetic properties of the system. Fortunately, after
our general discussion of gauge invariance above, the minimal coupling of the system to the
electromagnetic field is routine work. We simply have to remember that, under a gauge
transformation, $\Delta \to \Delta e^{2i\phi}$, i.e. $\nabla\Delta\cdot\nabla\bar\Delta \to (\nabla + 2i\nabla\phi)\Delta\cdot(\nabla - 2i\nabla\phi)\bar\Delta$. The gauge invariant extension
of Eq. (6.33) thus reads as

$$S_{\rm GL}[\Delta,\bar\Delta] = \beta\int d^d r\left[\frac{r}{2}|\Delta|^2 + \frac{c}{2}|(\nabla - 2i\mathbf{A})\Delta|^2 + g|\Delta|^4\right],$$

where, as usual, $\mathbf{A}$ gauges as $\mathbf{A} \to \mathbf{A} + \nabla\phi$. To monitor the fate of the order parameter as
$|\mathbf{A}| \propto |\mathbf{B}|$ increases, consider the mean-field equation (exercise)

$$\left[r + c(-i\nabla - 2\mathbf{A})^2 + 4g|\Delta|^2\right]\Delta = 0.$$

Here we assume that we are at temperatures below the zero-field superconductor transition, i.e.
$r < 0$. Superconductive ordering exists when the equation has a non-vanishing solution $\Delta$. Now,
the third contribution on the left-hand side is positive, so a solution can exist only if the first
two terms add to a net negative contribution. This in turn requires the following condition on
the eigenvalues of the minimally coupled operator,

$$\mathrm{EV}\,(-i\nabla - 2\mathbf{A})^2 < \frac{|r|}{c}.$$

Formally, $(-i\nabla - 2\mathbf{A})^2$ is the kinetic energy operator of a particle of mass $1/2$ and charge $q = 2$ in
a uniform magnetic field. Its eigenvalues are the Landau levels, $\omega_c(n + \frac{1}{2})$, $n = 0, 1, \ldots$, familiar
from elementary quantum mechanics. Here, $\omega_c$ is the cyclotron frequency, $\omega_c = qB/m = 4B$.
Thus, a finite pairing amplitude can be obtained only if $|r|/c$ is larger than the energy of the
lowest Landau level, $\omega_c/2 = 2B$, or

$$B < B_{c2} \equiv \frac{|r|}{2c}.$$

For magnetic fields larger than this, the energy needed to expel the field is larger than the
maximum gain of condensation energy $S[\Delta]$ and superconductivity breaks down. There are
superconductor materials - superconductors of type II - where the energy criterion opts for
field penetration for fields $B_{c2} > B > B_{c1}$ lower than the critical field $B_{c2}$ specified by the mean-field
criterion above. For these systems, the superconductor and the field “reach a compromise.”
That is, vortex tubes of quantized flux penetrate the superconductor for a field strength smaller
than $B_{c2}$ but larger than the critical field strength $B_{c1}$.$^{47}$ These Abrikosov vortices usually
arrange into a triangular vortex lattice. The figure above (courtesy of U. Hartmann, University
of Saarbrücken) shows an STM image of a vortex lattice in the type II superconductor NbSe$_2$. The
distance between vortex centers is about 50 nm. Each of the vortices in a flux lattice contains
magnetic flux $\Phi = h/2$, where $h$ (or $h/e$ in units where the electron charge is kept track of)
is the magnetic flux quantum. Inside the cores of the Abrikosov vortices, the superconducting
order parameter is suppressed, but outside it still exists. For a discussion of the thermodynamics
of superconductors not showing a mixed phase (superconductors of type I), we refer to the
literature.$^{48}$
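
The Landau level argument used above for the upper critical field can be checked numerically. In the Landau gauge $\mathbf{A} = (0, Bx, 0)$ and for vanishing transverse momenta, the operator $(-i\nabla - 2\mathbf{A})^2$ reduces to the one-dimensional oscillator $-\partial_x^2 + (2Bx)^2$. The sketch below is a hypothetical illustration (the gauge choice, grid parameters, and function names are assumptions of this example): it diagonalizes a finite-difference version of that operator, compares its lowest eigenvalue with $\omega_c/2 = 2B$, and evaluates the resulting criterion $B < |r|/2c$.

```python
import numpy as np

def lowest_landau_level(B, half_width=8.0, N=800):
    """Lowest eigenvalue of -d^2/dx^2 + (2 B x)^2 on a finite grid."""
    ell = 1.0 / np.sqrt(2.0 * B)  # width of the oscillator ground state
    x = np.linspace(-half_width * ell, half_width * ell, N)
    h = x[1] - x[0]
    kinetic = (2.0 * np.eye(N) - np.eye(N, k=1) - np.eye(N, k=-1)) / h**2
    potential = np.diag((2.0 * B * x) ** 2)
    return np.linalg.eigvalsh(kinetic + potential)[0]

def upper_critical_field(r, c):
    """Mean-field criterion 2B < |r|/c, i.e. B_c2 = |r| / (2c)."""
    return abs(r) / (2.0 * c)

B = 0.7
E0 = lowest_landau_level(B)
print(f"numerical lowest level: {E0:.4f}, expected omega_c/2 = 2B = {2.0 * B:.4f}")
print("B_c2 for r = -2, c = 0.5:", upper_critical_field(-2.0, 0.5))
```

The values of $r$ and $c$ in the last line are arbitrary; in the Ginzburg-Landau description they are phenomenological, temperature-dependent parameters.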

To conclude this section let us discuss the most prominent superconducting phenomenon,
the absence of electrical resistivity. Assume we have chosen a gauge where an external
electric field $\mathbf{E}$ is represented through $\mathbf{E} = -i\partial_\tau\mathbf{A}$ (i.e. the static component of the potential
vanishes). In this case, a time differentiation of the second London equation (6.42) gives
$-i\partial_\tau\mathbf{j} = -i\frac{n_s}{m}\partial_\tau\mathbf{A} = \frac{n_s}{m}\mathbf{E}$. Continuing back to real times we conclude that

$$\partial_t\mathbf{j} = \frac{n_s}{m}\mathbf{E},$$

i.e. in the presence of an electric field the current increases linearly at a rate inversely
proportional to the carrier mass and proportional to the carrier density. The unbounded increase
of current is indicative of ballistic - i.e. dissipationless - motion of the condensate particles
inside the superconductor. Now, an unboundedly increasing current is clearly unphysical,
i.e. what the relation above really tells us is that a superconductor cannot maintain
non-vanishing field gradients.

EXERCISE Assuming that each particle is subject to Newton’s equation of motion $m\ddot{\mathbf{r}} = \mathbf{E}$,
obtain the current-field relation above. How would the relation between field and current change
if the equation of motion contained a friction term (modeling dissipation), $m\ddot{\mathbf{r}} = -\frac{m}{\tau}\dot{\mathbf{r}} + \mathbf{E}$?
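
A numerical illustration (not a substitute for the analytical derivation asked for in the exercise) may help to build intuition. The sketch below integrates the one-dimensional equation of motion with a simple explicit Euler step, in units where the carrier charge is one; the function name, the step size, and the parameter values are assumptions of this example.

```python
def current(t_final, E, n_s=1.0, m=1.0, tau=None, dt=1e-4):
    """Current j = n_s * v after time t_final for m dv/dt = -(m/tau) v + E.

    tau=None switches the friction term off (ballistic motion).
    """
    v = 0.0
    steps = int(round(t_final / dt))
    for _ in range(steps):
        friction = 0.0 if tau is None else -(m / tau) * v
        v += dt * (E + friction) / m  # explicit Euler step
    return n_s * v

j_free = current(t_final=2.0, E=0.5)            # grows as (n_s/m) E t
j_drude = current(t_final=2.0, E=0.5, tau=0.1)  # saturates at (n_s tau/m) E
print("ballistic:", j_free, " with friction:", j_drude)
```

Without friction the current grows linearly in time, in line with $\partial_t\mathbf{j} = (n_s/m)\mathbf{E}$; with friction it saturates at a value proportional to the field, which is the behavior of an ordinary (Ohmic) conductor.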


47 A. A. Abrikosov, On the magnetic properties of superconductors of the second group, Soviet Physics JETP 5,
1174-83 (1957).

48 Cf., e.g., L. D. Landau and E. M. Lifshitz, Course of Theoretical Physics, Vol. 9 - Statistical Physics 2
(Butterworth-Heinemann, 1981).
To close the chapter on broken symmetry and collective phenomena, we turn now to a
final, detailed application of the field integral method involving the problem of electrons 
propagating in a disordered environment. As well as the importance attached to this gen- 
eral area in the recent literature, the quantum disorder problem presents an ideal arena 
in which to revise the diagrammatic and field theoretic methods developed in this and 
previous chapters. This being so, this section is structured in a manner that reflects the 
different techniques and much of our discussion is deliberately cast in the form of problem 
assignments. However, this section may be read equally as a complete and coherent text, 
depending on taste. 


Disorder in metals 

No semiconductor or metal of macroscopic$^{49}$ extent is ever free of imperfections and impurities.
Indeed, the effect of disorder on the phenomenology of metals or semiconductors could
not be more varied: in some cases, disorder plays an essential role (for example, conventional 
light bulbs would not function without impurity scattering!), in others the effect is parasitic 
(imparting only a “blurring” of otherwise structured experimental data), or it conspires to 
give rise to completely unexpected types of electron dynamics (as is the case in the quantum 
Hall transition discussed in Chapter 9). 

With this in mind, a complete theory of electron transport must, by necessity, include 
diagnostic tools to understand whether disorder seriously affects the problem of interest. 
Moreover, such a theory should include some analytical machinery to deal with the cases 
where the answer is affirmative. What kind of criteria should a successful theory of disor- 
dered conductors meet? A fair fraction of those problems where impurity scattering plays an 
essential role can be addressed in terms of infinite-order perturbation theory. However, there 
are plenty of phenomena - Anderson localization, the quantum Hall effects, the combined 
theory of interactions and disorder, to mention just a few - where non-perturbative field 
theoretical methods are required. In this section, the foundation of a general approach to 
the disordered electron gas, extendable to both perturbative and non-perturbative schemes, 
is laid. 

However, before doing this, we must first clarify what is meant by a “theory” of the 
disordered electron gas. Of course, while the problem may be considered formally as non- 
interacting, we will not be able to effect an exact diagonalization of the random Schrodinger 
equation for an electron in a metal for a given realization of the impurity potential. (Indeed, 
were such a particular solution available, it would not convey much information.) Rather, 
one needs to develop a statistical approach wherein the system is described in terms of 
a few universal characteristics of the scattering landscape - the strength of the impurity 


49 In ultraclean semiconducting devices, electrons may travel up to distances of several microns without
experiencing impurity scattering. Even so, the “chaotic” scattering from the typically irregular boundaries of the
system has an equally invasive effect on the charge carrier dynamics.
potential, the typical range of potential fluctuations, etc. In general, the analysis of generic
properties involves averaging over microscopic realizations of the impurity potential. 

INFO In situations where the system is so large that different regions behave as though they
were statistically independent with respect to their microscopic impurity configuration, properties
become self-averaging. In such cases, the configuration average can be subsumed into a
volume average of the individual system. As a rule of thumb, systems which behave in a
self-averaging manner must extend well beyond the phase coherence length $\xi$ - the length scale over
which the quantum propagation remains phase coherent. (A more precise characterization of
$\xi$ is given below.) Now, at low temperatures, $\xi(T)$ grows rapidly, implying that even systems
of near macroscopic extent (ca. $O(1\,\mu\mathrm{m})$ and more) can behave as though they were
non-self-averaging. Systems of this type are often termed mesoscopic,$^{50}$ where “meso-” alludes to the
fact that such systems are macroscopic in extent yet microscopic in their reflection of quantum
mechanical character. Mesoscopic systems manifest a multitude of unusual quantum phenomena
from localization to strong sample-to-sample fluctuations, some of which will be discussed below.
The experimental and theoretical study of these phenomena is the central theme of mesoscopic
physics.

How might one set about modeling an impurity potential in statistical terms? One might, for
example, propose that a single imperfection at position $\mathbf{r}_i$ creates a potential $V_{\rm imp}(\mathbf{r} - \mathbf{r}_i)$.
Assuming that all impurities are, by and large, equivalent, the total perturbation experienced
by an electron at point $\mathbf{r}$ will then take the form $V(\mathbf{r}) = \sum_i V_{\rm imp}(\mathbf{r} - \mathbf{r}_i)$. Within
this framework (which prevails in the older literature) the disorder average amounts to
integrating over the coordinates of the impurities, i.e. $\langle\cdots\rangle_{\rm dis} \equiv L^{-Nd}\prod_{i=1}^{N}\int d^d r_i\,(\cdots)$.
The disadvantage of this scheme is that its implementation is not so straightforward, especially
in functional-based approaches. In practice, it is more convenient to think of the
potential $V$ as some function whose statistical properties are described by a probability
measure $P[V]$. Averages over the potential are then computed by performing the integral
$\langle\cdots\rangle_{\rm dis} = \int DV\, P[V](\cdots)$. In most$^{51}$ applications it is sufficient to implement a Gaussian
distribution (unnormalized), $P[V] = \exp\left[-\frac{1}{2\gamma^2}\int d^d r\, d^d r'\, V(\mathbf{r})K^{-1}(\mathbf{r} - \mathbf{r}')V(\mathbf{r}')\right]$, where $\gamma$
measures the strength of the potential and $K$ describes its spatial correlation profile:

$$\langle V(\mathbf{r})V(\mathbf{r}')\rangle_{\rm dis} = \gamma^2 K(\mathbf{r} - \mathbf{r}'). \qquad (6.43)$$

Very often one finds that the finite spatial correlation of $V$ is inessential, in which case one
may set $K(\mathbf{r}) = \delta(\mathbf{r})$, and

$$P[V]\,DV = \exp\left[-\frac{1}{2\gamma^2}\int d^d r\, V^2(\mathbf{r})\right]DV. \qquad (6.44)$$
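
To make the short-range Gaussian distribution (6.44) concrete, one may discretize space. The sketch below is a minimal illustration under stated assumptions: a one-dimensional lattice with spacing `a`, on which the $\delta$-function becomes $\delta_{ij}/a$, so that each site carries an independent Gaussian variable of variance $\gamma^2/a$. The function names and all parameter values are hypothetical.

```python
import numpy as np

def sample_potential(n_sites, a, gamma, n_samples, rng):
    """Draw lattice realizations of V distributed according to Eq. (6.44).

    In one dimension the weight exp[-(a / 2 gamma^2) sum_i V_i^2] implies
    independent sites with variance gamma^2 / a.
    """
    sigma = gamma / np.sqrt(a)
    return rng.normal(0.0, sigma, size=(n_samples, n_sites))

def correlator(V):
    """Sample estimate of <V_i V_j>_dis from an array of realizations."""
    return V.T @ V / V.shape[0]

rng = np.random.default_rng(1)
a, gamma = 0.5, 1.5
V = sample_potential(n_sites=8, a=a, gamma=gamma, n_samples=20000, rng=rng)
C = correlator(V)
print("expected on-site value gamma^2/a =", gamma**2 / a)
print("sampled on-site values:", np.round(np.diag(C), 2))
```

Up to statistical noise, the off-diagonal elements of `C` vanish, which is the lattice version of $\langle V(\mathbf{r})V(\mathbf{r}')\rangle_{\rm dis} = \gamma^2\delta(\mathbf{r} - \mathbf{r}')$. In $d$ dimensions the same construction applies with variance $\gamma^2/a^d$.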

The freedom that one can exercise in modeling the scattering potentials reflects the fact
that, in a multiple scattering process, details of the potential are quickly erased. At any
rate, the short-range Gaussian distribution provides a convenient scheme both in purely
diagrammatic perturbation theory and in functional approaches.

50 Note that it has, however, become fashionable to label any system of a size of $O(1\,\mu\mathrm{m})$ “mesoscopic,” irrespective
of whether or not its behavior is classical.

51 An important class of exceptions is presented by bosonic problems of general type where, for stability reasons,
$V(\mathbf{r}) > 0$, while Gaussian distributed potentials categorically include negative “tails.”

As discussed above, our aim is to average the quantum expectation value of a certain
observable $O$ over the ensemble of disorder. Let us assume that the observable $O$ can
be obtained by differentiation of the (functional) free energy, that is $O = -\frac{\delta}{\delta J}\big|_{J=0}\ln Z$,
where $J$ represents some source field,$^{52}$ i.e.

$$\langle O\rangle_{\rm dis} = -\frac{\delta}{\delta J}\Big|_{J=0}\langle\ln Z\rangle_{\rm dis}. \qquad (6.45)$$

This fundamental relation presents a technical challenge: should one first differentiate and
only then average, one would need to compute integrals of the type

$$\langle O\rangle_{\rm dis} = -\int DV\, P[V]\, \frac{1}{Z[V, 0]}\, \frac{\delta Z[V, J]}{\delta J}\Big|_{J=0}.$$

Due to the appearance of the function $V$ in both the numerator and the denominator,
integrals of this type are largely intractable. The problem is particularly acute in functional
approaches where one intends to take the ensemble average at an early stage of the
computation.$^{53}$

To date, three different approaches have been identified in which the problem with the
denominator can be circumvented: the supersymmetry approach,$^{54}$ the Keldysh technique,$^{55}$
and the replica trick.$^{56}$ All of these approaches share the feature that they alter
the definition of the functional partition function in such a way that (a) $Z[J = 0] = 1$
(i.e. the disorder dependence of the denominator disappears), while (b) Eq. (6.45) remains
valid, and (c) the algebraic structure of $Z[J]$ is left largely unchanged. Since the disorder
appears linearly in the Hamiltonian, point (c) implies that we need to average functionals
with actions linear in the potential $V$, an enterprise that turns out to be quite feasible.

Let us exemplify this program on the (technically) simplest of the three approaches above,
the replica trick. Consider the $R$th power of the partition function, $Z^R$. For integer $R$ one
may think of $Z^R$ as the partition function of $R$ identical copies of the original system (see
Fig. 6.11), hence the name “replica” trick. To appreciate the merit of this procedure, one
may note the formal relations

$$\langle O\rangle = -\frac{\delta}{\delta J}\ln Z[J] = -\frac{\delta}{\delta J}\lim_{R\to 0}\frac{1}{R}\left(e^{R\ln Z} - 1\right) = -\frac{\delta}{\delta J}\lim_{R\to 0}\frac{1}{R}Z^R.$$

The last equality tells us that the expectation value of observables can be obtained by
performing computations with the $R$th power of $Z$ (instead of its logarithm). Crucially,
expressed in the coherent state representation, the expression for the replicated partition

52 In the jargon of field theory, a source field represents a parameter (function) that, when linearly added to the
exponent of a functional integral, can be used to generate the expectation value of observables by differentiation.
For example, the parameter $\mu$ is a source generating expectation values of the particle number: $-\partial_\mu F = \langle N\rangle$.

53 In cases where the disorder can be treated perturbatively, one may first expand in powers of $V$ and only then
average; indeed, this is quite a viable strategy.

54 K. B. Efetov, Supersymmetry method in localisation theory, Sov. Phys. JETP 55 (1982), 514-21.

55 For a review see, e.g., A. Kamenev, Many body theory of non-equilibrium systems, in Nanophysics: Coherence
and Transport, eds. H. Bouchiat et al., 177-246 (Elsevier, Amsterdam, 2005).

56 S. F. Edwards and P. W. Anderson, Theory of spin glasses, J. Phys. F 5 (1975), 965-74.
Figure 6.11 On the idea behind the replica trick; for an explanation, see the main text.


function involves an effective action which is still linear,
$\left(e^{-\int\bar\psi V\psi}\right)^R = e^{-\sum_{a=1}^{R}\int\bar\psi^a V\psi^a}$, i.e. $Z^R$ will
contain the disorder linearly in the exponent and, therefore, will be comparatively easy to
average. However, the replica-averaging procedure involves one unusual feature - at the end
of the calculation, one must implement the analytic continuation $R \to 0$. More precisely, we
will have to compute a certain function (namely $f(R) = -\frac{\delta}{\delta J}R^{-1}\langle Z^R\rangle_{\rm dis}$) for every integer $R$
and then analytically continue $R \to 0$. However, there is no guarantee that $f(R)$ is analytic
all the way down to $R = 0$.$^{57}$ In other words, the method is poorly founded, which is why
it is called the “replica trick” as opposed to, say, the “replica theory.” However - in view of
its poorly justifiable theoretical standing, quite surprisingly - examples where the replica
trick is known to fail are rare. To some extent, this success is explained by the fact that
the method is exact as long as the disorder is treated perturbatively (a point to be clarified
below). So long as a perturbatively accessible point in the parameter space is not too far
away, the chances are that it will not fail.
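
The formal relation $\ln Z = \lim_{R\to 0}(Z^R - 1)/R$ underlying the replica trick can be illustrated on a toy model in which the disorder average is a one-dimensional integral. The sketch below is a hypothetical example and not a model from the text: a two-level system with partition function $Z = 1 + e^{-\beta\epsilon}$ and a Gaussian distributed level energy $\epsilon$, averaged by Gauss-Hermite quadrature. Here $R$ is taken small and real directly, so the delicate continuation from integer $R$ is sidestepped.

```python
import numpy as np

# Gauss-Hermite nodes and weights for integrals with weight exp(-x^2)
nodes, weights = np.polynomial.hermite.hermgauss(80)

def disorder_average(f, sigma):
    """<f(eps)> for eps drawn from a Gaussian of zero mean and width sigma."""
    eps = np.sqrt(2.0) * sigma * nodes
    return np.sum(weights * f(eps)) / np.sqrt(np.pi)

def Z(eps, beta=1.0):
    """Toy partition function of a two-level system with energies 0 and eps."""
    return 1.0 + np.exp(-beta * eps)

def quenched(sigma):
    """<ln Z>_dis, the quantity one actually wants."""
    return disorder_average(lambda e: np.log(Z(e)), sigma)

def replica_estimate(R, sigma):
    """(<Z^R>_dis - 1) / R, which tends to <ln Z>_dis as R -> 0."""
    return (disorder_average(lambda e: Z(e) ** R, sigma) - 1.0) / R

def annealed(sigma):
    """ln <Z>_dis, which in general differs from <ln Z>_dis."""
    return np.log(disorder_average(Z, sigma))

for R in (1.0, 0.1, 1e-4):
    print(f"R = {R:g}: replica estimate = {replica_estimate(R, 1.0):.5f}")
print(f"quenched <ln Z> = {quenched(1.0):.5f}, annealed ln<Z> = {annealed(1.0):.5f}")
```

The replica estimate approaches the quenched average as $R \to 0$, while the annealed value $\ln\langle Z\rangle_{\rm dis}$ stays larger, a reminder that averaging $Z$ itself is not a substitute for averaging its logarithm.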

INFO The applicability of replica methods is not limited to the theory of disordered electron
systems. Replicated field theories are of potential use whenever it comes to averaging the free
energy functional of a disordered classical or quantum system. The method has proven
most fruitful (at times also controversial) in the theory of conventional and spin glasses.$^{58}$


Replica field theory 

With this background, we return now to an analysis of the disordered electron system. The 
construction of the field integral begins with the representation of the replicated partition 
function as a coherent state field integral 


$$Z^R[J] = \int D(\bar\psi,\psi)\, \exp\Big(-\sum_{a=1}^{R} S[\psi^a,\bar\psi^a,J]\Big), \tag{6.46}$$

where $\psi^a$, $a = 1,\dots,R$, denotes the Grassmann field representing partition function number
$a$, $D(\bar\psi,\psi) \equiv \prod_{a=1}^{R} D(\bar\psi^a,\psi^a)$, and $S[\psi^a,\bar\psi^a,J] = S_0[\psi^a,\bar\psi^a] + S_{\rm int}[\psi^a,\bar\psi^a] + S_s[\psi^a,\bar\psi^a,J]$.
(Notice that the source $J$ is the same for all replicas, i.e. it does not carry an index $a$.) Here

$$S_0[\psi^a,\bar\psi^a] = \sum_n \int d^dr\; \bar\psi^a_n(\mathbf r)\Big(-i\omega_n - \frac{\partial^2}{2m} - E_F + V(\mathbf r)\Big)\psi^a_n(\mathbf r) \tag{6.47}$$

describes the non-interacting part of the disordered electron system (assumed, for simplicity,
to be spinless), $S_{\rm int}$ specifies the bare particle interaction, and $S_s$ is the source-dependent
part of the action (whose structure we need not specify for the moment). Notice that, before
averaging, the action is “replica diagonal,” i.e. fields $\psi^a$ and $\psi^b$, $a \neq b$, do not interact with
each other.

57 This uncertainty reflects disorder-generated correlations between the replicated systems. Since $\langle Z^R\rangle_{\rm dis} \neq
\langle Z\rangle^R_{\rm dis}$, the function we wish to continue is not just a harmless power function.

58 For a review of the industry of replica-based theoretical approaches in these fields we refer to the article by G.
Parisi, Glasses, replicas and all that, in Les Houches - Ecole d’Ete de Physique Theorique, Vol. 77, ed. J.-L.
Barrat et al. (Elsevier, 2004).

Before proceeding with this expression, let us make one technical remark on the calculation
of observables. Suppose we were interested in the expectation value of some operator
which, in the non-replicated theory, assumes the form $\langle O\rangle = -(\delta/\delta J)\ln Z[J] =
\langle O(\bar\psi,\psi)\rangle_\psi/\langle 1\rangle_\psi$. Here, $J$ represents some suitably devised source and, in the last equality,
$O(\bar\psi,\psi)$ is the coherent state representation of the operator $O$. The denominator $\langle 1\rangle_\psi$
reminds us of the fact that a quantum thermal average involves an explicit normalization by
the (functional) partition function. Within the replicated formalism, the same expectation
value assumes the form

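$$\langle O\rangle = -\lim_{R\to 0}\frac{1}{R}\frac{\delta}{\delta J}Z^R[J] = \lim_{R\to 0}\frac{1}{R}\sum_{a=1}^{R}\big\langle O(\bar\psi^a,\psi^a)\big\rangle_\psi, \tag{6.48}$$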

where the last equality follows from differentiating Eq. (6.46) with respect to the source J. 
Assuming that all observables are evaluated as in Eq. (6.48), we no longer need to keep an 
explicit reference to the source field J. 

Now, let us average the functional (6.46) over the distribution (6.44). A straightforward 
application of the Gaussian integral formula (3.19) leads us to the result 

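$$\langle Z^R[J]\rangle_{\rm dis} = \int D(\bar\psi,\psi)\,\exp\Big(-\sum_{a=1}^{R} S_{\rm cl}[\psi^a,\bar\psi^a,J] - \sum_{a,b=1}^{R} S_{\rm dis}[\psi^a,\psi^b,\bar\psi^a,\bar\psi^b]\Big),$$

where $S_{\rm cl} \equiv S|_{V=0}$ denotes the action of the non-disordered system, and

$$S_{\rm dis}[\psi^a,\psi^b,\bar\psi^a,\bar\psi^b] = -\frac{\gamma^2}{2}\sum_{m,n}\int d^dr\;\bar\psi^a_m(\mathbf r)\psi^a_m(\mathbf r)\,\bar\psi^b_n(\mathbf r)\psi^b_n(\mathbf r) \tag{6.49}$$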

represents an effective quartic interaction generated by the disorder average. Notice the
superficial similarity between $S_{\rm dis}$ and an attractive short-range “interaction” term. However,
in contrast to a dynamically generated interaction, (a) $S_{\rm dis}$ does not involve frequency
exchanging processes (the reason being, of course, that the scattering off static impurities is 
energy conserving), and (b) it describes interactions between particles of a different replica 
index. To understand the physics behind the attractive inter-replica interaction, consider the 
potential landscape of a given impurity configuration (see the figure). Irrespective of their 
replica indices, all Feynman amplitudes will try to trace out those regions in configuration 
space where the potential energy is low, i.e. there will be a tendency to propagate through 
the same regions in the potential landscape. (Recall that all replica fields are confronted 
with the same potential profile.) On average, this looks as if the replica fields were subject 
to an attractive interaction mechanism. 
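The mechanism by which the Gaussian average couples different replicas can be illustrated with ordinary numbers in place of Grassmann bilinears. The sketch below is a single-site toy model, not the field-theoretic calculation: a scalar $V$ with weight $\propto \exp(-V^2/2\gamma^2)$ couples to hypothetical "replica densities" $n_a$ through $e^{-V\sum_a n_a}$, and the average is $\exp\big(\tfrac{\gamma^2}{2}\sum_{a,b} n_a n_b\big)$, which contains the cross terms $a \neq b$ with the sign of an attraction. All names and parameter values are assumptions made for illustration.

```python
import math


def gaussian_average(f, gamma, n_grid=20001, cutoff=12.0):
    """Average f(V) over P(V) ~ exp(-V^2 / (2 gamma^2)) by the trapezoidal rule."""
    lo, hi = -cutoff * gamma, cutoff * gamma
    h = (hi - lo) / (n_grid - 1)
    num = 0.0
    den = 0.0
    for i in range(n_grid):
        v = lo + i * h
        end_weight = 0.5 if i in (0, n_grid - 1) else 1.0
        w = end_weight * math.exp(-v * v / (2.0 * gamma * gamma))
        num += w * f(v)
        den += w
    return num / den


gamma = 0.7
densities = [0.4, 1.1, 0.8]  # hypothetical replica densities n_a at one point
total = sum(densities)

# Left-hand side: explicit average of exp(-V * sum_a n_a) over V.
lhs = gaussian_average(lambda v: math.exp(-v * total), gamma)

# Right-hand side: exp(+gamma^2/2 * sum_{a,b} n_a n_b), i.e. exp(-sum_{ab} S_dis)
# with S_dis = -(gamma^2/2) n_a n_b coupling all pairs of replicas.
rhs = math.exp(0.5 * gamma ** 2 * sum(na * nb for na in densities for nb in densities))

print(lhs, rhs)
```

The double sum over $a, b$ on the right-hand side is the toy analog of the replica non-diagonal structure of Eq. (6.49).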








In summary, one may account for the presence of quenched or static disorder by (a)
replicating the formalism, (b) representing observables as in Eq. (6.48), and (c) adding the
replica non-diagonal contribution Eq. (6.49) to the action. This results in a theory wherein
the disorder no longer appears explicitly. (Technically, the effective action has become
translationally invariant.) The price to be paid is that the action now contains the non-linearity
Eq. (6.49). This concludes our formulation of the quantum disorder problem as a functional
integral. In the following section, we develop some intuition for the effects of disorder,
drawing on the formalism developed above.



Basic notions of impurity scattering 

The most basic time scale characterizing the scattering of electrons from static impurities is
the elastic scattering time $\tau$. (Here the term “elastic” emphasizes that the scattering off
static impurities may lead to the transfer of momentum but not of energy.) In the literature, 
the scattering time is often prematurely identified as the typical time of flight between 
neighboring impurities. However, this interpretation may be misleading. For example, a 
system may be (and often is) polluted by a dense accumulation of very weak scattering 
imperfections. In such cases, the scattering time may be parametrically larger than the 
time of flight. Similarly, the scattering may be from the shallow Coulomb potential created 
by impurities spatially separated from the conductor (a situation generically realized in 
semiconductor heterostructures). In such cases, impurity positions are themselves not even 
well-defined. How, then, can the scattering time be defined unambiguously? And how does 
it relate to the microscopic characteristics of the impurity potential? 

To assimilate the general meaning of the scattering time, let us consider the quantum
amplitude $U(\mathbf y,\mathbf x;t) = \langle \mathbf y|\exp(-iHt)|\mathbf x\rangle$ for a particle to propagate from a point $\mathbf x$ to a point
$\mathbf y$ in a time $t$ for a particular realization of the disorder potential. One may think of this
amplitude as the sum of all Feynman paths connecting the points $\mathbf x$ and $\mathbf y$. On its journey
along each path, the particle may scatter (see Fig. 6.12), implying that the action of the path
depends sensitively on the particular realization of impurities. For large separations, $|\mathbf x - \mathbf y|$,
the scattering phase becomes a “quasi-random” function of the impurity configuration. The
same applies, of course, to the linear superposition of all paths, the net amplitude $U$.

Let us now consider the impurity-averaged value of the transition amplitude $\langle U(\mathbf x,\mathbf y;t)\rangle_{\rm dis}$.
As we are averaging over a superposition of random phases, one may expect that the
disorder average will be translationally invariant and, as a result of the random phase
cancellation, rapidly decaying, $\langle U(\mathbf x,\mathbf y;t)\rangle_{\rm dis} \sim \exp(-|\mathbf x - \mathbf y|/(2\ell))$. The decay constant $\ell$
of the averaged transition amplitude defines the elastic mean free path while the related
time $\tau = \ell/v_F$ denotes the elastic scattering time. (We reiterate that only in systems of
dilute strong scattering centers can this scale be identified with the average spacing between
impurities.) In the following, we develop a quantitative description of this “damping”
process.

Figure 6.12 Showing the scattering of an electron off static imperfections. Inset: the corresponding
Feynman diagram.

Technically, it is most convenient to explore the behavior of the transition amplitude
in the imaginary-time formalism. More specifically, we shall consider the imaginary-time
Green function 59 $G(\mathbf x,\mathbf y;\tau) = \langle\bar\psi(\mathbf x,\tau)\psi(\mathbf y,0)\rangle_\psi$, where the averaging is over the Grassmann
action (6.47). (To keep the notation simple, we do not explicitly keep track of the normalizing
denominator $\langle 1\rangle_\psi$ in our notation.) As usual, it will be convenient to perform the
intermediate steps of the computation in frequency-momentum space. We thus represent
the correlation function as

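$$G(\mathbf x,\mathbf y;\tau) = \frac{T}{L^d}\sum_{\omega_n,\mathbf p,\mathbf p'} e^{-i\omega_n\tau + i\mathbf p\cdot\mathbf x - i\mathbf p'\cdot\mathbf y}\, G_{\mathbf p,\mathbf p';\omega_n}, \tag{6.50}$$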

where $\omega_n$ is a fermionic Matsubara frequency and $G_{\mathbf p,\mathbf p';\omega_n} = \langle\bar\psi_{n,\mathbf p}\psi_{n,\mathbf p'}\rangle_\psi$. (Keep in
mind that, prior to the impurity average, the system lacks translational invariance, i.e. the
Green function will depend on two independent momentum arguments.) Following the general
prescription developed in Section 6.5, the correlation function averaged over a Gaussian
disorder distribution is then given by (cf. Eq. (6.48))

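$$\langle G_{\mathbf p,\mathbf p';n}\rangle_{\rm dis} = \lim_{R\to 0}\frac{1}{R}\sum_{a=1}^{R}\big\langle\bar\psi^a_{n,\mathbf p}\psi^a_{n,\mathbf p'}\big\rangle_\psi, \tag{6.51}$$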

where $\psi^a$ is the $a$th component of the $R$-fold replicated field and $\langle\cdots\rangle_\psi$ now stands for
the functional average with an action including the interaction term Eq. (6.49). (Exercise:
Consider why the averaged Green function is diagonal in momentum space.) To keep the
discussion simple, we shall ignore dynamical interactions (i.e. Coulomb interaction, etc.)
between the particles and assume that the non-disordered part of the action $S[\psi^a,\bar\psi^a] =
\sum_p \bar\psi^a_p\big(-i\omega_n + \frac{\mathbf p^2}{2m} - E_F\big)\psi^a_p$ describes only free fermions.

In the remainder of this section, our objective is to explore the impact of the disorder-generated
interaction on the behavior of the Green function by means of diagrammatic
perturbation theory. As mentioned above, since the disorder problem presents a useful arena
in which to practice the methods developed in the text, we will outline the program as a sequence of
assignments, while the detailed solution is given below. Following the general arguments of
Section 5.3, the principal object of interest is the impurity-generated self-energy operator
$\Sigma$. Let us prepare the analysis of this object by introducing a bit of diagrammatic notation.
We depict the impurity scattering vertex defining the action (6.49) as in Fig. 6.13(a). 60
As usual, setting $p = (\omega_n, \mathbf p)$, the free-particle Green function $G_{0,p} = (i\omega_n - \xi_{\mathbf p})^{-1}$ will be
denoted by a fine (directed) line. Using this notation, and following the rules of diagrammatic
perturbation theory developed in Chapter 5:

59 Representing the transition amplitude for the creation of a particle at point $\mathbf x$ followed by an annihilation
process at $\mathbf y$, this function generalizes the single-particle transition amplitude $U$ to the presence of a continuum
of particles, while all that has been said above about the disorder-generated attenuation remains valid.

Figure 6.13 (a) Impurity scattering vertex, (b) the first-order self-energy diagrams, (c) the second-order
self-energy diagrams, and (d) SCBA self-energy. With the last, the bold line represents the
full Green function while the diagram states that the self-energy is computed neglecting all crossed
lines (cf. the discussion in Section 5.3).

Q1: Consider the self-energy $\Sigma^{(1)}$ at first order in the impurity scattering (Fig. 6.13(b)).
Show that the “Hartree-type” diagram (right) does not contribute (in the replica limit!).
Compute the real and imaginary parts of the “Fock” (left) contribution to the self-energy.
Show that, in dimensions $d > 2$, ${\rm Re}\,\Sigma$ actually diverges. Convince yourself (both formally
and heuristically) that this divergence is an artefact of our modeling of the impurity potential
by a $\delta$-correlated function (see Eq. (6.44) and the related discussion). Consider what
could be the reason for the real part of the self-energy not playing a very important role.

Q2: Turning to the second-order contribution $\Sigma^{(2)}$ (see Fig. 6.13(c)), convince yourself that,
in dimensions $d > 1$, the diagram with crossed impurity lines is parametrically smaller than
the second contribution. (What is the small parameter of the expansion?)

Q3: This motivates the computation of the self-energy in the self-consistent Born approximation
(SCBA) (see Fig. 6.13(d)). Show that the SCBA equation can be approximately
solved by the ansatz ${\rm Im}\,\Sigma(\omega_n) = -{\rm sgn}(\omega_n)/2\tau$, where $\tau$ is a constant. Once again, identify
the small parameter of the expansion.

Q4: Put together, one thus obtains the impurity-averaged Green function $\langle G_p\rangle_{\rm dis} = (i\omega_n -
\xi_{\mathbf p} + i\,{\rm sgn}(\omega_n)/2\tau)^{-1}$. Fourier transforming this result back to real time/space, justify the
identification of the self-energy with (one half of) the inverse scattering time.


Q5: Why is the replica method exact in perturbation theory?

60 In the diagrammatic literature, individual scattering events off the impurity potential $V(\mathbf r)$ are traditionally
depicted by dashed lines (see Fig. 6.12, inset). Once we average a product of such vertices,
$V(\mathbf r_1)V(\mathbf r_2)\cdots V(\mathbf r_{2n})$, over the Gaussian disorder distribution (6.44), the coordinates $\{\mathbf r_i\}$ become pairwise
identified in all possible combinations. This is usually indicated by connecting the corresponding vertex lines,
i.e. a diagram with $2n$ dangling bonds becomes a sum of diagrams, each containing $n$ interaction lines. In our
pre-averaged replica field theory, we are using the corresponding interaction vertex from the outset.


A1: Unlike the Fock diagram, where all replica indices are locked to the index of the incoming
Green functions, the Hartree diagram contains one free replica summation. This summation
yields an excess factor $R$ that, in the limit $R \to 0$, vanishes. For the same reason, all
diagrams with closed fermion loops (loops connected to the external field amplitudes only 
by impurity lines or not at all) do not contribute to the expansion. Technically, the excluded 
contributions represent vacuum diagrams, 61 i.e. on the level of perturbation theory, the 
only 62 effect of the replica limit is the elimination of all vacuum processes. We have thus 
shown that the replica theory exactly simulates the effect of the normalizing partition 
function present in the denominator of the unreplicated theory (cf. the discussion of the 
linked cluster theorem in Section 5.1). This proves - all on the level of perturbation theory - 
the equivalence of the representations. 

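The representation of the disorder-generated interaction Eq. (6.49) in momentum space
emphasizes the fact that the impurity scattering exchanges arbitrary momentum, but not
frequency. A straightforward Wick contraction along the lines of our discussion in Section 5.3
then yields the first-order contribution

$$\Sigma^{(1)}_{\mathbf p,n} = \frac{\gamma^2}{L^d}\sum_{\mathbf p'} G_{0,\mathbf p',n} = \gamma^2\int d\epsilon\,\frac{\nu(\epsilon)}{i\omega_n + E_F - \epsilon} \simeq \gamma^2\, P\!\!\int d\epsilon\,\frac{\nu(\epsilon)}{E_F - \epsilon} - i\pi\gamma^2\nu\,{\rm sgn}(\omega_n),$$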

where $P\!\int$ stands for the principal value integral. For $d > 2$, the increase of the DoS $\nu(\epsilon)$ as
a function of $\epsilon$ renders the real part of the self-energy formally divergent. This divergence is
an immediate consequence of the unbounded summation over $\mathbf p'$ - which is an artefact of the
model. 63 In any case, the real part of the self-energy is not of prime interest to us: all that
${\rm Re}\,\Sigma^{(1)}_{\mathbf p,n} = {\rm const.}$ describes is a frequency- and momentum-independent shift of the energy.
This shift can be absorbed into a redefinition of the chemical potential and will not cause
any observable effects. By contrast, the imaginary part ${\rm Im}\,\Sigma^{(1)}_{\mathbf p,n} = -\pi\gamma^2\nu\,{\rm sgn}(\omega_n)$ describes
the attenuation of the quasi-particle amplitude due to impurity scattering, a mechanism of
great physical significance.


61 To understand this assertion, consider the non-replicated theory prior to the impurity average. Due to the
absence of “real” interactions, any closed fermion loop appearing in the expansion must be a vacuum diagram.
After taking the impurity average, the loop may become connected to the external amplitudes by an impurity
line. However, it remains a vacuum loop and must cancel against the expansion of the normalization denominator,
if we were crazy enough to formulate the numerator-denominator expansion of the theory explicitly.

62 In a connected diagram, all replica indices are locked to the index $a$ of the external field vertices. We thus
obtain (symbolic notation) $\langle G\rangle_{\rm dis} \sim \lim_{R\to 0}\frac{1}{R}\sum_a\langle\bar\psi^a\psi^a\rangle \propto \lim_{R\to 0}\frac{R}{R}\times{\rm const.} = {\rm const.}$, where the factor
of $R$ results from the summation over $a$ and we have used the fact that the correlation function $\langle\bar\psi^a\psi^a\rangle$ is
independent of $a$ (replica symmetry).

63 In reality, the summation will be finite because either (a) there is an underlying lattice structure (i.e. the $\mathbf p'$-summation
is limited to the Brillouin zone) or (b) the kernel $K(\mathbf r)$ describing the profile of the impurity potential
varies on scales large in comparison with the Fermi wavelength. In this latter case, its Fourier transform has
to be added to the definition of the scattering vertex above. The presence of this function then limits the
$\mathbf p'$-summation to values $|\mathbf p' - \mathbf p| \lesssim \xi^{-1} \ll p_F$.

A2: The analysis of the second-order contribution $\Sigma^{(2)}$ parallels our discussion of the RPA in
Chapter 5: the Green functions $G_p$ are sharply peaked around the Fermi surface $|\mathbf p| = p_F$.
(Since the Matsubara index $n$ in $p = (\mathbf p, n)$ is conserved in impurity scattering, we will
not always write it out explicitly.) Representing the diagram with non-crossing lines in
momentum space, one obtains $\Sigma^{(2)}_{\rm n.c.} \sim \sum_{\mathbf p_1,\mathbf p_2}[G^{(0)}_{\mathbf p_1}]^2 G^{(0)}_{\mathbf p_2}$, restricting both momenta to the
Fermi surface, i.e. $|\mathbf p_1|, |\mathbf p_2| \simeq p_F$. By contrast, the contribution with crossed lines takes
the form $\Sigma^{(2)}_{\rm c.} \sim \sum_{\mathbf p_1,\mathbf p_2} G^{(0)}_{\mathbf p_1} G^{(0)}_{\mathbf p_2} G^{(0)}_{\mathbf p_2+\mathbf p-\mathbf p_1}$. Since all three momentum arguments have to
be tuned to values close to $p_F$, only one summation runs freely over the Fermi surface. To
estimate the relative weight of the two contributions, we need to know the width of the
“shell” centered around the Fermi surface in which the Green functions assume sizeable
values. Since $|G|^2 = [(E_F - \mathbf p^2/2m)^2 + ({\rm Im}\,G^{-1})^2]^{-1}$, the width of the Lorentzian profile is set
by ${\rm Im}\,G^{-1}$. As long as we are working with the bare Green function, ${\rm Im}\,[G^{(0)}]^{-1} = \omega_n \propto T$
is proportional to the temperature. However, a more physical approach is to anticipate
that impurity scattering will broaden the width ${\rm Im}\,[G^{(0)}]^{-1} \sim \tau^{-1}$ to a constant value (to
be identified momentarily as the inverse scattering time). Then, the relative weight of the
two diagrams can be estimated as $p_F^{2(d-1)}/(p_F (v_F\tau)^{-1})^{d-1} = (p_F\ell)^{d-1}$, where $\ell = v_F\tau$ is
the elastic mean-free path, and the numerator and denominator estimate the volume in
momentum space accessible to the $\mathbf p_{1,2}$ summations in the two diagrams. The important
message to be taken away from this discussion is that, for weak disorder $p_F \gg \ell^{-1}$ (which
we have assumed throughout), and in dimensions $d > 1$, scattering processes with crossed
impurity lines are negligible. Under these conditions, we are entitled to evaluate the self-energy
(and, for that matter, all other observables) within the self-consistent Born, or
non-crossing, approximation, i.e. an approximation that neglects processes with crossed
“interaction” lines.


A3: Drawing on the analogous discussion in Section 5.3, the SCBA for the self-energy is 
given by the diagram shown in Fig. 6.13(d). The corresponding analytical expression takes 
the form (cf. Eq. (5.40)) 




$$\Sigma_{\mathbf p,n} = \gamma^2 L^{-d}\sum_{\mathbf p'}\frac{1}{i\omega_n + E_F - \mathbf p'^2/2m - \Sigma_{\mathbf p',n}}. \tag{6.52}$$


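Guided by our results obtained in the first order of perturbation theory, we may seek a
solution of (the imaginary part of) this equation by the ansatz ${\rm Im}\,\Sigma_{\mathbf p,n} = -\frac{1}{2\tau}{\rm sgn}(\omega_n)$.
Substitution of this expression into the SCBA equation gives

$$-\frac{1}{2\tau}{\rm sgn}(\omega_n) \simeq \gamma^2\,{\rm Im}\int d\epsilon\,\frac{\nu(\epsilon)}{i\omega_n + E_F - \epsilon + \frac{i}{2\tau}{\rm sgn}(\omega_n)} \simeq -\pi\gamma^2\nu\,{\rm sgn}(\omega_n),$$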


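where we have assumed that $E_F\tau \gg 1$. We have thus arrived at the identification of $\tau^{-1} =
2\pi\nu\gamma^2$ as the elastic scattering rate. (In the literature, the potential strength $\gamma^2 = 1/(2\pi\nu\tau)$
is usually defined through the scattering time, i.e. the parameter $\gamma^2$ is expressed through $\tau$
from the outset.) Summarizing, we have obtained the important result

$$\langle G_{\mathbf p,n}\rangle_{\rm dis} = \frac{1}{i\omega_n + E_F - \frac{\mathbf p^2}{2m} + \frac{i}{2\tau}{\rm sgn}(\omega_n)}. \tag{6.53}$$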
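The self-consistency of the ansatz can also be checked numerically. The sketch below iterates the SCBA equation for a simplified model that is an assumption of this example, not part of the text: a constant density of states $\nu$ in a band $|\xi| < W$ around the Fermi energy, for which the momentum sum reduces to $\gamma^2\nu\int_{-W}^{W} d\xi\,(z - \xi)^{-1} = \gamma^2\nu[\ln(z+W) - \ln(z-W)]$ with $z = i\omega_n - \Sigma$. The function and parameter names are hypothetical.

```python
import cmath
import math


def scba_self_energy(omega, gamma2, nu, half_bandwidth, n_iter=200):
    """Fixed-point iteration of Sigma = gamma^2 nu * int dxi / (i omega - xi - Sigma).

    Assumes a flat density of states nu for |xi| < half_bandwidth, so that the
    energy integral is known in closed form as a difference of logarithms.
    """
    sigma = 0j
    for _ in range(n_iter):
        z = 1j * omega - sigma
        sigma = gamma2 * nu * (cmath.log(z + half_bandwidth) - cmath.log(z - half_bandwidth))
    return sigma


gamma2, nu, half_bandwidth = 0.05, 1.0, 50.0  # illustrative values, W much larger than 1/tau
inverse_two_tau = math.pi * gamma2 * nu       # expected |Im Sigma| = 1/(2 tau)

for omega in (0.01, -0.01):
    sigma = scba_self_energy(omega, gamma2, nu, half_bandwidth)
    print(f"omega = {omega:+.2f}  Im Sigma = {sigma.imag:+.5f}  -sgn(omega)/(2 tau) = "
          f"{-math.copysign(inverse_two_tau, omega):+.5f}")
```

In this symmetric band the real part of the self-energy vanishes, and the imaginary part agrees with $-{\rm sgn}(\omega_n)/2\tau$, $\tau^{-1} = 2\pi\nu\gamma^2$, up to corrections controlled by the ratio of $1/\tau$ to the bandwidth.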


A4: To compute the inverse Fourier transform of Eq. (6.50) one may trade the frequency
summation for a complex contour integral. Taking account of the fact that the Green
function has a cut along the entire real frequency axis (the non-analyticity of ${\rm sgn}(\omega_n) \to
{\rm sgn}({\rm Im}\,z)$ at ${\rm Im}\,z = 0$), we choose the integration contour as shown in the figure. This leads
us to the representation

$$\langle G(\mathbf x,\mathbf y;\tau)\rangle_{\rm dis} = \frac{1}{L^d}\sum_{\mathbf p}\int\frac{d\epsilon}{\pi}\, e^{-\epsilon\tau}\,(1 - n_F(\epsilon))\, e^{i\mathbf p\cdot(\mathbf x-\mathbf y)}\;{\rm Im}\,\frac{1}{E_F + \epsilon - \frac{\mathbf p^2}{2m} + \frac{i}{2\tau}}.$$

Here, we are using $1 - n_F$ instead of $n_F$ as a pole function because
$e^{-z\tau}(1 - n_F(z))$ is finite for large $|z|$, whereas $e^{-z\tau}n_F(z)$ is not. It is
now a straightforward matter to compute this expression by first (a)
integrating out the angular degrees of freedom of $\mathbf p$ followed by (b) a
contour integration over $p$ (cf. the analogous calculation of the clean
Green function in the exercise at the end of Section 5.1). However, as
we require only the asymptotic effect of the damping, we may follow
a more efficient route: the essential effect of the damping term is to shift the poles of
the $p$-integration from the real axis into the complex plane. Equating the Green function
denominator to zero and neglecting the parameter $\epsilon$ ($\epsilon \ll E_F$ as long as the time parameter
$\tau \gg E_F^{-1}$ is not extremely small), we find that the poles are located at $p = p_F \pm i/(2\ell)$. The
essential effect of this shift is that the exponentials $\exp(ip|\mathbf x - \mathbf y|)$ have to be evaluated at the
residues $\exp(ip_F|\mathbf x - \mathbf y| - |\mathbf x - \mathbf y|/(2\ell))$. Consequently, the disorder averaged Green function
is related to the Green function of the clean system $G_{\rm cl}$ by the exponential damping factor
which we had surmised above,

$$\langle G(\mathbf x,\mathbf y;\tau)\rangle_{\rm dis} = G_{\rm cl}(\mathbf x,\mathbf y;\tau)\, e^{-|\mathbf x-\mathbf y|/2\ell}.$$
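The location of the shifted pole, and the damping factor it produces, can be verified with a few lines of arithmetic. The sketch below solves $E_F - p^2/2m + i/2\tau = 0$ exactly and compares the root with the approximation $p_F + i/(2\ell)$, $\ell = v_F\tau$. The parameter values (units with $E_F = m = 1$ and $E_F\tau = 50$) and the function name are assumptions chosen for illustration.

```python
import cmath
import math


def green_function_pole(e_fermi, mass, tau):
    """Upper-half-plane root p of E_F - p^2/(2m) + i/(2 tau) = 0."""
    return cmath.sqrt(2.0 * mass * (e_fermi + 0.5j / tau))


e_fermi, mass, tau = 1.0, 1.0, 50.0       # illustrative units with E_F * tau = 50
p_fermi = math.sqrt(2.0 * mass * e_fermi)
mean_free_path = (p_fermi / mass) * tau   # l = v_F * tau

pole = green_function_pole(e_fermi, mass, tau)
approx = p_fermi + 0.5j / mean_free_path  # p_F + i/(2 l)

# Modulus of exp(i p |x - y|) at the pole, for a separation of three mean free paths.
distance = 3.0 * mean_free_path
damping = abs(cmath.exp(1j * pole * distance))

print("exact pole      :", pole)
print("p_F + i/(2l)    :", approx)
print("damping factor  :", damping, "vs exp(-|x-y|/(2l)) =", math.exp(-distance / (2.0 * mean_free_path)))
```

The deviation between the exact root and $p_F + i/(2\ell)$ is of relative order $(E_F\tau)^{-2}$, consistent with the assumption $E_F\tau \gg 1$ made in A3.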

A5: We refer to the discussion in A1.



This completes our discussion of the impurity-averaged single-particle Green function
$\langle G(\mathbf x,\mathbf y;\tau)\rangle_{\rm dis}$. The latter has been shown to be short-range on the scale of the elastic mean
free path $\ell$. In the following section, we shall see that, by contrast, the impurity-averaged
two-particle Green function acquires long-range correlations which encode the modes of 
density relaxation and provide a means to explore mechanisms of quantum interference 
which characterize the mesoscopic regime. 


Diffusion 

How do local fluctuations in the electron density $\delta\hat\rho(\tau,\mathbf r) = \hat\rho(\tau,\mathbf r) - \langle\hat\rho(\tau,\mathbf r)\rangle$ relax in
a disordered environment? 64 In a classical disordered system, this relaxation mechanism
would be diffusive. Below, we shall see that the same characteristic behavior survives in the
quantum picture. Formally, this is achieved by exploring the correlation function $D(\tau,\mathbf r) =
\langle\langle\delta\hat\rho(\tau,\mathbf r)\,\delta\hat\rho(0,\mathbf 0)\rangle\rangle_{\rm dis}$, i.e. a function that describes how $\delta\hat\rho(\tau,\mathbf r)$ changes in response to a
density fluctuation at $(\tau = 0, \mathbf r = \mathbf 0)$.

64 Notice that the brackets $\langle\cdots\rangle$ represent the quantum average, and not yet the disorder average.

EXERCISE In a real time formulation, how is the correlation function D related to the quantum 
transmission probability density? 

INFO The diffusive relaxation process described by the function $D$ extends over distances
much larger than the decay length of the average single-particle Green function, $\ell$. To identify
the mechanism that underlies the stability of the four-point function, think of $D(\tau,\mathbf r) =
\langle\langle\bar\psi(\tau,\mathbf r)\psi(0,\mathbf 0)\rangle_\psi\langle\psi(\tau,\mathbf r)\bar\psi(0,\mathbf 0)\rangle_\psi\rangle_{\rm dis}$ as the product of the amplitudes for (a) the annihilation of a
particle at space-time coordinate $(0,\mathbf 0)$ followed by particle creation at $(\tau,\mathbf r)$ and (b) the creation
of a particle at $(0,\mathbf 0)$ followed by its annihilation at $(\tau,\mathbf r)$. 65 The annihilation process initiating
(a) may be interpreted alternatively as the creation of a hole, i.e. the composite process $D$
describes the joint propagation of a particle and a hole amplitude through the medium (which
explains why $D$ is sometimes called the particle-hole propagator).

Now, against this background, let us temporarily switch to a real time description $\tau \to it$
and draw on the intuition afforded by the consideration of Feynman amplitudes. Specifically,
let us consider the particle propagator as the sum of all Feynman paths $\alpha$ connecting $\mathbf 0$ and $\mathbf r$
(see Fig. 6.14). Similarly, the hole amplitude corresponds to a path-sum over all paths where
each path $\beta$ is weighted by the negative of its classical action. 66 We thus have the symbolic
representation

$$D \sim \int d\epsilon\, d\epsilon' \sum_{\alpha\beta} A_\alpha A^*_\beta \exp\Big[\frac{i}{\hbar}\big(S_\alpha(\epsilon) - S_\beta(\epsilon')\big)\Big],$$

where the notation emphasizes that the electron and hole have different energies $\epsilon, \epsilon'$. As with
the single-particle propagator, the strong sensitivity of the actions $S_{\alpha,\beta}$ on the impurity potential
implies that generic contributions $(\alpha,\beta)$ to the path double sum vanish upon impurity averaging.
However, the “diagonal” contribution $(\alpha,\alpha)$ to the sum, $D_{\rm diag} \sim \sum_\alpha |A_\alpha|^2 \exp\big(\frac{i}{\hbar}(S_\alpha(\epsilon) - S_\alpha(\epsilon'))\big)$,
depends only weakly on the impurity potential, and will likely survive the averaging procedure.
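The survival of the diagonal terms under averaging can be illustrated with a toy path sum. In the sketch below, which is a caricature rather than a model of actual Feynman paths, each of a handful of "paths" carries a fixed real amplitude and a disorder-dependent phase drawn uniformly from $[0, 2\pi)$; the amplitudes, sample count, and function names are hypothetical choices for this example. The averaged single sum (the analog of $\langle U\rangle_{\rm dis}$) averages to nearly zero, while the averaged double sum converges to the diagonal value $\sum_\alpha |A_\alpha|^2$.

```python
import cmath
import math
import random


def path_sums(amplitudes, phases):
    """Return (single path sum, double path sum) for one disorder realization."""
    single = sum(a * cmath.exp(1j * phi) for a, phi in zip(amplitudes, phases))
    # |single|^2 equals the double sum over (alpha, beta) of
    # A_alpha A_beta^* exp(i (phi_alpha - phi_beta)).
    return single, abs(single) ** 2


def disorder_average(amplitudes, n_samples, rng):
    """Average both sums over n_samples realizations of random phases."""
    mean_single = 0j
    mean_double = 0.0
    for _ in range(n_samples):
        phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in amplitudes]
        single, double = path_sums(amplitudes, phases)
        mean_single += single / n_samples
        mean_double += double / n_samples
    return mean_single, mean_double


rng = random.Random(0)                     # fixed seed for reproducibility
amplitudes = [0.5, 0.3, 0.8, 0.1, 0.6]     # hypothetical path amplitudes A_alpha
diagonal = sum(a * a for a in amplitudes)  # sum over alpha of |A_alpha|^2

mean_single, mean_double = disorder_average(amplitudes, 20000, rng)
print("averaged single sum :", abs(mean_single))
print("averaged double sum :", mean_double, " diagonal value:", diagonal)
```

In this toy model the phases of different paths are fully independent, so only the strictly diagonal pairs survive; contributions from distinct but locally paired paths are not captured by it.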

A glance at Fig. 6.14 (b) suggests that the diagonal contribution to D is but an elaborate 
quantum description of classical diffusion: two quantum amplitudes locally tied together, and 
propagating in a stochastic environment, should be capable of interpretation as a classically 
diffusive probability density. Indeed, this interpretation is corroborated by the fact that, upon 
approaching the classical limit $\hbar \to 0$, the dominance of the diagonal approximation becomes
more pronounced. 

However, notice that, beyond the straightforward diagonal configuration, there are other contributions
to the double path sum that are likely to be impervious to configurational averaging.
For example, the two paths depicted in Fig. 6.14 (c) are globally different but locally paired. The
former attribute tells us that we are dealing with a fundamentally non-classical contribution to
the correlation function, and the latter that the action difference $S_\alpha - S_\beta$ will be small. Indeed,
we shall see that processes like the one shown here lie at the root of the observed quantum
phenomena that characterize the physics of disordered media.

65 To understand the positioning of the quantum-thermal averaging brackets, consider the definition of $D$ in terms
of the operators $\delta\hat\rho$, and take into account the fact that, of the two possible Wick contractions, one gets canceled
due to the subtraction of $\langle\hat\rho\rangle$.

66 Heuristically, the inversion of the sign of the action accounts for the fact that a hole is a “missing” electron.
More formally, it follows from the fact that the hole amplitude is obtained by complex conjugation of the
particle amplitude.

Figure 6.14 (a) A generic pair of Feynman paths contributing to the density-density correlation
function prior to averaging, (b) Particle and hole propagating along the same path in configuration
space - the “diffuson.” (c) A non-classical contribution to the path sum.

Below, we apply concepts very similar to those developed in Section 5.3 to understand the 
spatial long-range character of the four-point function. Further, we wish to elucidate the 
diffusive character of this correlation function. 

Q1: Before turning to explicit computations, let us derive two exact relations obeyed by
the Fourier transform $D_{\mathbf q,\omega_m}$: Show that $\lim_{\mathbf q\to 0} D_{\mathbf q,0} = L^{-d}\partial_\mu N$ and $D_{\mathbf 0,\omega_m} = 0$, where
$N = \langle\hat N\rangle$ denotes the number of particles in the system and, as usual, $\mu \simeq E_F$ represents
the chemical potential. Explain the origin of these two sum rules.

Q2: Represent the four-point function in a momentum-frequency form similar in structure
to that of the correlation function $C^{(4)}$ introduced in Eq. (5.41). (Hint: It is convenient to
represent the correlation function as $D(x) = \frac{\delta^2}{\delta\mu(x)\,\delta\mu(0)}\langle\ln Z\rangle_{\rm dis}\big|_{\mu(x)=\mu}$, where $x = (\tau,\mathbf r)$, and
$\mu(x) = \mu(\tau,\mathbf r)$ is the generalization of the chemical potential to a space-time-dependent
source field.)

Q3: To compute the two-particle correlation function, one may apply concepts similar to
those introduced in Section 5.3. In doing so, we will benefit from two major simplifications.
Assuming that $(p_F\ell)^{-1} \ll 1$, we will work to leading order in this parameter ($p_F\ell \gg 1$
plays a role similar to that of the large parameter $N$ in Section 5.3). Secondly, we may
make use of the fact that the momentum $\mathbf q$ Fourier conjugate to the argument $|\mathbf r| \gg \ell$ is
much smaller than the Fermi momentum. Show that, under these conditions, the irreducible
vertex, $\Gamma_{0,\mathbf q,p,p'} = (2\pi\nu\tau)^{-1}\delta_{\omega_n,\omega_{n'}}$, collapses to (the Fourier representation of) a single
impurity vertex. (Since all field amplitudes that contribute in the replica limit carry the
same replica index $a$, one may drop the replica structure from the notation.) Write down
the Bethe-Salpeter equation for the full vertex.

Q4: Solve the Bethe-Salpeter equation to leading order in $(p_F\ell)^{-1}$ (note $|\mathbf q|\ell \ll 1$). (Hint:
Notice that the two cases $\omega_n\cdot\omega_{n+m} < 0$ and $\omega_n\cdot\omega_{n+m} > 0$ behave in radically different
manners.) Use your result for the irreducible vertex to compute the density-density
correlation function.

Figure 6.15 Maximally crossed contribution to the irreducible vertex. Second line: the corresponding
Bethe-Salpeter equation. In all diagrams, the upper (lower) Green functions carry frequency
$\omega_{n+m}$ ($\omega_n$).

Q5: Referring to Fig. 6.14(c), the diagram suggests that a particle-hole pair propagating 
along the same path in a disordered medium may split up, propagate along a closed loop 
in opposite directions, and later recombine. Provided that the action for traversal of a 
scattering path in the medium does not depend on the direction of propagation, one may 
expect the classical action difference of the composite process to be small. 67 How would these
processes - known as cooperon-modes, or just cooperons 68 in the literature - manifest
themselves in the present formalism? The cooperon describes a process wherein the two
constituent partners propagate with near opposite momenta. Indeed, it turns out that,
for $\mathbf p_1 \simeq -\mathbf p_2$, the irreducible vertex contains a second contribution (besides the isolated
impurity line) of more complex structure: a sum over diagrams with “maximally crossed”
impurity lines (see Fig. 6.15). This diagram, too, contains one Fermi sphere summation
per impurity line. The most economic way to see this is to imagine the lower of the two
fermion propagators twisted by 180° (the second diagram in the figure). Superficially, it now
resembles the previously explored vertex. The important difference, however, is that the
arrows marking the fermion propagators now point in the same direction, i.e. as compared
with the particle-hole mode discussed above, the relative sense of traversal of the impurity
sequence has become reversed.

Compute the cooperon contribution to the irreducible vertex. Discuss what would happen 
if the system was exposed to an external magnetic field. In spite of its structural similarity 
to the “diffuson mode” identified in Q4, why does the insertion of a single cooperon into 
the diffusive correlation function not give rise to a large correction? 


67 Formally, the directional invariance of the action requires that the system is invariant under time-reversal.
Time-reversal symmetry would be lost, for example, if the system were subject to an external magnetic field.
In this case, the classical probability for propagation along a path depends on the sense of traversal.

68 The origin of the terminology “cooperon” is that these modes describe the dynamics of a two-particle process
where the two constituents propagate in opposite directions (in analogy to the opposite momenta of the two
electrons of a Cooper pair).
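A1: The functional expectation value of the particle number is given by $\langle\hat N\rangle =
\int d\tau\, d^dr\, \langle\hat\rho(\mathbf r,\tau)\rangle_\psi = \int d^dr\, \langle\hat\rho_{\omega_m=0}(\mathbf r)\rangle_\psi$. Differentiating this expression with respect to $\mu$,
and noting that the chemical potential couples to the action through $\mu\int d^dr\int d\tau\,\hat\rho(\mathbf r,\tau)$,
one obtains 69

$$\partial_\mu\langle\hat N\rangle = \int d^dr\, d^dr'\,\big[\langle\hat\rho_0(\mathbf r)\hat\rho_0(\mathbf r')\rangle_\psi - \langle\hat\rho_0(\mathbf r)\rangle_\psi\langle\hat\rho_0(\mathbf r')\rangle_\psi\big]
= \int d^dr\, d^dr'\, D_{\omega=0}(\mathbf r - \mathbf r') = L^d\lim_{\mathbf q\to 0} D_{\mathbf q,0}.$$

Particle number conservation demands that $\int d^dr\,\langle\delta\hat\rho(\mathbf r,\tau)\rangle_\psi = 0$ at all times. Consequently,
$\int d^dr\, D(\mathbf r,\tau) = 0$ or, equivalently, $D_{\mathbf q=0,\omega_m} = 0$.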

A2: It is straightforward to verify that the two-fold $\mu$-differentiation suggested above
yields the correlation function $D$. Now, let us employ the replica formulation
$\langle\ln Z\rangle_{\rm dis} = \lim_{R\to 0}\frac{1}{R}\langle Z^R - 1\rangle_{\rm dis}$. Differentiating the right-hand side of this equation, one obtains

$$D(x) = \lim_{R\to 0}\frac{1}{R}\,\big\langle\bar\psi^a(x)\psi^a(x)\,\bar\psi^b(0)\psi^b(0)\big\rangle_\psi .$$

To avoid the vanishing of this expression in the limit $R\to 0$, we need to connect the operators
$\bar\psi^a$ and $\psi^b$ (and $\bar\psi^b$ and $\psi^a$) by fermion lines (thus enforcing $a = b$ - otherwise the two-fold
summation over $a$ and $b$ would produce an excessive factor $R$ which would result in the
vanishing of the expression in the replica limit). We thus obtain a structure similar to that
discussed in Section 5.3 (cf. Fig. 6.16): two propagators connecting the points $x$ and $0$,
where the role of the wavy interaction line of Section 5.3 is now played by the “interaction”
generated by the impurity correlator $\langle VV\rangle$.

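As in Section 5.3, it will be convenient to formulate the diagrammatic analysis of
the correlation function in momentum space. We thus substitute the Fourier decomposition
$\psi^a(x) = (T/L^d)^{1/2}\sum_p e^{ip\cdot x}\psi^a_p$ into the definition of $D$, make use of the fact that
$\langle\bar\psi_{p_1}\psi_{p_1+q}\bar\psi_{p_2+q'}\psi_{p_2}\rangle \propto \delta_{qq'}$ (momentum conservation in the averaged theory), and obtain
$D(x) = \frac{T}{L^d}\sum_q e^{iq\cdot x}D_q$, where $D_q = \frac{T}{L^d}\lim_{R\to 0}\frac{1}{R}\sum_{p_1p_2}\langle\bar\psi^a_{p_1}\psi^a_{p_1+q}\bar\psi^b_{p_2+q}\psi^b_{p_2}\rangle_\psi$. The
impurity interaction is static, implying that the fermion frequency is conserved along each
propagator: $\omega_{n'} = \omega_n$, i.e.

$$D_q = \frac{T}{L^d}\lim_{R\to 0}\frac{1}{R}\sum_{\omega_n}\sum_{\mathbf{p}_1\mathbf{p}_2}\big\langle\bar\psi^a_{\mathbf{p}_1,\omega_n}\psi^a_{\mathbf{p}_1+\mathbf{q},\omega_{n+m}}\bar\psi^b_{\mathbf{p}_2+\mathbf{q},\omega_{n+m}}\psi^b_{\mathbf{p}_2,\omega_n}\big\rangle_\psi. \qquad (6.54)$$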

A3: Only diagrams that contain one free summation over the Fermi surface per impurity
line contribute to leading order in $(p_F\ell)^{-1}$. (Formally, this is because each impurity line
contributes a factor $\gamma^2 \sim (\nu\tau)^{-1}$ and the phase volume of a momentum summation over the
Fermi surface is needed to compensate the density of states factor $\nu$ in the denominator.)
As in our discussion of the self-energy, this implies that diagrams with crossed impurity
lines do not contribute to the irreducible vertex. (The only irreducible contribution without
crossings is the single impurity line.) Consequently, the diagrammatic expansion of the
correlation function assumes the form shown in Fig. 6.16. The Bethe-Salpeter equation for
the impurity vertex (shaded in the figure) reads

$$\Gamma_{\mathbf{p}_1,\mathbf{p}_2,q} = \frac{1}{2\pi\nu\tau L^d} + \frac{1}{2\pi\nu\tau L^d}\sum_{\mathbf{p}} G_{\mathbf{p}+\mathbf{q},n+m}\,G_{\mathbf{p},n}\,\Gamma_{\mathbf{p},\mathbf{p}_2,q}, \qquad (6.55)$$

where the $G$s denote the impurity-averaged single-particle Green functions evaluated in the
SCBA and discussed in the previous section.

⁶⁹ From the expressions above it is, in fact, not quite clear why we set first $\omega_m = 0$ and only then $\mathbf{q} = 0$. That this
is the correct order of limits can be seen by generalizing $\mu \to \mu(\mathbf{r})$ to a smoothly varying static field, evaluating
the corresponding functional derivatives $\delta/\delta\mu(\mathbf{r})$, and setting $\mu = \mathrm{const.}$ at the end of the calculation.

Figure 6.16 Diagrammatic expansion of the diffuson mode.

A4: The correlation function $D$ contains a summation over frequency. Assuming that $\omega_m > 0$,
let us organize the sum as $D = D^{++} + D^{+-} + D^{--}$, where $D^{++}$, $D^{+-}$, and $D^{--}$
denote, respectively, the contributions where $\{\omega_n,\omega_{n+m}\} > 0$, $\{\omega_{n+m} > 0, \omega_n < 0\}$, and
$\{\omega_n,\omega_{n+m}\} < 0$. We begin by considering the most interesting term $D^{+-}$. Introducing the
notation $G^\pm_{\mathbf{p}} = (E_F - \mathbf{p}^2/2m \pm i/2\tau)^{-1}$, one may expand the integrand appearing in the
Bethe-Salpeter equation to leading order in the small⁷⁰ energies $\omega_n$, $\omega_{n+m}$ and $\mathbf{q}\cdot\mathbf{p}/m$,
and obtain

$$G_{\mathbf{p}+\mathbf{q},n+m}\,G_{\mathbf{p},n} = G^+_{\mathbf{p}}G^-_{\mathbf{p}}\big(1 - iG^+_{\mathbf{p}}\omega_{n+m} - iG^-_{\mathbf{p}}\omega_n + (G^+_{\mathbf{p}}\,\mathbf{q}\cdot\mathbf{p}/m)^2 + \cdots\big), \qquad (6.56)$$

where a term linear in $\mathbf{q}$ has been omitted as it will vanish upon integration over the angular
coordinates of $\mathbf{p}$. We next make the assumption (to be checked self-consistently) that, for
small $q$, the vertex $\Gamma_{\mathbf{p},\mathbf{p}',q} = \Gamma_q$ does not depend on the “fast” momenta $\mathbf{p}$, $\mathbf{p}'$. The integration
over $\mathbf{p}$ - now decoupled from the vertex - can then be made with the help of two auxiliary
identities:⁷¹ $\int d^dp\, f(|\mathbf{p}|)(\mathbf{v}\cdot\mathbf{p})^2 = \frac{\mathbf{v}^2}{d}\int d^dp\, f(|\mathbf{p}|)\,\mathbf{p}^2$ and

$$\frac{1}{L^d}\sum_{\mathbf{p}}\big[G^+_{\mathbf{p}}\big]^{n_+}\big[G^-_{\mathbf{p}}\big]^{n_-} = 2\pi\, i^{\,n_- - n_+}\,\nu\,\frac{(n_+ + n_- - 2)!}{(n_+ - 1)!\,(n_- - 1)!}\,\tau^{\,n_+ + n_- - 1}.$$

The last identity is proven by converting the momentum sum into an integral, which is
then performed by contour integration (exercise). Substituting the expansion (6.56), and
employing the two auxiliary identities, one arrives at the relation

$$\Gamma_q = \frac{1}{2\pi\nu\tau L^d} + \left(1 - \tau\omega_m - \frac{\tau^2 v_F^2 q^2}{d}\right)\Gamma_q .$$

Solving this equation for $\Gamma_q$, we obtain our final result for the vertex,

$$\Gamma_q = \frac{1}{2\pi\nu\tau^2 L^d}\,\frac{1}{\omega_m + Dq^2},$$

where $D = v_F\ell/d$, with $\ell = v_F\tau$ the mean free path, defines the diffusion constant of a dirty metal.

⁷⁰ The inequality $\omega_n\omega_{n+m} < 0$ implies that $|n| < |m|$. Thus $|\omega_n|, |\omega_m| \ll \tau^{-1}$, where it is assumed that the
frequencies we are probing are much smaller than the scattering rate.

⁷¹ This can be checked by writing $\mathbf{v}\cdot\mathbf{p} = v_ip_i$ and using the symmetry of the integrand under the operations
$p_i \to -p_i$ and $p_i \leftrightarrow p_j$.
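The contour-integral identity and the resulting vertex equation can be spot-checked numerically. The following is only a minimal sketch under stated assumptions: the momentum sum is replaced by $\nu\int d\xi$ with a constant density of states, the integral is mapped to a finite interval by the substitution $\xi = \tan(\theta)/2\tau$, and the function names (`momentum_sum`, `closed_form`, `ladder_kernel`) are illustrative choices, not notation from the text.

```python
import math

def momentum_sum(n_plus, n_minus, tau, nu=1.0, n_grid=20000):
    """nu * integral d(xi) (G^+)^n_plus (G^-)^n_minus with G^± = 1/(-xi ± i/(2 tau)).

    The substitution xi = a*tan(theta), a = 1/(2 tau), maps the real axis onto
    (-pi/2, pi/2); the theta-integral is evaluated with the midpoint rule.
    """
    a = 1.0 / (2.0 * tau)
    h = math.pi / n_grid
    total = 0j
    for k in range(n_grid):
        theta = -0.5 * math.pi + (k + 0.5) * h
        xi = a * math.tan(theta)
        jacobian = a / math.cos(theta) ** 2  # d(xi)/d(theta)
        g_plus = 1.0 / complex(-xi, a)
        g_minus = 1.0 / complex(-xi, -a)
        total += g_plus ** n_plus * g_minus ** n_minus * jacobian
    return nu * total * h

def closed_form(n_plus, n_minus, tau, nu=1.0):
    """Right-hand side of the auxiliary identity (binomial form of the factorial ratio)."""
    binom = math.comb(n_plus + n_minus - 2, n_minus - 1)
    return (2.0 * math.pi * 1j ** (n_minus - n_plus) * nu * binom
            * tau ** (n_plus + n_minus - 1))

def ladder_kernel(omega_n, omega_m, q, tau, v_fermi, dim, nu=1.0):
    """(2 pi nu tau)^-1 * sum_p G_{p+q,n+m} G_{p,n}, using the expansion (6.56).

    The angular average replaces (q.p/m)^2 by q^2 v_F^2 / d on the Fermi surface.
    """
    s = (momentum_sum(1, 1, tau, nu)
         - 1j * (omega_n + omega_m) * momentum_sum(2, 1, tau, nu)
         - 1j * omega_n * momentum_sum(1, 2, tau, nu)
         + (q * v_fermi) ** 2 / dim * momentum_sum(3, 1, tau, nu))
    return s / (2.0 * math.pi * nu * tau)
```

In this sketch `ladder_kernel(omega_n, omega_m, q, tau, v_fermi, dim)` should reproduce the bracket $1 - \tau\omega_m - \tau^2v_F^2q^2/d$ multiplying $\Gamma_q$ in the vertex equation, independently of $\omega_n$.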

A few remarks about this result are in order. Firstly, one may note that the absence
of $(q,\omega_m)$-independent constants in the denominator results from a cancellation of two
terms in the vertex equation. (We have met with a similar cancellation in our discussion of
the vertex of the generalized $\phi^4$-theory in Chapter 5.) Thanks to this cancellation, $\Gamma(\mathbf{r},\tau)$
becomes a long-range object. Later, we will identify the absence of damping or mass-like
terms in the denominator as a consequence of a fundamental symmetry in the problem.
Secondly, one may note that $\Gamma$ is a solution of the diffusion equation
$(\partial_\tau - D\nabla^2)\Gamma(\tau,\mathbf{r}) = (2\pi\nu L^d\tau^2)^{-1}\delta^d(\mathbf{r})\delta(\tau)$ (exercise), describing the manner in which a
distribution initially centered at $\mathbf{r} = 0$ spreads out in time. Alluding to this analogy, the vertex $\Gamma$
has become known as the diffuson. However, more important than the terminology is the fact that we have
succeeded in making the connection between particle-hole propagation in a dirty medium
and classical diffusive processes quantitative.
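The statement that a diffusion kernel describes a distribution spreading out from a point can be illustrated with a small lattice experiment. This is a sketch under assumptions not made in the text: a one-dimensional lattice, a real (not imaginary) time variable, and an explicit finite-difference update; the helper names and parameter values are hypothetical. For this scheme the variance grows by exactly $2D\,\Delta t$ per step, the lattice counterpart of $\langle x^2\rangle = 2Dt$.

```python
def diffuse_step(rho, r):
    """One explicit update rho_j -> rho_j + r (rho_{j+1} - 2 rho_j + rho_{j-1}), periodic."""
    n = len(rho)
    return [rho[j] + r * (rho[(j + 1) % n] - 2.0 * rho[j] + rho[j - 1]) for j in range(n)]

def variance(rho, dx):
    """Second moment of the site weights about the central site."""
    centre = len(rho) // 2
    return sum(((j - centre) * dx) ** 2 * w for j, w in enumerate(rho))

def spread_of_point_source(D=0.7, dx=0.1, dt=0.005, n_sites=401, n_steps=200):
    """Return a list of (time, variance) pairs for a unit weight released at the centre."""
    r = D * dt / dx ** 2  # must stay below 1/2 for the explicit scheme to be stable
    rho = [0.0] * n_sites
    rho[n_sites // 2] = 1.0
    history = []
    for step in range(1, n_steps + 1):
        rho = diffuse_step(rho, r)
        history.append((step * dt, variance(rho, dx)))
    return history
```

Comparing each returned variance with $2Dt$ makes the linear-in-time spreading explicit; the total weight is conserved by construction, in line with the particle number conservation invoked in A1.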

To finalize our calculation of the four-point function we need only add external Green
functions to the vertex (see Fig. 6.16), add the first diagram (the empty bubble) shown
in the figure, and integrate over the momenta $\mathbf{p}_1$, $\mathbf{p}_2$. Expanding the corresponding Green
functions to zeroth order in $\mathbf{q}$, $\omega_n$, $\omega_{n+m}$ (cf. Eq. (6.56)) and using the auxiliary identity
above, one obtains

$$D^{+-}_q = -\frac{T}{L^d}\sum_{-\omega_m<\omega_n<0}\left(2\pi\nu\tau L^d + (2\pi\nu\tau L^d)^2\,\frac{1}{2\pi\nu\tau^2L^d}\,\frac{1}{\omega_m + Dq^2}\right) \simeq -\frac{\nu\,\omega_m}{\omega_m + Dq^2},$$

where, in the first equality, the factors $2\pi\nu\tau L^d$ stem from the momentum integration over
pairs of Green functions, and the second equality holds under the presumed conditions
$\omega_m, Dq^2 \ll \tau^{-1}$. (The overall minus sign comes from the fact that we are computing a
closed fermion loop.)

To complete the analysis, we need to compute the expressions $D^{++}$ and $D^{--}$. The analysis of
these objects is fairly simple inasmuch as the vertex actually collapses to a single impurity
line. Since the two Green functions over which the sum in Eq. (6.55) is performed now have
imaginary parts of the same sign, the contour integration over $|\mathbf{p}|$ gives zero. Similarly, the
placement of the residual single-impurity line vertex into the second diagram of Fig. 6.16
leads to a vanishing contribution. Since the sum over $|\mathbf{p}_1|$, $|\mathbf{p}_2|$ extends over Green functions
with imaginary parts of the same sign, one would be tempted to say that, for the same
reasons, the empty bubble diagram ought to vanish. However, this is not the case: the tricky
point is that, with regard to the first diagram, one must keep an eye on the frequency summation.
Considering $D^{++}$, all other diagrams correspond to frequency sums $\sum_{\omega_n>0}F(\omega_n)$,
where $F(\omega_n) \sim n^{-k}$ for $n \gg 1$, with $k \ge 4$. However, the first diagram leads to an expression with only $k = 2$.
This means that, after the frequency summation, one obtains a $\mathbf{p}$-dependent expression that
does not converge quickly enough to qualify for a safe contour integration procedure.⁷² To
be specific:

$$D^{++}_q \simeq -\frac{T}{L^d}\sum_{\mathbf{p}}\sum_{\omega_n>0}\frac{1}{\big(i\omega_n - \xi_{\mathbf{p}} + \frac{i}{2\tau}\big)^2} = \frac{1}{2\pi iL^d}\sum_{\mathbf{p}}\int_{-\infty}^{\infty}d\epsilon\,\frac{n_F(\epsilon)}{\big(\epsilon - \xi_{\mathbf{p}} + \frac{i}{2\tau}\big)^2} \simeq -\frac{1}{2\pi iL^d}\sum_{\mathbf{p}}\frac{1}{-\xi_{\mathbf{p}} + \frac{i}{2\tau}},$$

where, as usual, $\xi_{\mathbf{p}} = \mathbf{p}^2/2m - E_F$ and the frequency summation has been performed by
integrating over the real axis and closing in the upper complex half plane, and in the last
equality we have approximated $n_F(\epsilon) \simeq \Theta(-\epsilon)$. Similarly, an analogous computation obtains
$D^{--} = (D^{++})^*$, i.e.

$$D^{++} + D^{--} = -\frac{1}{2\pi iL^d}\sum_{\mathbf{p}}\frac{1}{-\xi_{\mathbf{p}} + \frac{i}{2\tau}} + \mathrm{c.c.} = -\frac{\nu}{\pi}\int d\epsilon\;\mathrm{Im}\,\frac{1}{E_F - \epsilon + i/2\tau} = \nu.$$

Combining everything, we arrive at the final result,

$$D_q = D^{++} + D^{+-} + D^{--} = \nu\left(1 - \frac{|\omega_m|}{|\omega_m| + Dq^2}\right) = \frac{\nu Dq^2}{|\omega_m| + Dq^2}.$$

(Here we have used the fact that the result does not change under sign inversion of $\omega_m$.
Enthusiastic readers may wish to check this claim.) Summarizing, we have thus found that
the density-density correlation function assumes the form

$$D(\mathbf{r},\tau) = \frac{T}{L^d}\sum_{\mathbf{q},\omega_m} e^{i\mathbf{q}\cdot\mathbf{r} + i\omega_m\tau}\,\frac{\nu Dq^2}{|\omega_m| + Dq^2}.$$

This result makes the diffusive character of the density-density correlation function manifest.
Notice also that $D_q$ obeys the two limiting conditions discussed in (A1). (Indeed, for
a non-interacting fermion system, $\nu = L^{-d}\partial_\mu N$.)
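The two limiting conditions, and the quality of the approximation $\omega_m, Dq^2 \ll \tau^{-1}$, can be made explicit with a few lines of code. This is a minimal sketch: the function names are illustrative, units are chosen freely, and `d_plus_minus` encodes the frequency sum for $D^{+-}$ before the small-$\tau$ approximation, using that the sum over $-\omega_m < \omega_n < 0$ contains $\omega_m/2\pi T$ terms.

```python
def density_correlator(q, omega_m, nu=1.0, D=1.0):
    """D_q = nu D q^2 / (|omega_m| + D q^2); the point q = omega_m = 0 is left undefined."""
    dq2 = D * q * q
    return nu * dq2 / (abs(omega_m) + dq2)

def d_plus_minus(q, omega_m, tau, nu=1.0, D=1.0):
    """Ladder sum for omega_m > 0 before dropping the term of relative order omega_m * tau."""
    return -nu * omega_m * (tau + 1.0 / (omega_m + D * q * q))

def d_total(q, omega_m, tau, nu=1.0, D=1.0):
    """D^{++} + D^{--} = nu added to the unapproximated D^{+-}."""
    return nu + d_plus_minus(q, omega_m, tau, nu, D)

# Static limit first (omega_m = 0), then q -> 0: the thermodynamic density of states nu.
static_limit = density_correlator(1e-3, 0.0)
# Uniform limit (q = 0) at finite frequency: zero, as demanded by particle number conservation.
uniform_limit = density_correlator(0.0, 0.5)
```

The difference between `d_total` and `density_correlator` is $-\nu\omega_m\tau$, which is the correction neglected in the text.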

A5: Let us now consider the irreducible vertex in a sector in momentum space where the sum
of the upper incoming momentum, $\mathbf{p}_1$, and lower incoming momentum, $-\mathbf{p}_1 + \mathbf{q}$, is small.
(Momentum conservation then implies that the upper outgoing momentum, $\mathbf{p}_2$, and lower
outgoing momentum, $-\mathbf{p}_2 + \mathbf{q}$, also sum to the small offset $\mathbf{q}$.) Further, let us assume that
the Matsubara frequencies $\omega_n$ and $\omega_{n+m}$ carried by the upper and lower propagators are of
opposite signs. Now, imagine the lower line is turned around in such a way that the diagram
assumes the form of a ladder structure (still it remains “irreducible” because the notion of
irreducibility has been defined for fermion lines of opposite orientation). The corresponding
Bethe-Salpeter equation takes the form

$$\Gamma^C_q = \frac{1}{2\pi\nu\tau L^d} + \frac{1}{2\pi\nu\tau L^d}\sum_{\mathbf{p}} G_{\mathbf{p}+\mathbf{q},n+m}\,G_{-\mathbf{p},n}\,\Gamma^C_q,$$

where $\Gamma^C$ is the cooperon contribution to the irreducible vertex, and we have assumed that,
for small external momenta, $\Gamma^C$ is independent of the “fast” momenta. In the absence of
an external magnetic field, $G_{-\mathbf{p}} = G_{\mathbf{p}}$ is even, i.e. the Bethe-Salpeter equation coincides
exactly with Eq. (6.55) for the diffusion mode considered above. Consequently, we may infer
that

$$\Gamma^C_q = \frac{1}{2\pi\nu\tau^2L^d}\,\frac{1}{|\omega_m| + Dq^2}.$$

⁷² Indeed, the $|\mathbf{p}|$-summation only approximately extends over the entire real axis. We are thus well advised to
do the frequency summation (which has true semi-infinite support) prior to the momentum summation.

However, in the presence of a magnetic field, the inversion symmetry is lost. Let us, for
a moment, assume that the vector potential $\mathbf{A}$ representing the magnetic field depends
only weakly on its spatial coordinate. Neglecting commutators between the momenta and
the vector potential, and defining $G_{\mathbf{p}}(\mathbf{A}) = \big(i\omega_n + E_F - (\mathbf{p}+\mathbf{A})^2/2m + \frac{i}{2\tau}\mathrm{sgn}(\omega_n)\big)^{-1}$
($e = c = 1$), one may then shift the fast momentum to obtain
$G_{\mathbf{p}+\mathbf{q}}(\mathbf{A})G_{-\mathbf{p}}(\mathbf{A}) = G_{\mathbf{p}+\mathbf{q}}(\mathbf{A})G_{\mathbf{p}}(-\mathbf{A}) \to G_{\mathbf{p}+\mathbf{q}}(2\mathbf{A})G_{\mathbf{p}}(0) = G_{\mathbf{p}+\mathbf{q}+2\mathbf{A}}(0)G_{\mathbf{p}}(0)$.
In this way, one may see that the magnetic field can be absorbed into a redefinition of the
momentum difference $\mathbf{q}$. In the presence of an external magnetic field, the cooperon then acquires the form

$$\Gamma^C_{\hat{\mathbf{q}}} = \frac{1}{2\pi\nu\tau^2L^d}\,\frac{1}{|\omega_m| + D(\hat{\mathbf{q}} + 2\mathbf{A})^2}.$$

The notation here emphasizes that the simplified argument above generalizes to fields with
arbitrary (yet smooth on the scale of the microscopic mean free path) space dependence. In
that case, $\Gamma^C$ is defined as the solution of the equation
$\big(|\omega_m| + D(-i\nabla_{\mathbf{r}} + 2\mathbf{A})^2\big)\Gamma(\mathbf{r},\omega_m) = (2\pi\nu\tau^2L^d)^{-1}\delta^d(\mathbf{r})$
(a formal solution of which is given by the right-hand side of the definition
above). Crucially, the presence of the vector potential spoils the singularity of the cooperon
mode in the limit $q, \omega_m \to 0$. In other words, the magnetic field turns the cooperon into an
exponentially decaying mode, as expected from our qualitative discussion above.
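How the vector potential removes the singularity can be seen in a simple special case. The sketch below rests on assumptions that go beyond the text: a uniform field $B$ in two dimensions, the Landau gauge $\mathbf{A} = (0, Bx)$, units $e = c = \hbar = 1$, and a finite-difference grid. For a plane wave $e^{ik_yy}$ the operator $(-i\nabla + 2\mathbf{A})^2$ reduces to $-\partial_x^2 + (k_y + 2Bx)^2$, an oscillator with spectrum $4B(n + 1/2)$, so its lowest eigenvalue $2B$ acts as a mass $2DB$ in the cooperon denominator. The function names are illustrative.

```python
def count_below(diag, off, x):
    """Number of eigenvalues below x of a symmetric tridiagonal matrix
    (diagonal `diag`, constant off-diagonal `off`), via the Sturm sequence."""
    count = 0
    pivot = 1.0
    for i, d in enumerate(diag):
        pivot = d - x - (off * off / pivot if i > 0 else 0.0)
        if pivot == 0.0:
            pivot = 1e-300  # avoid division by zero in the next step
        if pivot < 0.0:
            count += 1
    return count

def lowest_cooperon_eigenvalue(B, k_y=0.0, half_width=8.0, n_grid=1600):
    """Smallest eigenvalue of -d^2/dx^2 + (k_y + 2 B x)^2 on a finite-difference grid."""
    dx = 2.0 * half_width / (n_grid - 1)
    xs = [-half_width + j * dx for j in range(n_grid)]
    diag = [2.0 / dx ** 2 + (k_y + 2.0 * B * x) ** 2 for x in xs]
    off = -1.0 / dx ** 2
    lo, hi = 0.0, max(diag)  # the operator is positive; its lowest level is below any diagonal entry
    for _ in range(80):      # bisection on the eigenvalue count
        mid = 0.5 * (lo + hi)
        if count_below(diag, off, mid) >= 1:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

def cooperon_at_zero_frequency(B, D=1.0):
    """1 / (D * lowest eigenvalue): finite for B > 0, in units of (2 pi nu tau^2 L^d)^-1."""
    return 1.0 / (D * lowest_cooperon_eigenvalue(B))
```

The lowest eigenvalue does not depend on $k_y$ (it only shifts the oscillator centre), so the zero-frequency cooperon stays finite for every mode once $B \neq 0$.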

To conclude this section, let us discuss the role of the cooperon mode within the larger 
framework of the “quantum diffusive” process. Neglecting the cooperon, we had identified 
the density-density correlation function - formally computed as a ladder sequence where 
each rung was given by the trivial single impurity line irreducible vertex - with a classical 
diffusive process. However, we have seen that coherent loop excursions traversed in opposite 
directions give rise to quantum corrections to this picture. Formally, these excursions were 
described by maximally crossed corrections to the irreducible vertex. We saw that, in the 
absence of external fields, these corrections quantitatively resembled the diffusive form of 
the (leading-order) particle-hole mode. (This could have been anticipated from Fig. 6.14(c). 
Although the impurity sequence is now traversed in an opposite sense, the cooperon excur- 
sion provides a diffusive contribution.) Does this formal similarity imply that the cooperon 
modes give a large $\mathcal{O}(1)$ correction to the leading classical term? To understand why it
does not, we have to, once again, consider the phase space structure of the fast integration
momenta.

The diffusion mode is computed for an overall conserved small momentum difference 
between the upper and lower momenta. On the other hand, the cooperon mode requires 
the momentum sum of the upper incoming and lower outgoing momenta to be small. These 
two conditions have to be reconciled. It is straightforward to see that this leads to the loss 
of Fermi-sphere integration volume. In other words, the cooperon gives rise to a singular
(in $Dq^2 + |\omega_m|$) correction to the diffusion mode. However, this correction is weighted by a
small factor $\sim (p_F\ell)^{1-d}$.

The reader may wonder why we left the cooperon mode calculation with the semi- 
quantitative arguments above. The reason is that the actual embedding of the cooperon 
mode into the density-density correlation function, i.e. the solution of the Bethe-Salpeter 
equation for the full vertex in the presence of a cooperon-corrected irreducible vertex, is a 
tricky business. It requires a fair amount of diagrammatic skill to obtain the correct result 
(notably one that fulfills the conservation laws discussed in A1). In the following section,
we will see that this task is much more efficiently tackled by functional methods, a feature 
not uncommon in the field-theoretic environment. 

Further, one may wonder what happens when the external momenta are tuned to zero. 
Eventually, the singularity of the cooperon mode will seriously compete with its small 
prefactor, so that qualitative corrections to classical diffusion must be expected. Indeed, 
it turns out that a proliferation of cooperon corrections leads to a slowing down of the 
diffusive propagation of electron densities. In a way, the cooperon describes the tendency 
of an electron’s quantum amplitude to constructively self-interfere at the point of origin of 
the closed loop. Through an accumulation of such processes, the electron may eventually 
get stuck or localized. Qualitatively, this is the origin of the phenomenon of Anderson 
localization in metals. We shall return to this phenomenon from a more field-theoretical 
perspective below. 


Mean-field theory and spontaneous symmetry breaking 

Previously, we have seen that the Green function of the disordered electron gas contains a 
complex self-energy describing the damping of quasi-particle amplitudes by impurity scat- 
tering. Further, we have found that the more complex four-point function is a long-range 
object whose behavior is governed by soft modes of density relaxation: the diffuson and 
cooperon modes. The existence of the latter was intimately tied to the time-reversal invari- 
ance properties of the system. In concluding, we had anticipated that a complete theory 
of the weakly disordered conductor should be able to describe the embedding of diffuson 
and cooperon modes into structures of higher complexity. It also became quite clear that 
the construction of a diagrammatic approach to the problem would be a formidable task; 
rather one should aim to find a formulation wherein the diffuson and cooperon modes (in 
contrast to individual fermion Green functions) appear as fundamental degrees of freedom. 
Such considerations call for a field-theoretical construction. In this section, we utilize the 
machinery of the present chapter to construct the low-energy field theory of the disordered
electron gas. In doing so, we establish contact with our previous results obtained from
the diagrammatic perturbation theory, and we understand the phenomenon of soft mode
formation in the disordered electron system from a wider perspective.

Figure 6.17 The three different low-momentum channels of the impurity vertex: exchange, Cooper,
and direct.

As usual, our starting point is the replicated action of the non-interacting disordered
electron gas derived in Section 6.5:

$$S[\bar\psi,\psi] = \int d\tau\, d^dr\;\bar\psi\Big(\partial_\tau - E_F - \frac{\partial^2}{2m} + V\Big)\psi \;\xrightarrow{\;\langle\dots\rangle_{\rm dis}\;}\; \int d\tau\, d^dr\;\bar\psi\Big(\partial_\tau - E_F - \frac{\partial^2}{2m}\Big)\psi - \frac{1}{4\pi\nu\tau}\int d\tau\, d\tau'\, d^dr\;(\bar\psi\psi)(\tau)\,(\bar\psi\psi)(\tau'), \qquad (6.57)$$

where the fields $\psi = \{\psi^a\}$ carry a replica index $a = 1,\dots,R$, and $V$ denotes the disorder
potential. The last term is generated by the ensemble averaging, where we have assumed
that the potential is $\delta$-correlated and expressed its strength through the scattering time
$\tau$. (For clarity, wherever possible, we will avoid spelling out the spatial argument $\mathbf{r}$ of the
fields.) Of course, our master strategy will be to decouple the disorder-generated interaction
by a Hubbard-Stratonovich transformation. However, before doing so, we need to decide
which of the three low-momentum channels of the interaction vertex to keep (see Fig. 6.17).

The first contribution (termed the “exchange channel” in our general discussion in Sec- 
tion 6.1) appeared as a principal building block in our diagrammatic discussion of the 
diffuson mode above; this contribution should surely be retained. Similarly, the second dia- 
gram (the Cooper channel) played an important role in the computation of the maximally 
crossed diagram and must be retained. The third diagram (the “direct channel”) describes 
the scattering off impurities at low momentum transfer. There is no reason why such pro- 
cesses should play a distinguished role, so we will ignore this channel. 

Following the general discussion of Section 6.1, we should be prepared to introduce two 
different Hubbard-Stratonovich fields, one for each impurity vertex. There is, however, 
an ingenious trick whereby the number of required Hubbard-Stratonovich fields can be 
reduced to one. The idea is to exploit the fact that the diffuson and the cooperon mode are 
linked to each other by the symmetry operation of time-reversal. (Recall that the cooperon 
mode described the traversal of an impurity path in a chronologically reversed order.) 
In the quantum mechanics of spinless particles (for the straightforward extension of the
present discussion to particles with spin, we refer to Efetov⁷³) the anti-unitary operation
of time-reversal $T$ maps operators $X$ onto their transposes, $X \to TX = X^T$.⁷⁴ Notice
that $T\mathbf{p} = -\mathbf{p}$, i.e. $T$ induces the momentum reversal distinguishing the diffuson and the
cooperon vertex above. This latter observation suggests that one should consider a version
of the theory “symmetrized” under time-reversal. Within the symmetrized description, the
diffuson and the cooperon vertex should, in some sense, merge into a unified object. The
practical implementation of this idea goes as follows.

⁷³ K. B. Efetov, Supersymmetry and the theory of disordered metals, Adv. Phys. 32 (1983), 53-127.

Let us return to the action prior to averaging and write

$$S[\psi,\bar\psi] = -\int d\tau\, d^dr\;\bar\psi G^{-1}\psi = -\frac{1}{2}\int d\tau\, d^dr\,\big[\bar\psi G^{-1}\psi + (\bar\psi G^{-1}\psi)^T\big]$$
$$\qquad = -\frac{1}{2}\int d\tau\, d^dr\,\big[\bar\psi G^{-1}\psi - \psi^T G^{-1}\big|_{\partial_\tau\to-\partial_\tau}\bar\psi^T\big] = -\frac{1}{2}\int d\tau\, d^dr\;\bar\Psi G^{-1}\Psi.$$

Here, $G^{-1} = -\partial_\tau - H + E_F$ and, in the second equality, we have used the fact that $\bar\psi G^{-1}\psi =
(\bar\psi G^{-1}\psi)^T$ (simply because it is a number). In the third equality, we have expressed the
transpose in terms of its constituents and used the fact that $(G^{-1})^T = G^{-1}|_{\partial_\tau\to-\partial_\tau}$ (time-reversal
symmetry!).⁷⁵ (The overall minus sign comes from the permutation of the Grassmann
components of $\psi$, $\bar\psi$.) Finally, we have defined

$$\Psi(\tau) = \begin{pmatrix}\psi(\tau)\\ \bar\psi^T(-\tau)\end{pmatrix}, \qquad \bar\Psi(\tau) = \big(\bar\psi(\tau),\; -\psi^T(-\tau)\big),$$

to condense the action into a single bilinear form.⁷⁶ Notice that the enlarged fields $\Psi$ and
$\bar\Psi$ are no longer independent of each other; they are connected by the symmetry operation

$$\bar\Psi(\tau) = -\Psi^T(-\tau)\,(i\sigma_2^{\rm tr}), \qquad (6.58)$$

where $\sigma_i^{\rm tr}$ represents Pauli matrices acting in the newly introduced “time-reversal space.”
Involving a transposition operation, Eq. (6.58) signals the fact that the fields $\Psi$ and $\bar\Psi$
resemble “real” rather than “complex” fields (the quotes are used because we are dealing
with Grassmann fields).

Being now symmetrized, an average of the field integral over the Gaussian distributed
impurity potential gives

$$S[\Psi] = \frac{1}{2}\int d\tau\, d^dr\;\bar\Psi\Big(\partial_\tau - E_F - \frac{\partial^2}{2m}\Big)\Psi - \frac{1}{16\pi\nu\tau}\int d\tau\, d\tau'\, d^dr\;(\bar\Psi\Psi)(\tau)\,(\bar\Psi\Psi)(\tau').$$

Cast in this form, we are now in a position to implement the usual programme to construct
the low-energy field theory of the quantum disordered system:


Q1: Introduce a Hubbard-Stratonovich transformation to decouple the quartic interaction
induced by the impurity average. Note that the corresponding Hubbard-Stratonovich field $Q$
must be matrix-valued - identify the set of indices on which the fields depend. Finally,
integrate out the fermions to determine the dependence of the effective action on the field $Q$.

⁷⁴ A more precise formulation is that there exists a representation (typically, the coordinate representation)
wherein time-reversal amounts to transposition.

⁷⁵ Note that, under time-reversal, the sign of the time-derivative is reversed:
$\int d\tau\,\bar\psi\,\partial_\tau\psi = -\int d\tau\,(\partial_\tau\bar\psi)\,\psi = -\int d\tau\,\psi^T(-\partial_\tau)\bar\psi^T$.

⁷⁶ Notice that the temporal minus sign in the fields $\Psi$ and $\bar\Psi$ is introduced to remove the unwanted sign
multiplying $\partial_\tau$ in $[G^{-1}]^T$. Physically, it emphasizes the time-like character of the transformation.

Q2: Starting with the effective action for Q, derive the corresponding mean-field equations. 
(Hint: You will find it convenient to switch to a frequency-momentum representation before 
varying the action.) 

Q3: The mean-field equations can be solved by a simple matrix-diagonal ansatz, i.e. $Q = \Lambda$.
Motivate your ansatz and solve the equations. How does the result relate to the SCBA
discussed in Section 6.5?

Q4: Assuming that the frequency contribution to the action $\sim \bar\Psi_n\omega_n\Psi_n$ is negligibly small,
identify the global continuous symmetries of the problem, i.e. explore what happens if the
fermion fields are transformed as $\Psi \to T\Psi$, $\bar\Psi \to \bar\Psi\bar T$, where the matrices $T$ and $\bar T$ are
constant in space and must be chosen so as to respect the relation (6.58). Show that the
diagonal solution of the mean-field equations derived in Q3 breaks the full symmetry of the
problem. Identify the manifold of Goldstone modes of the problem.

A1: To effect a Hubbard-Stratonovich decoupling of the action, we must first isolate the
Cooper and exchange contributions to the interaction vertex:

$$S_{\rm dis} \simeq -\frac{1}{16\pi\nu\tau}\int d\tau\, d\tau'\, d^dr\,\Big(\bar\Psi^{(1)}(\tau)\Psi^{(2)}(\tau)\,\bar\Psi^{(1)}(\tau')\Psi^{(2)}(\tau') + \bar\Psi^{(1)}(\tau)\Psi^{(2)}(\tau)\,\bar\Psi^{(2)}(\tau')\Psi^{(1)}(\tau')\Big),$$

where the convention is that two fields carrying the same upper index form a slowly varying
bilinear. Now, thanks to the symmetry Eq. (6.58), the two contributions to the action are
identical. Indeed,

$$\bar\Psi^{(1)}\Psi^{(2)} = (\bar\Psi^{(1)}\Psi^{(2)})^T = -\Psi^{(2)T}\bar\Psi^{(1)T} = \bar\Psi^{(2)}(i\sigma_2^{\rm tr})(i\sigma_2^{\rm tr})^T\Psi^{(1)} = \bar\Psi^{(2)}\Psi^{(1)},$$

where we have used the symmetry (6.58) and $(i\sigma_2^{\rm tr})(i\sigma_2^{\rm tr})^T = 1$. Thus, we may focus on the
decoupling of the single term
$S_{\rm dis} = -(8\pi\nu\tau)^{-1}\int d\tau\, d\tau'\, d^dr\;\bar\Psi^{(1)}(\tau)\Psi^{(2)}(\tau)\bar\Psi^{(2)}(\tau')\Psi^{(1)}(\tau')$. To
make further progress with this expression, let us introduce a composite index $\alpha = (a,\sigma)$
comprising the replica index $a$ and an index $\sigma = 1, 2$ labeling components in time-reversal
space. Using this notation,

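$$S_{\rm dis} = -\frac{1}{8\pi\nu\tau}\int d\tau\, d\tau'\, d^dr\;\bar\Psi^{(1)\alpha}(\tau)\Psi^{(2)\alpha}(\tau)\bar\Psi^{(2)\beta}(\tau')\Psi^{(1)\beta}(\tau')$$
$$\qquad\;\; = \frac{1}{8\pi\nu\tau}\int d\tau\, d\tau'\, d^dr\;\Psi^{(1)\beta}(\tau')\bar\Psi^{(1)\alpha}(\tau)\,\Psi^{(2)\alpha}(\tau)\bar\Psi^{(2)\beta}(\tau').$$

To decouple the interaction, let us introduce an infinite-dimensional, Hermitian matrix
field $Q = \{Q^{\alpha\beta}(\mathbf{r};\tau,\tau')\}$, slowly fluctuating in space and of the same structure as the dyadic
product $\Psi\bar\Psi = \{\Psi^\alpha(\tau)\bar\Psi^\beta(\tau')\}$, i.e. $Q = (i\sigma_2^{\rm tr})\,Q^T(i\sigma_2^{\rm tr})^{-1}$. One may then multiply the
functional integral by⁷⁷ $\int DQ\,\exp\big[-\frac{\pi\nu}{8\tau}\int d^dr\;\mathrm{tr}\,Q^2\big]$ and perform a shift $Q \to Q + i\Psi\bar\Psi/\pi\nu$.

⁷⁷ Here, $\mathrm{tr}\,Q^2 = \int d\tau\, d\tau'\; Q^{\alpha\beta}(\tau,\tau')Q^{\beta\alpha}(\tau',\tau)$ and Hermiticity of $Q$ is assumed to ensure the existence of the
integral.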
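This generates the identity

$$e^{-S_{\rm dis}[\bar\Psi,\Psi]} = \int DQ\,\exp\Big[-\frac{\pi\nu}{8\tau}\int d^dr\;\mathrm{tr}\,Q^2 + \frac{i}{4\tau}\int d^dr\;\bar\Psi Q\Psi\Big],$$

where we have made use of the fact that $\mathrm{tr}\,(Q\Psi\bar\Psi) = -\bar\Psi Q\Psi = -\int d\tau\, d\tau'\;\bar\Psi^\alpha(\tau)\,
Q^{\alpha\beta}(\tau,\tau')\,\Psi^\beta(\tau')$ and the superscripts (1) have been dropped. Combining terms, we arrive
at the preliminary result⁷⁸

$$Z = \int D\Psi\, DQ\,\exp\Big[-\frac{\pi\nu}{8\tau}\int d^dr\;\mathrm{tr}\,Q^2 - \frac{1}{2}\int d\tau\, d^dr\;\bar\Psi\Big(\partial_\tau - E_F - \frac{\partial^2}{2m} - \frac{i}{2\tau}Q\Big)\Psi\Big]$$
$$\quad = \int DQ\,\exp\Big[-\frac{\pi\nu}{8\tau}\int d^dr\;\mathrm{tr}\,Q^2 + \frac{1}{2}\,\mathrm{tr}\ln G^{-1}[Q]\Big]. \qquad (6.59)$$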


A2: Fourier transforming the field $Q(\tau,\tau')$ in its time arguments one obtains a matrix
$Q = \{Q_{nn'}\}$, where $n, n'$ index fermionic Matsubara frequencies. The frequency/momentum
version of the action then assumes the form

$$S[Q] = \frac{\pi\nu L^d}{8\tau}\sum_{\mathbf{q}}\mathrm{tr}\,\big(Q(\mathbf{q})Q(-\mathbf{q})\big) - \frac{1}{2}\,\mathrm{tr}\ln G^{-1}[Q],$$

where $G[Q] = (i\hat\omega - \xi_{\mathbf{p}} + iQ/2\tau)^{-1}$, $\hat\omega$ is the diagonal operator of Matsubara frequencies
and all indices (including momentum indices) not written out explicitly are traced over.
Varying the action with respect to the matrix $Q$, i.e. $Q \to Q + \delta Q$, and requiring vanishing
of contributions linear in $\delta Q$, one obtains the equation

$$\sum_{\mathbf{q}}\mathrm{tr}\Big[\Big(\pi\nu L^d\,Q(\mathbf{q}) - i\sum_{\mathbf{p}}G[Q]_{\mathbf{p},\mathbf{p}+\mathbf{q}}\Big)\,\delta Q(-\mathbf{q})\Big] = 0.$$

Holding for any matrix $\delta Q$, this equation is equivalent to the matrix mean-field equation
$\pi\nu\,Q(\mathbf{q}) - \frac{i}{L^d}\sum_{\mathbf{p}}G[Q]_{\mathbf{p},\mathbf{p}+\mathbf{q}} = 0$.

A3: To seek solutions of the mean-field equation, let us apply the simplest ansatz, $Q = \Lambda$,
where $\Lambda$ is diagonal in time-reversal and replica spaces, spatially uniform, and diagonal in
Matsubara space. When this is substituted into the mean-field equation, the latter assumes
the form

$$\pi\nu\Lambda = \frac{i}{L^d}\sum_{\mathbf{p}}\frac{1}{i\omega_n - \xi_{\mathbf{p}} + \frac{i}{2\tau}\Lambda}, \qquad (6.60)$$

reminiscent of the SCBA Eq. (6.52) for the average Green function (the identification
$i\Lambda/2\tau = -\Sigma$ understood). Drawing on the earlier discussion, one may identify the solution
$\Lambda_n = 2i\tau\Sigma_n = \mathrm{sgn}(\omega_n)$ (where $\Sigma_n = -i\,\mathrm{sgn}(\omega_n)/2\tau$).
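That $\Lambda_n = \mathrm{sgn}(\omega_n)$ solves Eq. (6.60) can be confirmed by direct iteration. The sketch below assumes a constant density of states, so that $L^{-d}\sum_{\mathbf{p}} \to \nu\int d\xi$ with a symmetric cutoff sent to infinity; the names `local_green`, `mean_field_rhs` and `solve_lambda` are illustrative. The iteration is started from $\Lambda = 0$, so that the sign of the solution is fixed by $\omega_n$ alone.

```python
import math

def local_green(omega_n, lam, tau, nu=1.0, n_grid=2000):
    """nu * integral d(xi) 1/(i omega_n - xi + i lam/(2 tau)) over a symmetric range.

    With xi = a*tan(theta), a = |omega_n + lam/(2 tau)|, the contributions of +xi and
    -xi are added together so that the odd real part cancels pair by pair.
    """
    b = omega_n + lam / (2.0 * tau)
    if b == 0.0:
        raise ValueError("omega_n + lam/(2 tau) must be non-zero")
    a = abs(b)
    h = 0.5 * math.pi / n_grid
    total = 0j
    for k in range(n_grid):
        theta = (k + 0.5) * h
        xi = a * math.tan(theta)
        jacobian = a / math.cos(theta) ** 2  # d(xi)/d(theta)
        total += (1.0 / complex(-xi, b) + 1.0 / complex(xi, b)) * jacobian
    return nu * total * h

def mean_field_rhs(omega_n, lam, tau, nu=1.0):
    """Right-hand side of (6.60) divided by pi*nu; a self-consistent Lambda reproduces itself."""
    return (1j * local_green(omega_n, lam, tau, nu) / (math.pi * nu)).real

def solve_lambda(omega_n, tau, n_iter=5):
    """Fixed-point iteration of (6.60), started from Lambda = 0."""
    lam = 0.0
    for _ in range(n_iter):
        lam = mean_field_rhs(omega_n, lam, tau)
    return lam
```

For a fermionic frequency $\omega_n = (2n+1)\pi T$ the iteration settles on $\pm 1$ after a single step, because the $\xi$-integral depends only on the sign of $\omega_n + \Lambda/2\tau$.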


⁷⁸ Notice the prefactor 1/2 generated by the Grassmann integration, i.e.
$\int D\Psi\,\exp[-\frac{1}{2}\bar\Psi G^{-1}\Psi] \propto \det(G^{-1})^{1/2} = \exp[\frac{1}{2}\mathrm{tr}\ln G^{-1}]$ arises because the components of $\Psi$ and $\bar\Psi$
are not independent. (To see this, assume that $G$ is diagonalized and count the number of independent
Grassmann integration variables.)

A4: In the mean-field approximation, $G$ becomes equivalent to the impurity-averaged Green
function discussed in Section 6.5. But, clearly, this cannot be the end of the story! We have
seen in Section 6.5 that impurity scattering leads to diffusive phenomena that reach far
beyond the damping of single-particle amplitudes. How are such structures reflected in the
field theory? To identify the diffusion modes, one must return to consider the symmetries
of the theory (having in mind that symmetries tend to generate long-range low-energy
excitations). We thus go back to our initial action $-\frac{1}{2}\int d\tau\,\bar\Psi G^{-1}[V]\Psi$ and explore what happens
as we transform $\Psi \to \Psi' = T\Psi$ and $\bar\Psi \to \bar\Psi' = \bar\Psi\bar T$, where $T$ and $\bar T$ are matrices, constant
in real space. Of course we require that the fundamental relation (6.58) be preserved under
this transformation. This leads to the condition $\bar T = (i\sigma_2^{\rm tr})\,T^T(i\sigma_2^{\rm tr})^{-1}$. Moreover, $T$ should
be a symmetry of the problem, i.e. we require that $\bar TG^{-1}[V]T \simeq G^{-1}[V]\bar TT = G^{-1}[V]$ or
$\bar T = T^{-1}$. Here, we have used the fact that, owing to its constancy, $T$ commutes with the
(disordered) Hamiltonian. Nevertheless, $\bar TG^{-1}[V] \simeq G^{-1}[V]\bar T$ holds only approximately since,
for a transformation $T_{nn'}$ of a general frequency structure, $[T,\hat\omega] \neq 0$. However, for
transformations of physical interest (fluctuating slowly in time), this lack of commutativity is not
of much concern. Combining the two conditions above, we find that $T^{-1} = (i\sigma_2^{\rm tr})\,T^T(i\sigma_2^{\rm tr})^{-1}$,
which is the defining relation of the unitary symplectic group $\mathrm{Sp}(2\cdot R\cdot(2M)) = \mathrm{Sp}(4RM)$. Here, we have
assumed that $[-\omega_{\max},\omega_{\max}] = [-2\pi MT, 2\pi MT]$ is the range of $2M$ Matsubara frequencies
over which the lack of commutativity of the symmetry group and the frequency operator
can safely be neglected.
With this background, we may note that the diagonal solution of the mean-field equations
breaks the symmetry above. Indeed, the fact that $T$ is a symmetry implies that, not
only $\Lambda$, but any configuration $Q = T\Lambda T^{-1}$, solves the mean-field equation (6.60). The
symmetry is broken, inasmuch as only a subgroup of transformations $T_0 \in K \subset \mathrm{Sp}(4RM)$,
$T_0\Lambda T_0^{-1} = \Lambda$, leaves the diagonal saddle-point invariant. (It may be helpful to think of
$\Lambda$ as some kind of spin, $\mathrm{Sp}(4RM)$ as the analog of the rotation group, and $K$ as the group
of rotations around the spin axis.) The invariance group $K$ is easily identified. Projected
onto the positive/negative frequency range, the diagonal solution $\Lambda|_{\omega_n \gtrless 0} = \pm 1$ is proportional
to the unit operator, i.e. transformations not mixing positive and negative frequencies
will leave the saddle-point invariant. This means that the invariance subgroup is given by
$\mathrm{Sp}(2RM)\times\mathrm{Sp}(2RM) \subset \mathrm{Sp}(4RM)$ where the two factors are the invariance groups of the
positive/negative frequency range. Following our general line of argument on page 257 we
may conclude that the Goldstone mode manifold of the present problem is the coset space
$\mathrm{Sp}(4RM)/(\mathrm{Sp}(2RM)\times\mathrm{Sp}(2RM))$. The elements of this space are conveniently
parameterized as $Q = T\Lambda T^{-1}$, where $T = \exp[W]$ and the generators $W$ anti-commute with $\Lambda$,
$[W,\Lambda]_+ = 0$.⁷⁹ This condition is met by setting

$$W = \begin{pmatrix} 0 & B\\ -B^\dagger & 0\end{pmatrix},$$

where the block structure is in Matsubara frequency space (positive/negative frequencies)
and $B$ are $2MR$-dimensional matrix fields. The anti-Hermitian connection
between the two blocks follows from the condition $Q^\dagger = Q \Rightarrow W^\dagger = -W$. Notice that
$(i\sigma_2^{\rm tr})\,e^{W^T}(i\sigma_2^{\rm tr})^{-1} = e^{-W}$ implies a second condition $B^\dagger = (i\sigma_2^{\rm tr})\,B^T(i\sigma_2^{\rm tr})^{-1}$. As one last
technical remark - of crucial importance for all that follows - let us notice that the matrices
$Q$ obey the nonlinear constraint $Q^2 = 1$. (This follows from the fact that $Q^2 = (T\Lambda T^{-1})^2 = T\Lambda^2T^{-1}$
and $\Lambda^2 = \mathrm{sgn}^2(\omega_n) = 1$.) Field theories on manifolds with nonlinear constraints are
generally called nonlinear $\sigma$-models.⁸⁰ In the next section, we proceed to construct the
low-energy action of the nonlinear $\sigma$-model of the disordered electron gas.

⁷⁹ Generators commuting with $\Lambda$ need not be taken into account inasmuch as they do not affect the diagonal
saddle-point (or span the group $K$).
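The algebraic statements about the Goldstone manifold are easy to test on small matrices. The following sketch makes simplifying assumptions: the positive and negative frequency blocks are truncated to dimension two, $B$ is a random complex matrix, and the additional time-reversal (symplectic) condition on $B$ is deliberately not imposed, since it plays no role for the three properties checked here. All helper names are illustrative.

```python
import random

def matmul(a, b):
    """Product of two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def dagger(a):
    """Hermitian conjugate."""
    return [[a[j][i].conjugate() for j in range(len(a))] for i in range(len(a[0]))]

def identity(n):
    return [[complex(i == j) for j in range(n)] for i in range(n)]

def expm(a, squarings=6, terms=20):
    """exp(a) from the Taylor series of a / 2^squarings followed by repeated squaring."""
    n = len(a)
    small = [[x / 2.0 ** squarings for x in row] for row in a]
    result, term = identity(n), identity(n)
    for k in range(1, terms + 1):
        term = [[x / k for x in row] for row in matmul(term, small)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    for _ in range(squarings):
        result = matmul(result, result)
    return result

def max_abs_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(len(a)) for j in range(len(a[0])))

def goldstone_configuration(b_block):
    """Return (W, Lambda, Q) with W = [[0, B], [-B^dagger, 0]] and Q = e^W Lambda e^{-W}."""
    m = len(b_block)
    b_dag = dagger(b_block)
    zeros = [0j] * m
    w = ([zeros + b_block[i] for i in range(m)]
         + [[-x for x in b_dag[i]] + zeros for i in range(m)])
    lam = [[complex((1 if i < m else -1) if i == j else 0) for j in range(2 * m)]
           for i in range(2 * m)]
    t = expm(w)
    t_inv = expm([[-x for x in row] for row in w])
    return w, lam, matmul(matmul(t, lam), t_inv)

random.seed(1)
B = [[complex(random.gauss(0, 0.5), random.gauss(0, 0.5)) for _ in range(2)] for _ in range(2)]
W, LAM, Q = goldstone_configuration(B)
```

For any such $B$ one finds $[W,\Lambda]_+ = 0$, $Q^\dagger = Q$ and $Q^2 = 1$ to numerical precision, while $Q$ itself is no longer diagonal: the rotated saddle point lies on the manifold but away from $\Lambda$.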


Low-energy field theory 

In the previous section we constructed the backbone of a field-theoretical approach to the 
disordered electron gas, ran into a scenario of spontaneous symmetry breaking, and identi- 
fied the corresponding Goldstone mode manifold as Sp(4i?M)/(Sp(2i?M) x Sp(2i?M)). In 
the following, we now proceed to construct the low-energy action of the Goldstone mode 
degrees of freedom. Our second objective is to understand the connections to the diagram- 
matic approach introduced above. Finally, we wish to develop an efficient theoretical formal- 
ism, powerful enough to cope with the nonlinear proliferation of diffuson- and cooperon-type 
soft modes that govern the physics of disordered conductors at low temperatures. 

Our target is a low-energy action of the elements $Q = T\Lambda T^{-1} \in \mathrm{Sp}(4RM)/(\mathrm{Sp}(2RM) \times
\mathrm{Sp}(2RM))$ of the Goldstone mode manifold. The action of this nonlinear $\sigma$-model can be
constructed by substituting the slowly fluctuating matrix configuration

$$Q(\mathbf{r}) = T(\mathbf{r})\,\Lambda\,T^{-1}(\mathbf{r})$$

into the action (6.59) and expanding in (a) gradients and (b) the small symmetry breaking
induced by the lack of commutativity of the “rotation matrices” $T$ and the frequency operator
$\hat\omega$ - a program which is instructive to follow.[81] However, in the present section we shall
pursue a more symmetry-oriented (but only slightly less rigorous) strategy of constructing
the action.

Q1: Using only symmetry arguments, identify the minimal (lowest number of derivatives)
action $S_0[Q]$ measuring the cost of spatial fluctuations of the $Q$ field. What is the structure
of the action $S_\omega[Q]$ accounting for the non-commutativity of $Q$ and the frequency operator?

Q2: Our analysis so far has left the values of the two coupling constants of the theory unde- 
termined. As we do not wish to perform an explicit gradient expansion, the only way to 
determine their values is to compare the predictions of the field-theoretical approach with 
those of another formalism. Indeed, we have seen in Section 6.5 that the long-range behavior 
of the density-density correlation function is governed by specific types of soft modes:
diffusons and cooperons. We also saw that, at low frequency scales, these modes were prone to
“interact” with each other. (A cooperon insertion into the diffuson mode can be interpreted
as some kind of “self-energy” of the diffuson propagator.) Comparing this picture with the
structure of the field theory, it is evident that the elementary diffusons/cooperons must
be the Goldstone modes of the $\sigma$-model. “Interactions” between diffusons and cooperons
appear when we expand the theory to higher than quadratic order in the generators $B$.

Expand the action (6.62) to second order in the generators $B$ defined in Eq. (6.61).
Comparing the structure of the quadratic action with that of the diffuson mode, fix the
ratio of the coupling constants $c_0$ and $c_\omega$.

[80] Other representatives of this family of field theories will be met in Chapters 8 and 9.

[81] A yet more systematic approach would be to decompose the full field manifold $Q = Q_{\rm gs} + Q_{\rm m}$ into a Goldstone
mode configuration $Q_{\rm gs} = T\Lambda T^{-1}$ and a massive contribution $Q_{\rm m}$ and to integrate over the latter in the second
order of perturbation theory. However, it can be shown that, in the present case, the contribution of the massive
modes is inessential, i.e. we may work with the pure Goldstone theory from the outset.

Q3: Having determined the ratio of the two coupling constants, one may now complete
the analysis by fixing the absolute value of $c_\omega$. We do so by introducing a source term
$S[\bar\Psi,\Psi] \to S[\bar\Psi,\Psi] - \frac{1}{2L^d}\int d^dr\,\bar\Psi_n\kappa_n\Psi_n$ such that differentiation

$$\lim_{R\to 0}\frac{1}{R}\,\mathrm{Im}\,\partial_{\kappa_n}\Big|_{\kappa=0} Z^R
= \lim_{R\to 0}\frac{1}{R}\,\mathrm{Im}\,\frac{1}{2L^d}\int d^dr\,\langle\bar\Psi_n\Psi_n\rangle
= \frac{1}{L^d}\,\mathrm{Im}\sum_{p} G_{p,n}
= -\pi\nu\,\mathrm{sgn}(\omega_n),$$

gives the density of states. Use the source $\kappa = \{\kappa_n\delta_{nn'}\}$ to determine the coupling constant
$c_\omega$.

A1: As long as we ignore the symmetry breaking due to the frequency operator, the action
is symmetric under transformations $Q \to TQT^{-1}$, where $T \in \mathrm{Sp}(4RM)$ is an element of
the symmetry group. This requirement uniquely identifies $S_0[Q] = c_0\int d^dr\,\mathrm{tr}(\nabla Q)^2$ as
the dominant (lowest number of derivatives) contribution to the fluctuation action. The
dominant operator describing the explicit breaking of the global symmetry will surely be
of first order in $\hat\omega$: $S_\omega[Q] \sim \int d^dr\,\mathrm{tr}(\hat\omega F[Q])$, where $F$ is some function. Since $Q^2 = 1$, only
terms linear in $Q$ contribute to the expansion of $F$ and we may set $S_\omega[Q] = c_\omega\int d^dr\,\mathrm{tr}(\hat\omega Q)$.
Summarizing, one may deduce that the soft mode action acquires the form

$$S[Q] = \int d^dr\,\mathrm{tr}\left[c_0(\nabla Q)^2 + c_\omega\,\hat\omega Q\right]. \qquad (6.62)$$

As with other nonlinear $\sigma$-models, the innocent appearance of the action above is somewhat
deceptive: we need to remember that the field $Q = e^{W}\Lambda e^{-W}$ is a nonlinear object. An
expansion of it in terms of the generators $W$ will generate “interaction operators” $\sim W^n$ of
arbitrary (even) order in $n$. The analysis of these interactions is a highly non-trivial task!

A2: Substituting $Q = e^{W}\Lambda e^{-W} = e^{2W}\Lambda = (1 + 2W + 2W^2 + \cdots)\Lambda$ (the second equality
follows from $[W,\Lambda]_+ = 0$) into the action, one obtains

$$S^{(2)}[W] = \int d^dr\,\mathrm{tr}\left[-4c_0(\nabla W)^2 + 2c_\omega\,\Lambda\hat\omega W^2\right].$$

One may now express the matrices $W$ in terms of the off-diagonal blocks $B$ and arrive at

$$S^{(2)}[B, B^\dagger] = \int d^dr\,\mathrm{tr}\left[8c_0\,\partial B\,\partial B^\dagger - 2c_\omega\,\hat\omega\left(\mathcal{P}^{+}BB^\dagger - \mathcal{P}^{-}B^\dagger B\right)\right]$$

$$\phantom{S^{(2)}[B, B^\dagger]} = \sum_{q,\,n>0,\,n'<0} B^{\alpha\alpha'}_{q,nn'}\left(8c_0 q^2 - 2c_\omega|\omega_n - \omega_{n'}|\right)(B^\dagger)^{\alpha'\alpha}_{q,n'n}.$$
Here, $\mathcal{P}^{+/-}$ is a projector onto the space of positive/negative Matsubara frequencies. In
the second line, $\alpha = (a, \sigma)$ is the composite replica/time-reversal index introduced above.
Importantly, the propagator of the $B$-field describes a mode of diffusive dynamics formed by
the joint propagation of two states with (a) the Matsubara frequencies $\omega_n > 0$ and $\omega_{n'} < 0$
and (b) the momentum difference $q$. The analogy to our previous diagrammatic analysis of
such processes is made complete by choosing $8c_0 = cD$, $-2c_\omega = c$, or

$$S^{(2)}[B, B^\dagger] = c\sum_{q,\,n>0,\,n'<0} B^{\alpha\alpha'}_{q,nn'}\left(Dq^2 + |\omega_n - \omega_{n'}|\right)(B^\dagger)^{\alpha'\alpha}_{q,n'n},$$

where $c$ is an, as yet, undetermined global coupling constant.

Before continuing, it is worthwhile taking a brief look at the time-reversal structure of
the theory. Recalling that $Q^{\sigma\sigma'} \sim \Psi^{\sigma}\bar\Psi^{\sigma'}$, one may conclude that Goldstone modes $B$ with
an index structure $B^{\sigma\sigma'}$ describe the joint propagation of a fermion propagating forwards
in time ($\sigma = 1$) (or time-reversed [$\sigma = 2$]) with a hole propagator moving forwards in time
($\sigma' = 1$) (or time reversed [$\sigma' = 2$]). Put differently, modes $B^{\sigma\sigma'}$ off-diagonal in time-reversal
space ($\sigma \neq \sigma'$) represent cooperon modes, while modes $B^{\sigma\sigma}$ reflect the diffuson modes.
In the absence of magnetic fields, the actions of these two modes are identical (rotational
invariance of the action in time-reversal space). However, had we included a magnetic field,
this invariance would be lost.

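A3: Noting that the addition of the source to the action amounts to a shift $\hat\omega \to \hat\omega - i\hat\kappa/L^d$,
one may conclude that the low-energy action changes to $S_\omega[Q] \to c_\omega\int d^dr\,\mathrm{tr}\left[(\hat\omega - i\hat\kappa/L^d)Q\right]$.
Differentiation of the $Q$-functional thus gives

$$-\pi\nu\,\mathrm{sgn}(\omega_n) = \lim_{R\to 0}\frac{1}{R}\,\mathrm{Im}\,\partial_{\kappa_n}\Big|_{\kappa=0} Z^R
= \lim_{R\to 0}\frac{1}{R}\,\mathrm{Im}\,\frac{ic_\omega}{L^d}\int d^dr\,\langle\mathrm{tr}\,Q_{nn}\rangle
= 2c_\omega\,\mathrm{sgn}(\omega_n) + \cdots,$$

where the ellipsis represents contributions arising from terms of higher order in the generators
$B$. (In fact, one can show that all of these terms vanish in the replica limit.) Thus,
$c_\omega = -\pi\nu/2$. Together with $8c_0 = cD$ and $c = -2c_\omega = \pi\nu$, one obtains the low-energy action of
the nonlinear $\sigma$-model,

$$S[Q] = \frac{\pi\nu}{8}\int d^dr\,\mathrm{tr}\left[D(\nabla Q)^2 - 4\hat\omega Q\right]. \qquad (6.63)$$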

In applications, one is often interested in situations involving an external electromagnetic
field. How would such a field affect the structure of the action (6.63)? Although this
question can be answered by microscopic calculation, a more time-efficient strategy is to
require that the action $S[Q]$ be gauge invariant. Under a gauge transformation, $\psi \to e^{i\phi}\psi$,
the fields $\Psi$ enlarged by a time-reversal structure transform as (exercise) $\Psi \to e^{i\Phi}\Psi$, where
$\Phi = \mathrm{diag}(\phi(\tau), -\phi(-\tau))$ (matrix structure in time-reversal space). This in turn implies that
the fields $Q \sim \Psi\bar\Psi$ transform as $Q \to e^{i\Phi}Qe^{-i\Phi}$. Invariance under such transformations is
ensured (exercise) if we generalize the action according to

$$S[Q] = \frac{\pi\nu}{8}\int d^dr\,\mathrm{tr}\left[D\big((\nabla - i[\mathbf{A},\ ])Q\big)^2 - 4(\hat\omega + iV)Q\right], \qquad (6.64)$$

where the vector potential $\mathbf{A}$ and scalar potential $V$ transform in the standard manner
$\mathbf{A} \to \mathbf{A} + \nabla\phi$ and $V \to V + \partial_\tau\phi$. This way of coupling a gauge potential to an effective
action - to be discussed in more general terms in the next chapter - is known as minimal
substitution.


6.6 Summary and outlook 

This concludes our preliminary discussion of functional mean-field methods. We have learned 
how the integration over microscopic quantum fields can be traded for an integration over 
degrees of freedom adjusted to the low-energy characteristics of the system. We found that 
the functional dependence of the action on the new coordinates is usually highly nonlinear - 
the notorious “trace logarithms” - and has to be dealt with by some kind of stationary 
phase analysis. While a first principles solution of the mean-field equations is often not 
possible, all applications discussed in this chapter shared the feature that solutions could be 
found by an “educated guess.” Importantly, we realized that these solutions did not always 
display the full symmetry of the action. 

INFO However, it is important to realize that our discussion has excluded a number of problem
classes with complex mean-field structure. For example, for certain lattice systems with 
discrete translational symmetry, the optimal mean-field solution shows staggering, i.e. changes 
sign under translation by one lattice spacing. Deceptively, these configurations usually exist in 
parallel to a solution that displays the full symmetry of the problem, but is of higher energy 
(and therefore unphysical), i.e. the “true solution” is camouflaged by the existence of an inferior 
but “more symmetric” solution.[82] Then, there are systems where the existence of meaningful
stationary phase configurations is excluded by structural elements, such as frustration. 

Other systems, notably glasses (cf. the remarks on page 115), do possess an entire contin- 
uum of mean-field configurations, i.e. there is a macroscopic number of (metastable) extremal 
configurations that are very close in energy space but may be very far from each other in config- 
uration space. (For example, two such configurations may differ by a restructuring of the atomic 
configuration of a glass at remote places.) The dynamics of such systems is governed by the 
process of aging, i.e. transitions between different metastable extremal configurations on very 
long timescales (witness the apparent rigidity of window glass!). It goes without saying that the 
“mean-field configurations” of glassy systems are irregular and not amenable to straightforward 
analytic calculation. 

Finally, even systems possessing a spatially homogeneous mean-field configuration may host 
other solutions of physical significance. Indeed, we have encountered a scenario of this type 
already in Chapter 3 above. Exploring quantum double-well tunneling, we observed that the 
imaginary time Euler-Lagrange equations of a particle in a doubly degenerate harmonic well
(the “mean-field equations”) had a metastable constant solution (the constant mean-field).
However, in addition to that, we found instantons, i.e. temporally structured solutions of the 
mean-field equations. Superficially, it seemed that, due to their non-vanishing action, instantons 
are irrelevant. However, for sufficiently long times, the energetic inferiority of the single instanton 
was over-compensated by statistical aspects, i.e. the fact that a continuum of instantons entered 
the partition function with approximately equal weight. In the language of statistical mechanics:
the large entropy associated with instanton configurations outweighed the finite energy (or,
more precisely, action) to be paid for a single instanton. Later we will see that entropy/energy
competitions of this type are realized in many other contexts of quantum and statistical field
theory. For example, the phase actions of the BCS superconductor or the superfluid possess
(approximate) extremal configurations wherein the phase winds around an integer multiple of
$2\pi$ as a fixed reference point in space is encircled (vortices). While the energy of an individual
vortex solution may be high, the entropy corresponding to the choice of its center eventually, for
sufficiently large temperatures, dominates and leads to a proliferation of vortices in the system
(the analog of the instanton gas). For a more qualified discussion of topologically non-trivial
mean-field configurations we refer to Chapter 9.

[82] When dealing with problems whose lattice structure carries physical significance, it is therefore a good idea to
try a number of different ansatze, i.e. trial solutions that transform differently under translation by a unit cell.

Finally, we have explored a number of different applications showing the way low-energy 
effective actions describing fluctuations on top of a reference mean-field can be derived and 
evaluated. In cases where the mean-field breaks a continuous symmetry, these actions were 
partially soft, i.e. contained Goldstone modes. We saw that the presence of a Goldstone 
mode has a dominant effect on the observable physical properties of a system. 

As exemplified above, the theoretical machinery developed thus far enables us to tackle 
already highly non-trivial problems. However, to make the functional integral a universally 
applicable tool, two important gaps have yet to be filled. Firstly, we have so far mostly 
been content with discussing general structures of condensed matter theory (i.e. much of 
the discussion has been limited to general manipulations on the level of the effective action.) 
However, in applications, one is typically interested in comparison with experimentally 
accessible data, i.e. we still need to provide links relating the field integral formalism to 
observables of experimental significance. Secondly, our discussion of the fluctuation behav- 
ior of field theories has been quite limited. That is, we have typically expanded an action to 
second order in what we believed was the “physically relevant” degree of freedom and com- 
puted the corresponding determinants. However, we also saw that some problems contain 
anharmonicities of profound significance. For example, the quartic term in $\phi^4$-theory led
to UV singularities whose nature remained largely obscure. We must, therefore, (i) develop 
criteria indicating in what circumstances a model can safely be limited to second order and, 
(ii) in cases where it cannot, learn how to deal with its anharmonic content. Beginning 
with the first, these two topics (which are not quite as unrelated as one might think) are 
discussed in Chapters 7 and 8. 

6.7 Problems

Peierls instability 


Previously, in Problem 2.4 we have seen that the half-filled one-dimensional lattice
system is unstable towards the formation of a commensurate periodic lattice distortion.
In fact, at zero temperature, a one-dimensional crystal comprising electrons and ions is
always unstable against the formation of a lattice distortion which transforms the system
to a dielectric. The aim of this problem is to study the nature of the lattice instability
known as a Peierls distortion.


Rudolph E. Peierls 1907-95

Born in Germany, Peierls did fundamental work during the early years of quantum mechanics.
He also studied the physics of lattice vibrations (phonons) and is credited with the
development of the concept of the “hole” in condensed matter. In collaboration with Otto
Frisch, Peierls was the first to realize that an atom bomb based on $^{235}$U would be
feasible, and was engaged in the Manhattan Project.
(Image © University of Cambridge, Cavendish Laboratory.)



The starting point of the theory is the Euclidean action for a one-dimensional gas of electrons,

$$S_{\rm el}[\bar\psi,\psi] = \int d\tau\int dx\;\bar\psi\left(\partial_\tau - \frac{\partial_x^2}{2m} - \mu\right)\psi.$$

(Here, for simplicity, we take the electrons to be spinless.) Starting with a classical field
theory of the lattice dynamics, the Euclidean action of the system of ions is given by

$$S_{\rm ph}[u] = \frac{\rho}{2}\int d\tau\int dx\,\left[(\partial_\tau u)^2 + c^2(\partial_x u)^2\right],$$

where $u(\tau, x)$ denotes the scalar bosonic displacement field. Finally, to complete the theory,
we suppose that the electron system is coupled to the distortion of the underlying lattice.
In principle, one could try to derive a “realistic” model of the coupling. However, since we
are interested in the general stability of the system, we will consider a phenomenological
coupling which engages the fields in the lowest order compatible with the symmetry of
the interaction (for more details, see Problem 4.5). To this end, we introduce “Fröhlich’s
deformation potential approximation,” wherein the coupling is taken to be proportional to
the gradient of the deformation $\partial_x u$,

$$S_{\rm el-ph}[\bar\psi,\psi,u] = g\int d\tau\int dx\;\bar\psi\psi\,\partial_x u.$$

The full Euclidean action $S = S_{\rm el} + S_{\rm ph} + S_{\rm el-ph}$ represents an interacting theory. As such,
an explicit and exact evaluation of the coherent state path integral is not feasible. Instead,
we will develop an analysis in which the coupling will be treated perturbatively. In doing
so, we will be able to explore the stability of the lattice system.

(a) As a first step, integrate out the fermionic degrees of freedom $\psi$ and thereby obtain
an effective action for the displacement field $u$. Assuming that the electron-phonon
coupling constant $g$ is small, expand the action up to second order in $u$. You will find
that the coefficient of $u^2$ involves the density-density response function $\chi(q, \omega_m)$.

(b) In general, when seeking a saddle-point of the effective action, the first guess would
be a homogeneous displacement field, $u_0(x, \tau) = u_0$. Here, $u_0 = 0$ is such a solution.
However, it is not necessarily the best solution. Show that the static solution $u_0(x, \tau) =
u_0\cos(2k_Fx + \varphi)$ is energetically favorable (i.e. $S[u = u_0\cos(2k_Fx + \varphi)] < S[u = 0]$) below
a certain critical temperature $T_c$. Thus, at low temperatures, the system is unstable
towards the formation of a static sinusoidal lattice distortion. Use the approximation
$\chi(2k_F, 0) \simeq \ln(\beta\omega_D)/4\pi v_F$, where $\omega_D$ is the Debye frequency, to determine the transition
temperature $T_c$.

Answer: 

(a) Integrating out the electron degrees of freedom $\psi$, one obtains

$$\int D[\bar\psi,\psi]\,e^{-S_{\rm el} - S_{\rm el-ph}} = \det\left[\partial_\tau - \frac{\partial_x^2}{2m} - \mu + g(\partial_x u)\right].$$

Thus, the effective action for the displacement field $u$ reads $S_{\rm eff}[u] = S_{\rm ph}[u] -
\mathrm{tr}\ln\left[\partial_\tau - (\partial_x^2/2m) - \mu + g(\partial_x u)\right]$. While the bare phonon action is diagonal in
frequency-momentum representation, the “tr ln” is not. Thus, the evaluation of the
second term requires more care. Defining the bare Green function of the electron
system $G_0^{-1} = \partial_\tau - \partial_x^2/2m - \mu$, an expansion up to second order in the coupling
$g$ gives $\mathrm{tr}\ln[G_0^{-1} + g(\partial_x u)] \simeq \mathrm{tr}\ln G_0^{-1} + (g^2/2)\sum_{\omega_m,q} q^2\chi(q,\omega_m)|u_{\omega_m,q}|^2$, where
$\chi(q,\omega_m) = -\frac{T}{L}\sum_{p,\omega_n} G_{0,p,\omega_n}G_{0,p+q,\omega_n+\omega_m}$ and we note that the linear term of the
expansion vanishes by symmetry. Thus, to quadratic order in the displacement field,
the effective action assumes the form

$$S_{\rm eff}[u] \simeq \sum_{\omega_m,q}\left[\frac{\rho}{2}\left(\omega_m^2 + c^2q^2\right) - \frac{g^2}{2}q^2\chi(q,\omega_m)\right]|u_{\omega_m,q}|^2.$$

(b) In the presence of the static periodic modulation, $u_{\omega_m,q} = \pi u_0\left[e^{i\varphi}\delta(q - 2k_F) + e^{-i\varphi}\delta(q + 2k_F)\right]\delta(\omega_m)$,
the action takes the value $S_{\rm eff} = 4\pi^2u_0^2k_F^2\left[\rho c^2 - g^2\chi(2k_F, 0)\right]$. Thus, noting
that $S[u = \mathrm{const.}] = 0$, the transition is realized when $\rho c^2 < g^2\chi(2k_F, 0)$.
Substituting the expression for $\chi$, one obtains the relation for the critical temperature
$T_c$: $\rho c^2 = \frac{g^2}{4\pi v_F}\ln(\omega_D/T_c)$, i.e. at temperatures $T < T_c = \omega_D\exp\left(-4\pi v_F\rho c^2/g^2\right)$, the system
becomes unstable against the formation of a periodic modulation of the lattice with a
wavelength of $\pi/k_F$.
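
The instability criterion can be made concrete with a few lines of code. The sketch below assumes units with $\hbar = k_B = 1$ and purely illustrative parameter values (none are taken from the text); the function names are hypothetical. It evaluates the bracket $\rho c^2 - g^2\chi(2k_F,0)$ with the logarithmic approximation for $\chi$ quoted in the problem and checks that it changes sign at $T_c = \omega_D\exp(-4\pi v_F\rho c^2/g^2)$.

```python
import math

def chi_2kf(temperature, omega_d, v_f):
    """Approximate static response chi(2k_F, 0) ~ ln(omega_D / T) / (4 pi v_F)."""
    return math.log(omega_d / temperature) / (4.0 * math.pi * v_f)

def peierls_tc(omega_d, v_f, rho, c, g):
    """Temperature at which rho c^2 = g^2 chi(2k_F, 0)."""
    return omega_d * math.exp(-4.0 * math.pi * v_f * rho * c ** 2 / g ** 2)

def stiffness_bracket(temperature, omega_d, v_f, rho, c, g):
    """rho c^2 - g^2 chi(2k_F, 0); negative values signal the instability."""
    return rho * c ** 2 - g ** 2 * chi_2kf(temperature, omega_d, v_f)

# Illustrative parameters only (assumed, not from the text).
params = dict(omega_d=1.0, v_f=1.0, rho=1.0, c=0.5, g=2.0)
tc = peierls_tc(**params)

print("T_c =", tc)
print("bracket at T_c/2:", stiffness_bracket(0.5 * tc, **params))
print("bracket at 2 T_c:", stiffness_bracket(2.0 * tc, **params))
```

Because $T_c$ depends exponentially on $1/g^2$, a weak coupling pushes the transition to very low temperatures without ever removing it, which is the content of the statement that the one-dimensional system is always unstable at zero temperature.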


Temperature profile of the BCS gap 

The following question concerns an exploration of the functional dependence of the BCS gap on tem- 
perature. In particular, we wish to understand its singular vanishing at the transition temperature (this 
being a hallmark of a second-order phase transition). 
Consider the BCS gap equation (6.28),

$$\frac{1}{g\nu} = \int_0^{\omega_D/2T} dx\,\frac{\tanh(x^2 + \kappa^2)^{1/2}}{(x^2 + \kappa^2)^{1/2}},$$

where we have introduced a dimensionless integration variable and $\kappa = \Delta/2T$. In spite of its
innocent appearance, the temperature dependence of this equation is not straightforward to
infer. Referring for a quantitative discussion to Abrikosov et al.,[83] we here restrict ourselves
to exploring the gap profile in the vicinity of the transition temperature.

(a) To determine the value of $T_c$, we proceed somewhat indirectly assuming that this is
determined through the condition $\Delta(T_c) = 0$. Use this criterion to obtain Eq. (6.29) for
the critical temperature. (You may assume the hierarchy of energy scales $\omega_D \gg T_c \sim
\Delta_0 \gg \Delta(T \simeq T_c)$, where $\Delta_0 = \Delta(T = 0)$.)

(b) Now, let us derive the approximate profile Eq. (6.30) of the gap for temperatures $T$
slightly smaller than $T_c$. To this end, add to and subtract from the right-hand side of
the gap equation the integral $\int_0^{\omega_D/2T} dx\,\frac{\tanh x}{x}$. Then expand to leading order in the small
parameters $\delta T/T_c$ and $\Delta/T_c$, where $\delta T = T_c - T > 0$.

Answer: 


(a) For $\Delta = 0$, $\lambda(\xi) = (\xi^2 + \Delta^2)^{1/2} = |\xi|$ and Eq. (6.28) assumes the form

$$\frac{1}{g\nu} = \int_0^{\omega_D/2T_c} dx\,\frac{\tanh x}{x},$$

where we have introduced $x = \xi/2T_c$ as a dimensionless integration variable. The dominant
contribution to the integral comes from the region $x \gg 1$, where $\tanh x \simeq 1$. As a
result, one obtains $1/g\nu \simeq \ln(\omega_D/2T_c)$. Solving for $T_c$, we arrive at Eq. (6.29).
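
The quality of the logarithmic estimate can be checked numerically. The sketch below (a hand-rolled Simpson rule; the function names are hypothetical) integrates $\tanh x/x$ up to a large cut-off and compares the result with $\ln(\omega_D/2T_c)$. The difference approaches the constant $\ln(4e^{\gamma}/\pi) \approx 0.819$, where $\gamma$ is Euler's constant; this refinement is a standard result that is not stated in the text and is quoted here as an assumption to be tested. It changes the numerical prefactor of $T_c$ but not its exponential dependence on $1/g\nu$.

```python
import math

def simpson(f, a, b, n):
    """Composite Simpson rule with an even number n of sub-intervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def integrand(x):
    # tanh(x)/x with its x -> 0 limit filled in explicitly
    return 1.0 if x == 0.0 else math.tanh(x) / x

def gap_integral(cutoff, n=20000):
    """Numerical value of int_0^cutoff dx tanh(x)/x, cutoff = omega_D / 2 T_c."""
    return simpson(integrand, 0.0, cutoff, n)

EULER_GAMMA = 0.5772156649015329
offset = math.log(4.0 * math.exp(EULER_GAMMA) / math.pi)  # ~ 0.8188

for cutoff in (10.0, 50.0):
    exact = gap_integral(cutoff)
    leading_log = math.log(cutoff)
    print(cutoff, exact, leading_log, exact - leading_log)
```

For both cut-offs the difference between the integral and the leading logarithm is the same constant, which confirms that the region $x \gg 1$ controls the cut-off dependence.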

(b) Adding and subtracting the integral given above, we have

$$\frac{1}{g\nu} = \int_0^{\omega_D/2T} dx\left[\frac{\tanh(x^2 + \kappa^2)^{1/2}}{(x^2 + \kappa^2)^{1/2}} - \frac{\tanh x}{x}\right] + \int_0^{\omega_D/2T} dx\,\frac{\tanh x}{x}.$$

Arguing as in (a), the second integral can be estimated as $\ln(\omega_D/2T) \approx \ln(\omega_D/2T_c) +
(\delta T/T_c) = (1/g\nu) + (\delta T/T_c)$, where we have expanded to linear order in $\delta T$. Thus,

$$\frac{\delta T}{T_c} \approx -\int_0^{\omega_D/2T} dx\left[\frac{\tanh(x^2 + \kappa^2)^{1/2}}{(x^2 + \kappa^2)^{1/2}} - \frac{\tanh x}{x}\right].$$

Now, the remaining integral can be split into a “low-energy region” $0 < x < 1$, and a
“high-energy region” $1 < x < \omega_D/2T$. Using the small-$x$ expansion, $\tanh x \simeq x - x^3/3$,
we find that the first region gives a contribution $\sim \kappa^2$. With $\tanh x \approx 1$, the second
region contributes a term $O(\kappa^2)$ which, however, is approximately independent of the
large-energy cut-off $\omega_D/2T$. Altogether, we obtain $\delta T/T_c \approx \mathrm{const.} \times \kappa^2 \approx \mathrm{const.} \times
(\Delta^2/T_c^2)$ from which one obtains Eq. (6.30).

[83] A. A. Abrikosov, L. P. Gorkov, and I. E. Dzyaloshinskii, Methods of Quantum Field Theory in Statistical
Physics (Dover Publications, 1975).


Fluctuation contribution to the Ginzburg-Landau action of the superconductor 

In this short problem we derive the energy cost corresponding to large-scale spatial fluctuations of the 
order parameter of a BCS superconductor. The problem mainly serves technical training purposes. 


Consider the second-order contribution to the Ginzburg-Landau action of the BCS superconductor
Eq. (6.31). In Problem 4.5 we have seen that the frequency summation involved
in the definition of the integral kernel evaluates to

$$\chi^c(\omega_n, q) \equiv \frac{T}{L^d}\sum_{m,p} G_p(i\omega_m)\,G_{-p+q}(-i\omega_m + i\omega_n)
= -\frac{1}{L^d}\sum_p \frac{1 - n_F(\xi_p) - n_F(\xi_{-p+q})}{i\omega_n - \xi_p - \xi_{-p+q}}.$$

Expand $\chi^c(0, q)$ to second order in $q$. (Hints: You may trade the momentum summation for
an integral, and linearize the dispersion: $\xi_{p+q} \simeq \xi_p + p\cdot q/m$ (think why this is a permissible
simplification). You may note the identity $\int d\epsilon\,\epsilon^{-1}\partial_\epsilon^2 n_F(\epsilon) = cT^{-2}$ (where the numerical
constant $c = 7\zeta(3)/2\pi^2$ and $\zeta(x) = \sum_{n=1}^\infty n^{-x}$ defines the $\zeta$-function).)
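
The identity quoted in the hint can be verified numerically before it is used. The sketch below is only a check under stated assumptions: it uses the closed form $\partial_\epsilon^2 n_F(\epsilon) = \tanh(\epsilon/2T)/(4T^2\cosh^2(\epsilon/2T))$ of the Fermi function, a hand-rolled Simpson rule, and a finite integration window of $40T$ (beyond which the integrand is exponentially small). The function names are hypothetical.

```python
import math

ZETA_3 = 1.2020569031595943  # Apery's constant zeta(3)

def simpson(f, a, b, n):
    """Composite Simpson rule with an even number n of sub-intervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def fermi_second_derivative_over_eps(eps, temperature):
    """(1/eps) d^2 n_F / d eps^2 for n_F = 1 / (exp(eps/T) + 1)."""
    if eps == 0.0:
        return 1.0 / (8.0 * temperature ** 3)  # limit eps -> 0
    y = eps / (2.0 * temperature)
    return math.tanh(y) / (4.0 * temperature ** 2 * math.cosh(y) ** 2) / eps

def identity_lhs(temperature, window=40.0, n=4000):
    """int d eps eps^-1 d^2 n_F; the integrand is even, so integrate eps > 0 twice."""
    f = lambda eps: fermi_second_derivative_over_eps(eps, temperature)
    return 2.0 * simpson(f, 0.0, window * temperature, n)

c = 7.0 * ZETA_3 / (2.0 * math.pi ** 2)

for temperature in (0.5, 1.0, 2.0):
    print(temperature, identity_lhs(temperature), c / temperature ** 2)
```

The two printed columns agree for each temperature, confirming both the numerical constant and the $T^{-2}$ scaling.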


Answer: 

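Using the fact that $\xi_p = \xi_{-p}$,

$$\chi^c(0, q) = \int\frac{d^dp}{(2\pi)^d}\,\frac{1 - n_F(\xi_{p+q/2}) - n_F(\xi_{p-q/2})}{\xi_{p+q/2} + \xi_{p-q/2}}$$

$$\phantom{\chi^c(0, q)} \approx \int\frac{d^dp}{(2\pi)^d}\,\frac{1 - 2n_F(\xi_p) - \partial_\xi^2 n_F(\xi_p)\,\frac{(q\cdot p)^2}{4m^2}}{2\xi_p}$$

$$\phantom{\chi^c(0, q)} = \chi^c(0, 0) - \frac{\nu\mu}{4md}\,q^2\int d\epsilon\,\epsilon^{-1}\partial_\epsilon^2 n_F(\epsilon) = \chi^c(0, 0) - \frac{c\,\nu\mu}{4md\,T^2}\,q^2,$$

where $c$ is the numerical constant introduced above, the factor $1/d$ results from the angular
average of $(q\cdot p)^2$, and in the last line we have used the fact that, for
$\xi = O(T)$, $p^2/2m \simeq \mu$. Substituting this expansion into the quadratic action, we obtain the
gradient term in Eq. (6.33).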


Coulomb blockade 

Technological advances have made it possible to manufacture small metallic or semiconducting devices 
of extension 1 μm and less. At low temperatures, the physics of these so-called quantum dots is
predominantly influenced by charging effects. In this problem, we explore the impact of charging on the 
most basic characteristic of a quantum dot, the tunneling DoS. 

Consider a quantum dot weakly[84] connected to an outside environment (see Fig. 6.18 for
a semiconductor realization of such a setup). The addition of an electron to, or subtraction
of one from, the device incurs an energy cost of order $E_C = e^2/2C$, where $C$ is the
capacitance of the system. This energy cost is offset by an external gate potential. The
discreteness of this charging energy leads to a plethora of observable physical phenomena.
The most basic of these is a strong suppression of the DoS at the Fermi surface.

[84] By “weak” we mean that the conductance of all external leads attached to the system is such that $g \ll g_0$,
where $g_0 = e^2/h \simeq (25.8\,\mathrm{k}\Omega)^{-1}$ is the quantum unit of conductance.

Figure 6.18 (a) Schematic picture of a confined two-dimensional electron gas (a quantum dot)
formed at the interface between a GaAs and an AlGaAs layer. (b) Electron microscopic image of
the “real” device. (Source: Courtesy of C. M. Marcus.)

For a non-interacting system, the single-particle DoS is defined as $\rho(\epsilon) = \mathrm{tr}\,\delta(\epsilon - \hat H) =
-\frac{1}{\pi}\mathrm{Im}\,\mathrm{tr}\,\hat G(\epsilon + i0) = -\frac{1}{\pi}\mathrm{Im}\,\mathrm{tr}(\hat G_n)|_{i\omega_n\to\epsilon+i0}$, where $\hat G(z) = (z - \hat H)^{-1}$ is the Green
function. The tunneling DoS generalizes this definition to the interacting case: $\nu(\epsilon) =
-\frac{1}{\pi}\mathrm{Im}\,\mathrm{tr}(G_n)|_{i\omega_n\to\epsilon+i0}$, where the Green function $G_n$ is defined[85] as the Fourier transform
of the coherent state path integral,

$$G_{\alpha\beta}(\tau) = Z^{-1}\int D(\bar\psi,\psi)\,e^{-S[\bar\psi,\psi]}\,\bar\psi_\beta(\tau)\psi_\alpha(0), \qquad (6.65)$$

where $Z = \int D(\bar\psi,\psi)\,e^{-S[\bar\psi,\psi]}$ and the indices $\alpha, \beta$ enumerate the eigenstates of the
single-particle contribution to the Hamiltonian. Having an irregular structure, the eigenstates of
the single-particle Hamiltonian are unknown. The simplest prototype Hamiltonian describing
the joint effects of single particle dynamics and charging reads as $\hat H = \hat H_0 + E_C(\hat N - N_0)^2$,
where $\hat H_0 = \sum_\alpha \epsilon_\alpha a_\alpha^\dagger a_\alpha$, $\hat N = \sum_\alpha a_\alpha^\dagger a_\alpha$ is the number operator and $N_0$ represents the
preferred number of particles (as set by the gate voltage). The action controlling the behavior
of the Green function (6.65) is thus given by

$$S[\bar\psi_\alpha,\psi_\alpha] = \int d\tau\left\{\sum_\alpha\bar\psi_\alpha(\partial_\tau + \epsilon_\alpha - \mu)\psi_\alpha + E_C\Big(\sum_\alpha\bar\psi_\alpha\psi_\alpha - N_0\Big)^2\right\}. \qquad (6.66)$$

(a) Introducing a bosonic field variable $V(\tau)$, decouple the interaction by means of a
Hubbard-Stratonovich transformation. Bring the functional representation of the Green
function into the form $G_\alpha(\tau) = Z^{-1}\int DV\,e^{-S[V]}Z^{[V]}G_\alpha^{[V]}(\tau)$, where $G_\alpha$ is the diagonal
element of the Green function (the representation above implies that all off-diagonal
elements vanish), $Z^{[V]}$ is the partition function, and $G^{[V]}$ represents the Green function
of the non-interacting system subject to an imaginary time-dependent potential $iV(\tau)$.

[85] The terminology of “tunneling” DoS is motivated by the fact that $\nu(\epsilon)$ is an important building block in the
calculation of tunneling currents. (Recall the “Golden Rule”: tunneling rates are obtained by multiplication of
transition probabilities with state densities.)

(b) Represent the field $V$ as a sum over Matsubara components, $V(\tau) = \sum_{m\neq 0} e^{-i\omega_m\tau}V_m +
2\pi kT + V_0$, where $k \in \mathbb{Z}$ and $V_0 \in [0, 2\pi T]$ (i.e. $2\pi kT + V_0$ is but a complicated
representation of the zeroth Matsubara mode). Show that all but the static component $V_0$ can
be removed from the action $S$ by a gauge transformation. Why can $V_0$ not be gauged,
too? Explore the transformation behavior of the Green function and integrate over the
non-zero mode components $V_{m\neq 0}$.

(c) Making use of the relation $\sum_{k=1}^{\infty}\frac{\cos(kx)}{k^2} = \frac{\pi^2}{6} - \frac{\pi|x|}{2} + \frac{x^2}{4}$, perform the Matsubara
summation. Show that the Green function can be expressed as $G(\tau) =
F(\tau)\tilde G(\tau)$, where the function $F(\tau) = \exp(-E_C(\tau - \beta^{-1}\tau^2))$ is obtained by integration
over the dynamical components of $V$, while $\tilde G$ is a non-interacting Green function
averaged over the static component $V_0$.

(d) The remaining integration over the static component $V_0$ is achieved by the stationary
phase method. Neglecting the weak dependence of the non-interacting Green function
on $V_0$, derive and interpret the saddle-point equation. Approximate the functional by
its value at the saddle-point (i.e. neglect quadratic fluctuations around the saddle-point
value for $V_0$). As a result, obtain a representation $G(\tau) = F(\tau)G_0(\tau)$ where $G_0$ is a
non-interacting Green function evaluated at a renormalized chemical potential.

(e) Assuming $E_C \gg T$, compute an approximation of the Fourier transform of $F(\tau)$. Use
your result to obtain the zero-temperature DoS.[86] (You may approximate Matsubara
sums by integrals.)

Answer: 


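(a) Using the identity $e^{-E_C\int d\tau\,(\sum_\alpha\bar\psi_\alpha\psi_\alpha - N_0)^2} = \int DV\,e^{-\int d\tau\left(\frac{V^2}{4E_C} - iN_0V + iV\sum_\alpha\bar\psi_\alpha\psi_\alpha\right)}$, the
quantum partition function takes the form

$$Z = \int D(\bar\psi,\psi)\int DV\,e^{-S[V] - \int d\tau\sum_\alpha\bar\psi_\alpha(\partial_\tau + \epsilon_\alpha - \mu + iV)\psi_\alpha},$$

where $S[V] = \int d\tau\left(\frac{V^2}{4E_C} - iN_0V\right)$. Thus,

$$G_\alpha(\tau) = Z^{-1}\int DV\,e^{-S[V]}\int D(\bar\psi,\psi)\,e^{-\int d\tau\sum_{\alpha'}\bar\psi_{\alpha'}(\partial_\tau + \epsilon_{\alpha'} - \mu + iV)\psi_{\alpha'}}\,\bar\psi_\alpha(\tau)\psi_\alpha(0)$$

$$\phantom{G_\alpha(\tau)} = Z^{-1}\int DV\,e^{-S[V]}Z^{[V]}G_\alpha^{[V]}(\tau),$$

where $G^{[V]}$ is obtained from a non-interacting theory with action
$S^{[V]}[\bar\psi,\psi] = \int d\tau\sum_\alpha\bar\psi_\alpha(\partial_\tau + \epsilon_\alpha - \mu + iV)\psi_\alpha$.

[86] For a more elaborate analysis of finite-temperature corrections, we refer to A. Kamenev and Y. Gefen, Zero
bias anomaly in finite-size systems, Phys. Rev. B 54 (1996), 5428-37.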
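(b) The gauge transformation removing much of the time-dependent potential from the “tr
ln” is defined by

$$\bar\psi_\alpha(\tau) \to e^{i\int_0^\tau d\tau'\,(V(\tau') - V_0)}\,\bar\psi_\alpha(\tau), \qquad \psi_\alpha(\tau) \to e^{-i\int_0^\tau d\tau'\,(V(\tau') - V_0)}\,\psi_\alpha(\tau).$$

The zero-mode offset $V_0$ has to be excluded from the transformation to preserve the
time periodicity of the gauge factor (i.e. to make sure that the transformed field respects
the time-antiperiodicity required of a Grassmann field). Substitution of the transformed
field leads to (i) removal of the dynamic $V$-components from the action, $S^{[V]} \to S^{[V_0]}$,
and (ii) the appearance of a gauge factor multiplying the pre-exponential terms. We
thus obtain

$$G(\tau) = \frac{1}{Z}\int DV\,D(\bar\psi,\psi)\,e^{-S[V]}\,e^{-\int d\tau\sum_\alpha\bar\psi_\alpha[\partial_\tau - \mu + iV_0 + \epsilon_\alpha]\psi_\alpha}\,e^{i\int_0^\tau d\tau'\,(V(\tau') - V_0)}\,\bar\psi(\tau)\psi(0)$$

$$\phantom{G(\tau)} = \frac{1}{Z}\int DV\,e^{-S[V]}\,e^{i\int_0^\tau d\tau'\,(V(\tau') - V_0)}\,Z^{[V_0]}G^{[V_0]}(\tau)$$

$$\phantom{G(\tau)} = F(\tau)\,\frac{1}{Z}\int dV_0\,e^{-\frac{\beta}{4E_C}V_0^2 + i\beta N_0V_0}\,Z^{[V_0]}G^{[V_0]}(\tau),$$

where we have omitted the $2\pi kT$ contribution to the zero mode (as it does not play
much of a role in the context of this problem - for the physical meaning of the integers
$k$ see Chapter 9), the function

$$F(\tau) = \prod_{n\neq 0}\int dV_n\,e^{-\frac{\beta}{4E_C}V_nV_{-n} + \frac{V_n}{\omega_n}\left(1 - e^{-i\omega_n\tau}\right)} = \exp\left(-2E_CT\sum_{n\neq 0}\frac{1 - e^{-i\omega_n\tau}}{\omega_n^2}\right)$$

(with the Gaussian measure understood to be normalized to unity), and in the second equality
we have performed the Gaussian integral over $V_{n\neq 0}$.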

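(c) Using the formulae given above, the Matsubara summation gives

$$-2E_CT\sum_{n\neq 0}\frac{1}{\omega_n^2}\left(1 - \exp(-i\omega_n\tau)\right) = -E_C\left(|\tau| - \beta^{-1}\tau^2\right),$$

i.e. $F(\tau) = \exp[-E_C(|\tau| - \beta^{-1}\tau^2)]$, and $G_\alpha(\tau) = F(\tau)\tilde G_\alpha(\tau)$, where

$$\tilde G_\alpha(\tau) = Z^{-1}\int dV_0\,e^{-\frac{\beta}{4E_C}V_0^2 + i\beta N_0V_0}\,Z^{[V_0]}G_\alpha^{[V_0]}(\tau).$$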
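
The Matsubara sum in (c) can be confirmed by brute force. The following sketch assumes bosonic frequencies $\omega_n = 2\pi nT$ and arbitrary illustrative values of $E_C$, $T$, and $\tau$; the function names are hypothetical. It truncates the sum at a large $|n|$ and compares the result with the closed form $-E_C(|\tau| - \beta^{-1}\tau^2)$.

```python
import math

def charging_exponent_sum(tau, e_c, temperature, n_max=200000):
    """-2 E_C T sum_{n != 0} (1 - exp(-i w_n tau)) / w_n^2, truncated at |n| = n_max.

    The imaginary parts cancel between +n and -n, leaving 2 (1 - cos) per pair.
    """
    total = 0.0
    for n in range(1, n_max + 1):
        omega = 2.0 * math.pi * n * temperature
        total += 2.0 * (1.0 - math.cos(omega * tau)) / omega ** 2
    return -2.0 * e_c * temperature * total

def charging_exponent_closed(tau, e_c, temperature):
    """Closed form -E_C (|tau| - tau^2 / beta) with beta = 1 / T."""
    return -e_c * (abs(tau) - temperature * tau ** 2)

e_c, temperature = 1.0, 0.1  # illustrative values, beta = 10
for tau in (0.5, 3.0, 7.5):
    print(tau,
          charging_exponent_sum(tau, e_c, temperature),
          charging_exponent_closed(tau, e_c, temperature))
```

The exponent is symmetric under $\tau \to \beta - \tau$, as required for a bosonic correlation function, and vanishes at both ends of the imaginary time interval.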

(d) Defining a $V_0$-dependent free energy by $Z^{[V_0]} = \exp(-\beta F^{[V_0]})$, noting that $V_0$ shifts
the chemical potential, $F^{[V_0]}(\mu) = F(\mu - iV_0)$, and neglecting the $V_0$-dependence of
$G^{[V_0]} \simeq G$, we obtain the saddle-point equation

$$0 = \frac{\partial}{\partial V_0}\left(\frac{\beta}{4E_C}V_0^2 - i\beta N_0V_0 + \beta F(\mu - iV_0)\right) = \beta\left(\frac{V_0}{2E_C} - iN_0 + i\langle N\rangle_{\mu - iV_0}\right),$$

where in the second equality we have used the fact that $\partial_{V_0}F(\mu - iV_0) = -i\partial_\mu F(\mu - iV_0) = i\langle N\rangle_{\mu - iV_0}$.
Substituting the solution of the saddle-point equation $\bar V_0 = 2iE_C(N_0 - \langle N\rangle_{\bar\mu})$ amounts to
replacing the chemical potential $\mu$ by an effective chemical potential
$\bar\mu = \mu + 2E_C(N_0 - \langle N\rangle_{\bar\mu})$. As a preliminary result we thus obtain
$G_\alpha(\tau) = F(\tau)G_{0\alpha}(\tau)$, where the non-interacting Green function is evaluated at the
renormalized chemical potential. (In passing we note that the condition for the applicability
of the saddle-point approximation reads as (Exercise: Why?) $1/(2E_C) - \partial_{\bar\mu}\langle N\rangle_{\bar\mu} \gg \beta$.)
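The self-consistency condition for the effective chemical potential, $\bar\mu = \mu + 2E_C(N_0 - \langle N\rangle_{\bar\mu})$, can be solved numerically for a toy spectrum. The sketch below is only an illustration: the level list, the function names, and the choice of bisection are assumptions made here, not part of the text.

```python
import math


def mean_occupation(mu, levels, temperature):
    """<N>_mu for non-interacting levels: sum of Fermi functions."""
    return sum(0.5 * (1.0 - math.tanh((e - mu) / (2.0 * temperature))) for e in levels)


def effective_mu(mu, n0, e_c, levels, temperature, tol=1e-10):
    """Solve mu_bar = mu + 2 E_C (N_0 - <N>_{mu_bar}) by bisection (E_C >= 0).

    g(x) = x - mu - 2 E_C (N_0 - <N>_x) increases monotonically with x,
    so the root is unique and bisection is safe.
    """
    def g(x):
        return x - mu - 2.0 * e_c * (n0 - mean_occupation(x, levels, temperature))

    span = 2.0 * e_c * (abs(n0) + len(levels)) + 1.0
    lo, hi = mu - span, mu + span  # g(lo) < 0 < g(hi) by construction
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # Hypothetical equally spaced levels, symmetric about zero.
    levels = [0.1 * k for k in range(-50, 51)]
    print(effective_mu(0.0, 53.0, 1.0, levels, 0.05))
```

If $N_0$ equals the occupation at the bare chemical potential, the solver returns $\bar\mu = \mu$; a gate-induced mismatch shifts $\bar\mu$ until the charge imbalance is compensated.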
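(e) For $E_C \gg T$, the dominant contribution to the Fourier transform of $F(\tau)$ comes from
the boundary regions of the imaginary time interval, $\tau \ll \beta$ and $\beta - \tau \ll \beta$. Linearizing
the exponent of $F$ in these regions we obtain

$$F_m = \int_0^\beta d\tau\, e^{i\omega_m\tau}F(\tau) \simeq \int_0^\infty d\tau\,\left(e^{i\omega_m\tau} + e^{-i\omega_m\tau}\right)e^{-E_C\tau} = \frac{2E_C}{E_C^2 + \omega_m^2}.$$

Using the fact that $G_{0n\alpha} = (i\omega_n - \xi_\alpha)^{-1}$, where $\xi_\alpha = \epsilon_\alpha - \bar\mu$, we then obtain

$$G_{n\alpha} = T\sum_m F_m G_{0(n-m)\alpha} \simeq \int\frac{d\omega}{2\pi}\,\frac{2E_C}{E_C^2 + \omega^2}\,\frac{1}{i\omega_n - i\omega - \xi_\alpha} = \frac{1}{i\omega_n - E_C\,\mathrm{sgn}(\xi_\alpha) - \xi_\alpha}.$$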
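The linearization used in part (e) can be checked numerically. The sketch below integrates $F(\tau) = \exp[-E_C(\tau - \tau^2/\beta)]$ against $e^{i\omega_m\tau}$ with a simple midpoint rule and compares the result with the Lorentzian $2E_C/(E_C^2 + \omega_m^2)$ expected for $E_C \gg T$. Function names, the step count, and the parameter values are assumptions chosen for illustration.

```python
import math


def f_fourier(m, beta, e_c, n_steps=200000):
    """Midpoint-rule estimate of F_m = int_0^beta dtau exp(i w_m tau) F(tau).

    F(tau) = exp(-E_C (tau - tau^2 / beta)) is symmetric under tau -> beta - tau,
    so only the cosine part of exp(i w_m tau) survives.
    """
    w_m = 2.0 * math.pi * m / beta
    h = beta / n_steps
    total = 0.0
    for k in range(n_steps):
        tau = (k + 0.5) * h
        total += math.cos(w_m * tau) * math.exp(-e_c * (tau - tau * tau / beta))
    return total * h


def lorentzian(m, beta, e_c):
    """Linearized result 2 E_C / (E_C^2 + w_m^2), expected for E_C >> T."""
    w_m = 2.0 * math.pi * m / beta
    return 2.0 * e_c / (e_c * e_c + w_m * w_m)


if __name__ == "__main__":
    for m in (0, 1, 3):
        print(m, f_fourier(m, 1.0, 200.0), lorentzian(m, 1.0, 200.0))
```

For $E_C\beta = 200$ the two values differ at the percent level, which is the expected size of the corrections of relative order $T/E_C$.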

From this result we obtain the DoS

$$\nu(\epsilon) = \sum_\alpha \delta(\epsilon - E_C\,\mathrm{sgn}(\xi_\alpha) - \xi_\alpha) = \int d\omega\,\nu_0(\omega)\,\delta(\epsilon - E_C\,\mathrm{sgn}(\omega) - \omega) = \nu_0(\epsilon - E_C\,\mathrm{sgn}(\epsilon))\,\Theta(|\epsilon| - E_C),$$

where $\nu_0$ is the DoS of the non-interacting system. Due to the large charging energy,
the single-particle DoS vanishes in a window of width $2E_C$ centered around the Fermi
energy: the Coulomb blockade. Particles of energy $\epsilon > E_C$ larger than the charging
threshold are free to enter the dot. However, in doing so they lose an amount $E_C$ of
(charging) energy, which explains the energy shift in the factor $\nu_0$.
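The Coulomb-blockade DoS above is simple enough to write down as a function. The following sketch evaluates $\nu(\epsilon) = \nu_0(\epsilon - E_C\,\mathrm{sgn}(\epsilon))\,\Theta(|\epsilon| - E_C)$ for a user-supplied bare DoS; the function name and the default constant $\nu_0$ are illustrative assumptions.

```python
def coulomb_blockade_dos(eps, e_c, nu0=lambda e: 1.0):
    """nu(eps) = nu0(eps - E_C sgn(eps)) * Theta(|eps| - E_C).

    eps is measured from the Fermi energy; nu0 is the bare density of states.
    """
    if abs(eps) <= e_c:
        return 0.0  # inside the blockade window of width 2 E_C
    shift = e_c if eps > 0 else -e_c
    return nu0(eps - shift)


if __name__ == "__main__":
    for eps in (-2.0, -0.5, 0.0, 0.5, 2.0):
        print(eps, coulomb_blockade_dos(eps, 1.0))
```

With a constant $\nu_0$ the output is zero for $|\epsilon| \le E_C$ and equal to $\nu_0$ outside the gap.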


Action of a tunnel junction 

In the previous problem, we considered the physics of a perfectly isolated quantum dot. However, in 
practice (see, e.g., the dot depicted in the last problem) the system is usually connected to an external 
environment by some leads. It is the purpose of this problem to derive an effective action accounting 
for the joint effect of charging and the coupling to an environment. 


Consider a quantum dot connected to an external lead (it is straightforward to generalize
to the presence of several leads). For simplicity, we model the latter as an ideal wave-guide,
i.e. the eigenstates $\psi_a$ are plane waves whose detailed structure we need not specify. The
composite system is described by an action
$S[\bar\psi_\alpha,\psi_\alpha,\bar\psi_a,\psi_a] = S_{\rm dot}[\bar\psi_\alpha,\psi_\alpha] + S_{\rm lead}[\bar\psi_a,\psi_a] + S_T[\bar\psi_\alpha,\psi_\alpha,\bar\psi_a,\psi_a]$,
where $S_{\rm dot}$ is given by Eq. (6.66),

$$S_{\rm lead}[\bar\psi_a,\psi_a] = \sum_a\int d\tau\;\bar\psi_a(\partial_\tau + \epsilon_a - \mu)\psi_a,$$


and the coupling between dot and lead is described by

$$S_T[\psi_\alpha,\bar\psi_\alpha,\psi_a,\bar\psi_a] = \sum_{\alpha a}\int d\tau\;\bar\psi_\alpha T_{\alpha a}\psi_a + \text{h.c.}$$

Throughout we will assume that the coupling is sufficiently weak that contributions of
$O(T^4)$ to the effective action are negligibly small - the “tunneling approximation.”

(a) Proceeding as in the previous problem, decouple the charging interaction by a Hubbard- 
Stratonovich transformation. Integrate out the fermions and subject the problem to 
the same gauge transformation as used above to remove the dynamical contents of the 
Hubbard-Stratonovich field V. You will observe that the gauge phase transforms the 
coupling matrices T. 

(b) Expand the action to leading (i.e. second) order in the coupling matrix elements $T_{\alpha a}$.
(You may ignore the integration over the static component of the Hubbard-Stratonovich
field; as discussed above, it leads to merely a shift of the chemical potential.) Assuming
that the single-particle DoS of dot and lead do not vary significantly on the energy scales
at which the field $V$ fluctuates, determine the dependence of the tunneling term on the
gauge phase $\phi(\tau) = \int^\tau d\tau'\,(V(\tau') - V_0)$ and identify its coupling constant as the Golden
Rule tunneling rate $4g_T$.87 (To obtain a finite result, you will need to regularize the
action by subtracting from the tunneling action $S_{\rm tun}[\phi]$ the constant $S_{\rm tun}[0]$.) Expressing
the charging action $S_C[V]$ in terms of the gauge phase, $V = \dot\phi + V_0$ (and neglecting the
constant offset $V_0$), the complete dissipative tunneling action takes the form




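$$S[\phi] = \int d\tau\,\frac{\dot\phi^2}{4E_C} - g_T\int d\tau\,d\tau'\;\frac{\sin^2\left((\phi(\tau) - \phi(\tau'))/2\right)}{\sin^2(\pi T(\tau - \tau'))}. \qquad (6.67)$$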


INFO The action (6.67) was first derived by Ambegaokar, Eckern, and Schön.88 Crudely,
its behavior mirrors that of the quantum dynamics of a particle on a ring with kinetic energy
$\sim \dot\phi^2/E_C$ and subject to a dissipative damping mechanism of strength $\sim g_T$. Physically, the
latter describes the dissipation of the energy stored in dynamical voltage fluctuations $V \sim \dot\phi$ into
the microscopic degrees of freedom of the quasi-particle continuum.

In the absence of dissipation, the action describes the ballistic motion of a quantum point
particle on a ring. The ring topology reflects the $2\pi$-periodicity of the quantum phase, which in
turn relates to the quantization of charge (recall that charge and phase are canonically conjugate).
It is, thus, no surprise that the periodicity of the $\phi$-dynamics is the main source of charge
quantization phenomena in the AES approach. For strong ($g_T > 1$) dissipation, the particle
begins to forget that it actually moves on a ring (i.e. full traversals of the ring get increasingly less
likely). This damping manifests itself in a massive suppression of charge quantization phenomena.
Indeed, for increasing coupling between lead and dot, the charge on the latter begins to fluctuate
and is no longer effectively quantized. For a detailed account of the physical phenomena relating
to the crossover from weak to strong charge quantization, we refer to one of several reviews.89


87 We denote the tunneling rate by $g_T$ because (see Problem 7.6.3) it is but (four times) the classical conductance
of the tunneling barrier, measured in units of the conductance quantum $e^2/h$.

88 U. Eckern, G. Schön, and V. Ambegaokar, Quantum dynamics of a superconducting tunnel junction, Phys.
Rev. B 30 (1984), 6419-31.

89 See, e.g., I. L. Aleiner, P. W. Brouwer, and L. I. Glazman, Quantum effects in Coulomb blockade, Phys. Rep.
358 (2002), 309-440.

Answer:


(a) Decoupling the action and integrating over the fermionic degrees of freedom, one obtains
the functional $Z = \exp(-S_{\rm eff}[V])$, where

$$\begin{aligned}
S_{\rm eff}[V] &= S_C[V] - \mathrm{tr}\ln\begin{pmatrix}\partial_\tau - \mu + iV + \epsilon_d & T \\ T^\dagger & \partial_\tau - \mu + \epsilon_l\end{pmatrix} \\
&= S_C[V] - \mathrm{tr}\ln\begin{pmatrix}\partial_\tau - \mu + iV_0 + \epsilon_d & e^{i\phi}T \\ T^\dagger e^{-i\phi} & \partial_\tau - \mu + \epsilon_l\end{pmatrix}. \qquad (6.68)
\end{aligned}$$

Here, as in the previous problem, $S_C[V] = \int d\tau\,(V^2/4E_C - iN_0V)$, $\epsilon_d = \{\epsilon_\alpha\delta_{\alpha\alpha'}\}$ and
$\epsilon_l = \{\epsilon_a\delta_{aa'}\}$ contain the single-particle energies of dot and lead, respectively, the matrix
structure is in dot/lead space, and $\phi(\tau) = \int^\tau d\tau'\,(V(\tau') - V_0)$. In passing from the first
to the second equality we have subjected the argument of the “tr ln” to the unitary
transformation (gauge transformation) described by the matrix $\mathrm{diag}(e^{i\phi}, 1)$ (with the
block structure in dot-lead space).

(b) Expanding the “tr ln” to second order in $T$, and regularizing by subtracting the constant
$S_{\rm tun}[0]$, we obtain

$$S_{\rm tun}[\phi] = |T|^2\sum_{\alpha a}\sum_{n,m}\left[G_{\alpha,n}(e^{i\phi})_m\,G_{a,n+m}(e^{-i\phi})_{-m} - G_{\alpha,n}G_{a,n}\right] + S_{\rm tun}[0], \qquad (6.69)$$

where $G_{\alpha/a,n} = (i\omega_n - \epsilon_{\alpha/a} + \mu)^{-1}$ are the Green functions of dot and lead, respectively,
and the constant $S_{\rm tun}[0]$ will be omitted throughout. Approximating the Green functions
by

$$\sum_{\alpha/a}G_{\alpha/a,n} = -\int d\epsilon\;\rho_{d/l}(\epsilon + \mu)\,\frac{i\omega_n + \epsilon}{\omega_n^2 + \epsilon^2} \simeq -i\pi\rho_{d,l}\,\mathrm{sgn}(\omega_n), \qquad (6.70)$$

where $\rho_{d,l} \equiv \rho_{d,l}(\mu)$ is the density of states at energy $\mu$, we arrive at the result

$$\begin{aligned}
S_{\rm tun}[\phi] &= \frac{g_T}{2}\sum_{n,m}\left(-\mathrm{sgn}(\omega_n)\,\mathrm{sgn}(\omega_{n+m}) + 1\right)(e^{i\phi})_m(e^{-i\phi})_{-m} \\
&= \frac{g_T}{2\pi T}\sum_m|\omega_m|\,(e^{i\phi})_m(e^{-i\phi})_{-m}, \qquad (6.71)
\end{aligned}$$

where we note that $\sum_m(e^{i\phi})_m(e^{-i\phi})_{-m} = 1$, $g_T = 2\pi^2\rho_l\rho_d|T|^2$ is proportional to the
Golden Rule tunneling rate between dot and lead, and the appearance of a term $\sim|\omega_m|$
is a clear signature of a dissipative damping mechanism. Using the fact that the Fourier
transform of $|\omega_m|$ is given by (cf. Eq. (3.44)) $\pi T\sin^{-2}(\pi T\tau)$, the tunneling action can
be cast in a time representation, which is the second term of Eq. (6.67).
We finally add the charging action to obtain the result Eq. (6.67).
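The step from the sign functions to the factor $|\omega_m|$ in Eq. (6.71) rests on the identity $\sum_n\left(1 - \mathrm{sgn}(\omega_n)\,\mathrm{sgn}(\omega_{n+m})\right) = |\omega_m|/\pi T = 2|m|$ for fermionic $\omega_n = (2n+1)\pi T$ and bosonic $\omega_m = 2\pi mT$. A minimal sketch that counts the contributing terms directly (the function name and cutoff are assumptions made here):

```python
def sign_sum(m, n_cut=1000):
    """Sum over fermionic n of 1 - sgn(w_n) sgn(w_{n+m}), with w_n = (2n + 1) pi T.

    Only the frequencies for which w_n and w_{n+m} have opposite signs contribute,
    so the sum is finite; n_cut just has to exceed |m|.
    """
    total = 0
    for n in range(-n_cut, n_cut):
        s1 = 1 if 2 * n + 1 > 0 else -1
        s2 = 1 if 2 * (n + m) + 1 > 0 else -1
        total += 1 - s1 * s2
    return total


if __name__ == "__main__":
    for m in range(-3, 4):
        print(m, sign_sum(m), 2 * abs(m))
```

Exactly $|m|$ fermionic frequencies straddle zero, each contributing 2, which reproduces the linear-in-$|\omega_m|$ kernel.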


Josephson junction 

Building on the results obtained in the previous problem, here we derive an effective action of a Josephson 
junction - a system comprising two superconductors separated by an insulating or normal conducting 
interface region. The problem includes a preliminary discussion of the physics of the Josephson junction, 
notably its current-voltage characteristics. In Chapter 8, renormalization group methods will be applied 
to explore in detail the phenomenology of the system. 

Consider two superconducting quantum dots separated by a tunneling barrier. Generalizing
the model discussed in the last two problems, we describe each dot by an action

$$S^i[\bar\psi^i_\alpha,\psi^i_\alpha,\phi_i] = \sum_\alpha\int d\tau\;\bar\psi^i_\alpha\left(\partial_\tau + \xi_{i\alpha}\sigma_3 + e^{i\phi_i\sigma_3}\Delta\sigma_1\right)\psi^i_\alpha, \qquad i = 1,2,$$

where $\psi^i_\alpha = (\psi^i_{\alpha\uparrow}, \bar\psi^i_{\alpha\downarrow})^T$ are Nambu spinors, $\sigma_i$ Pauli matrices in particle-hole space, and
$\phi_i$ the phase of the order parameter on dot $i$. Noting that two dots form a capacitor, we
assume the presence of a “capacitive interaction”

$$S_{\rm int} = \frac{E_C}{4}\int d\tau\,(N_1 - N_2)^2,$$

where $N_i = \sum_\alpha\bar\psi^i_\alpha\sigma_3\psi^i_\alpha$ is the charge operator on dot $i$, and $1/2E_C$ the capacitance of the
system. Finally, the tunneling between the two dots is described by the action

$$S_T[\bar\psi_\alpha,\psi_\alpha] = \sum_{\alpha\beta}\int d\tau\;\bar\psi^1_\alpha(T_{\alpha\beta}\sigma_3)\psi^2_\beta + \text{h.c.},$$

where $T_{\alpha\beta} = \langle\alpha|T|\beta\rangle$ denotes the tunneling matrix elements between the single-particle
states $|\alpha\rangle$ and $|\beta\rangle$. Now, were it not for the presence of the superconducting order parameter,
the low-energy physics of the system would again be described by the effective action (6.67).

EXERCISE Convince yourself of the validity of this statement, i.e. check that the dot-dot system 
can be treated along the same lines as the dot-lead system considered above and trace the phase 
dependence of the various contributions to the action. 


(a) Turning to the superconducting case, show that, at an intermediate stage, the action is
given by

$$S_{\rm eff}[V,\phi] = S_C[V] - \mathrm{tr}\ln\begin{pmatrix}\partial_\tau + (\xi_1 + i(\dot\phi_1 + V)/2)\sigma_3 + \Delta\sigma_1 & T\sigma_3\,e^{-i(\phi_1 - \phi_2)\sigma_3/2} \\ T^\dagger\sigma_3\,e^{i(\phi_1 - \phi_2)\sigma_3/2} & \partial_\tau + (\xi_2 + i(\dot\phi_2 - V)/2)\sigma_3 + \Delta\sigma_1\end{pmatrix},$$

where $\xi_i$, $i = 1,2$, comprise the single-particle energies of the system, and
$S_C[V] = (1/4E_C)\int d\tau\,V^2$ is the charging action. At zeroth order in $T$, expand the action to
second order in the combinations $\dot\phi_i \pm V$. Vary your result with respect to $\phi_{1,2}$ to obtain
the Josephson condition $\dot\phi_1 = -\dot\phi_2 \equiv -\dot\phi$ and $V = \dot\phi$. Neglecting the effect of these
massive quadratic fluctuations, we will rigidly impose this condition throughout. (Hint:
You may assume that the characteristic frequencies $\omega_m$ carried by both the fields $\phi_i$
and the voltage $V$ are much smaller than those of the bare Green function. Use this
assumption to keep your analysis on a schematic level, i.e. try to argue in general terms
rather than performing the expansion in great detail.)

(b) Expanding the action to second order in $T$ (i.e. the leading order) and using the Josephson
conditions, show that $S = S_C + S_{\rm tun}$, where

$$S_{\rm tun}[\phi] = |T|^2\sum_{\alpha\alpha'}\sum_{n,m}\mathrm{tr}\left(G_{1,\alpha n}(e^{i\phi\sigma_3})_m\,G_{2,\alpha' n+m}(e^{-i\phi\sigma_3})_{-m} - (\phi \to 0)\right),$$

$G_{i\alpha n} = (i\omega_n - \xi_{i\alpha}\sigma_3 - \Delta\sigma_1)^{-1}$ is the bare Gor’kov Green function, and we again regularize
the tunneling action by subtracting $S_{\rm tun}[\phi = 0]$. Denoting the block diagonal/off-diagonal
contributions to the Green function by $G_{i,d/o}$, respectively, the tunneling action
splits into two contributions, (symbolically)
$\mathrm{tr}(G_{1,d}\,e^{i\phi\sigma_3}G_{2,d}\,e^{-i\phi\sigma_3} + G_{1,o}\,e^{i\phi\sigma_3}G_{2,o}\,e^{-i\phi\sigma_3})$.
Show that, up to small corrections of $O(\omega_n/\Delta)$, the diagonal terms vanish and interpret
this result. (Hint: Compare with the discussion of the previous problem.)

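(c) Turning to the particle/hole off-diagonal sector, show that

$$S_{\rm tun}[\Delta,\phi] = \gamma\int_0^\beta d\tau\,\cos(2\phi(\tau)), \qquad \gamma = |T|^2(\pi\rho)^2\Delta.$$

Combining everything, one obtains the action of the Josephson junction,

$$S[\phi] = \frac{1}{4E_C}\int d\tau\,\dot\phi^2 + \gamma\int d\tau\,\cos(2\phi(\tau)) + \Gamma S_{\rm diss}[\phi]. \qquad (6.72)$$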


While our analysis above suggests that the coefficient of the dissipative term should be 
zero (on account of the absence of low-energy quasi-particle states which might act as a 
dissipative sink of energy), voltage fluctuations in “real” Josephson junctions do seem 
to be dissipatively damped, even at low fluctuation frequencies. Although there is no 
obvious explanation of this phenomenon, it is common to account for the empirically 
observed loss of energy by adding a dissipation term to the action. 

(d) Finally, explore the current-voltage characteristics of the non-dissipative junction. To
this end, perform a Hubbard-Stratonovich transformation on the quadratic charging
interaction. What is the physical meaning of the Hubbard-Stratonovich auxiliary field?
Interpret the action as the Hamiltonian action of a conjugate variable pair and compute
the equations of motion. Show that the Josephson current flowing between the
superconductors is given by

$$I = -2\gamma\sin(2\phi). \qquad (6.73)$$

According to this equation, a finite order parameter phase difference causes the flow of
a static current, carried by Cooper pairs tunneling coherently across the barrier: the
DC Josephson effect. Application of a finite voltage difference or, equivalently, the
presence of a finite charging energy, renders the phase dynamical, $\dot\phi = V$. For a static
voltage difference, $\phi$ increases uniformly in time and the current across the barrier
oscillates in time: the AC Josephson effect. Finally, if the voltage difference
becomes very large, $V \gg \Delta$, the Fourier spectrum of $\phi$ contains frequencies $|\omega_m| > \Delta$
(think about it). At these frequencies, phase variations have the capacity to create quasi-particle
excitations which in turn may tunnel incoherently across the barrier (thereby
paying a price in condensate energy but benefiting from the voltage drop). The tunneling
of independent quasi-particles is described by the dissipative term in the action (which,
we recall, is negligible only at frequencies $|\omega_m| < \Delta$).
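The DC and AC Josephson effects described above can be illustrated with a few lines of code. The sketch below evaluates Eq. (6.73) for a static phase and for a phase that grows linearly in (real) time under a constant voltage, $\phi(t) = \phi_0 + Vt$, in units with $\hbar = e = 1$. The function names and the real-time reading of $\dot\phi = V$ are assumptions made for illustration.

```python
import math


def josephson_current(phi, gamma):
    """Eq. (6.73): I = -2 gamma sin(2 phi)."""
    return -2.0 * gamma * math.sin(2.0 * phi)


def ac_current(t, voltage, gamma, phi0=0.0):
    """Current for a static voltage, assuming phi(t) = phi0 + V t."""
    return josephson_current(phi0 + voltage * t, gamma)


if __name__ == "__main__":
    gamma, voltage = 1.0, 0.3
    period = math.pi / voltage  # the current oscillates with frequency 2 V
    for t in (0.0, 0.25 * period, 0.5 * period, period):
        print(t, ac_current(t, voltage, gamma))
```

A static phase gives a time-independent supercurrent, while a constant voltage gives a current oscillating at frequency $2V$, i.e. with period $\pi/V$ in these units.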

Answer: 


(a) The given result is proven by decoupling the capacitive interaction by a field $V$ and
integrating out the fermions. We next subject the “tr ln” to the gauge transformation
(symbolic notation)

$$\mathrm{tr}\ln\hat G^{-1} \to \mathrm{tr}\ln\left(e^{-i\phi_i\sigma_3/2}\,\hat G^{-1}\,e^{i\phi_i\sigma_3/2}\right).$$

This removes the phase dependence from the order parameters of the Green functions.
At the same time, the tunneling matrices become phase-dependent and the diagonal
terms of the Green functions acquire a contribution $i\dot\phi_i/2$, as indicated in the formula
above.

An expansion of the $T = 0$ action to second order in $\dot\phi_i$ and $V$ gives an expression of
the structure

$$\sum_{i=1,2}\int d\tau\,d\tau'\,(\dot\phi_i - (-)^iV)(\tau)\,F(\tau - \tau')\,(\dot\phi_i - (-)^iV)(\tau') = C\sum_{i=1,2}\int d\tau\,(\dot\phi_i - (-)^iV)^2(\tau) + \dots,$$

where the time dependence of the integral kernel $F$ is determined by the bare Green
function. Assuming that this fluctuates rapidly, we may expand as indicated in the second
equality, where the ellipsis represents higher-order derivatives acting on the slow
fields $\phi_i$, $V$ and the constant $C$ must be positive on stability grounds. A straightforward
variation of the quadratic integral with respect to $\phi_i$ and $V$ gives the Josephson
conditions.

(b) Decoupling the interaction operators by two Hubbard-Stratonovich fields $V_i = \dot\phi_i$ and
integrating out the fermions, we obtain


Sefllfa 


y S c [fa] — tr In 
y S c [fa] — tr In 


d T - (£1 - ifa)a 3 - Am 
Tfa :i 

Q-l Te -iA<t»T 3 
gi Aorr.-i y f Q-l 


T<t 3 

d T - (£2 - ifat)os - Am 
where $\xi_i$ denotes the matrix of energies of the two dots. An expansion to leading order
in the off-diagonal blocks obtains the preliminary action $S_{\rm tun}[\phi]$.

Substituting the diagonal contribution to the Green function,
$(G_{i,d})_n = (-i\omega_n - \xi_i\sigma_3)/(\omega_n^2 + \xi_i^2 + \Delta^2)$, into the tunneling action and comparing
with Eq. (6.69), we find that the sum Eq. (6.70) becomes replaced by

$$\sum_\alpha(G_{i,d})_{\alpha,n} = -\int d\epsilon\;\rho(\epsilon + \mu)\,\frac{i\omega_n + \epsilon\sigma_3}{\omega_n^2 + \epsilon^2 + \Delta^2} \simeq -i\pi\rho\,\frac{\omega_n}{(\omega_n^2 + \Delta^2)^{1/2}}.$$

Comparing with Eq. (6.71), we obtain

$$\frac{1}{2}\sum_n\left(-\frac{\omega_n\,\omega_{n+m}}{(\omega_n^2 + \Delta^2)^{1/2}(\omega_{n+m}^2 + \Delta^2)^{1/2}} + 1\right) \simeq \begin{cases}|\omega_m|/2\pi T, & |\omega_m| \gg \Delta, \\ 0 + O(\omega_m/\Delta), & |\omega_m| \ll \Delta,\end{cases}$$

instead of a global factor $|\omega_m|/2\pi T$. The physical interpretation of this result is that
only high-frequency ($\omega > \Delta$) fluctuations of the voltage field $V = \dot\phi$ have the capacity to
overcome the superconductor gap and dissipate their energy by creating quasi-particle
excitations. In contrast, low-frequency fluctuations do not suffer from dissipative damping.
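(c) Substituting the off-diagonal term $(G_{i,o})_n = -\Delta\sigma_1/(\omega_n^2 + \xi^2 + \Delta^2)$ into the tunneling
action and neglecting contributions of $O(|\omega_m|/\Delta)$, we find

$$\begin{aligned}
S_{\rm tun}[\phi] &= |T|^2\sum_{n,m}\sum_{\alpha\alpha'}\frac{\Delta}{\omega_n^2 + \xi_\alpha^2 + \Delta^2}\,\frac{\Delta}{\omega_n^2 + \xi_{\alpha'}^2 + \Delta^2}\;\mathrm{tr}\left(\sigma_1(e^{i\sigma_3\phi})_m\,\sigma_1(e^{-i\sigma_3\phi})_{-m}\right) \\
&= \frac{|T|^2(\pi\rho)^2\Delta}{2T}\sum_m\mathrm{tr}\left(\sigma_1(e^{i\sigma_3\phi})_m\,\sigma_1(e^{-i\sigma_3\phi})_{-m}\right) \\
&= |T|^2(\pi\rho)^2\Delta\int d\tau\,\cos(2\phi(\tau)).
\end{aligned}$$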


(d) Think of the non-dissipative Josephson action as the action of a point particle with
kinetic energy $\sim E_C^{-1}\dot\phi^2$ and potential energy $\cos(2\phi)$. In this language, passage to the
Hubbard-Stratonovich decoupled action

$$S[\phi,N] = E_C\int d\tau\,N^2 + \gamma\int d\tau\,\cos(2\phi(\tau)) + i\int d\tau\,N\dot\phi,$$

amounts to a transition from the Lagrangian to the Hamiltonian picture. The notation
emphasizes that the momentum conjugate to the phase variable is the number operator
of the system. (More precisely, $N$ measures the difference of the charge carried by the
two superconductors; the total charge of the system is conserved.) Varying the action,
we obtain the Hamilton equations

$$\dot\phi = 2iE_CN, \qquad \dot N = 2i\gamma\sin(2\phi).$$

Now, $I = i\partial_\tau N$ is the current flowing from one dot to the other, i.e. the second relation
gives the Josephson current Eq. (6.73). The first relation states that, for a finite charging
energy, mismatches in the charge induce time variations in the phase. By virtue of the
Josephson relation, such time variations are equivalent to a finite voltage drop.


Field theory of the BCS to BEC crossover

As we have seen in the main text, the formation of the superconducting and Bose-Einstein condensates - 
the macroscopic occupation of a single quantum state - is characterized by two paradigms. 


In the BCS theory, the transition from a normal to a superconducting phase of electrons involves
a pairing instability which takes place at a temperature $T_c \ll \epsilon_F$ where the Fermi energy $\epsilon_F$ sets
the degeneracy scale. In particular, the formation of Cooper pairs and their condensation occur
simultaneously at the transition temperature.

Similarly, the transition of a Bose gas to a Bose-Einstein condensate (BEC) phase occurs at a
temperature $T_c$ comparable to the degeneracy temperature - in the bosonic system, the temperature
at which the thermal length $\hbar v/k_BT$ becomes comparable to the typical particle separation. However,
the bosonic particles participating in the condensate are invariably molecules or composites involving
an even number of elementary fermionic degrees of freedom, e.g. $^4$He, Rb atoms, etc. Usually, the
corresponding dissociation temperature of the particles into their fermionic constituents, $T_{\rm dis}$, is greatly
in excess of the BEC transition temperature $T_c$ - the condensate forms out of preformed bosons.


The separation of the energy scales $T_{\rm dis}$ and $T_c$ means that systems which undergo a BCS or BEC
transition can be neatly classified. However, in some systems (such as fermionic atom condensates),
the transition temperature can be comparable to (or even tuned through) the molecular dissociation
temperature. Such systems have the capacity to manifest a BEC to BCS crossover. The aim of this
extended problem is to develop a field theory to describe the crossover.


Although the study of the BEC to BCS crossover has a long history, the modern perspective on the
subject can be traced back to the seminal work90 of Anthony Leggett. In this early work, it was shown
that the smooth crossover from a BCS to a BEC state could be described within the framework of a
single variational wavefunction. In the following, we will develop a field theory to explore the
crossover in the framework of the original Hamiltonian considered by Leggett.91


Sir Anthony J. Leggett 1938-
Co-recipient of the 2003 Nobel Prize in Physics (with Alexei A. Abrikosov and Vitaly L. Ginzburg)
“for pioneering contributions to the theory of superconductors and superfluids.” He has made
important contributions to the theory of normal and superfluid helium liquids and other
strongly coupled superfluids, macroscopic dissipative quantum systems, and the use of condensed
systems to test the foundations of quantum mechanics.



90 See, e.g., A. J. Leggett, Modern Trends in the Theory of Condensed Matter, A. Pekalski and R. Przystawa,
eds. (Springer-Verlag, 1980) as well as the later work by P. Nozières and S. Schmitt-Rink, J. Low Temp. Phys.
59 (1985), 195-211. See also the related work by Keldysh and collaborators from the 1960s on the closely related
problem of exciton condensation phenomena.

91 Our analysis will follow closely that described in M. Randeria, Crossover from BCS theory to Bose-Einstein
condensation, in Bose-Einstein Condensation, A. Griffin, D. W. Snoke, and S. Stringari eds. (Cambridge
University Press, 1995).

Consider the Hamiltonian of a three-dimensional gas of spinful fermionic particles interacting
through an attractive local (i.e. contact) pairwise interaction,

$$\hat H - \mu\hat N = \sum_{k\sigma}\xi_k\,c^\dagger_{k\sigma}c_{k\sigma} - gL^3\int d^3r\;c^\dagger_\uparrow c^\dagger_\downarrow c_\downarrow c_\uparrow,$$

where $\xi_k = \epsilon_k - \mu$, $\epsilon_k = k^2/2m$, and the parameter $g$ characterizes the strength of the
interaction.

(a) Starting with the coherent state path integral formulation of the quantum partition
function, introduce a Hubbard-Stratonovich field, decoupling the pair interaction and,
integrating out the fermionic degrees of freedom, obtain an effective action involving
the pairing field $\Delta$. (Select only the pairing channel in the decoupling.)

(b) Varying the action with respect to $\Delta$, show that the transition temperature is determined
by the saddle-point equation

$$\frac{1}{g} = \sum_k\frac{\tanh(\xi_k/2T_c)}{2\xi_k}.$$

An inspection of the momentum summation will confirm the presence of a high-energy
(UV) divergence. In the conventional BCS theory of the superconductor discussed in the
main text, one may recall that this high-energy divergence was regularized by a physical
cut-off derived from the pairing mechanism itself - the phonon exchange mechanism
imposed a cut-off at the Debye frequency $\omega_D \ll \mu$, the highest phonon energy allowed
by the crystal lattice, which restricted the range of the momentum sum to an energy
shell around the Fermi level. In the present case, the UV divergence reflects a pathology
of the contact interaction. Had one chosen a more physical pair potential $g(r - r')$
involving both a magnitude $g$ and a range $b$, the momentum sum would have involved
a soft cut-off at the momentum scale $|k| \sim 1/b$ (exercise). Equivalently, to implement
the physical cut-off, one may replace the bare coupling constant $g$ by the low-energy
limit of the two-body T-matrix (in the absence of the surrounding medium). In three
dimensions, this translates to the regularization92

$$\frac{m}{4\pi a} = -\frac{1}{g} + \sum_k\frac{1}{2\epsilon_k},$$

where $a$ denotes the s-wave scattering length. As a function of the (positive) bare
interaction $g$, $1/a$ increases monotonically from $-\infty$ for a very weak attraction to $+\infty$ for
a very strong attraction. Beyond the threshold $1/a = 0$, the two-body system develops
a bound state with a size $a$ and binding energy $E_b = 1/ma^2$. Therefore, one can
identify the ratio of the typical particle separation and scattering length $1/k_Fa$ as the
dimensionless coupling constant, where $k_F$ and $\epsilon_F$ denote the Fermi wavevector and
energy of the non-interacting system.

92 To understand this result, one may note that, for an isolated two-body scattering involving a potential $V$, the
scattering T-matrix obeys the Lippmann-Schwinger equation $T = V + VG_0T$, where $G_0$ denotes the bare Green
function of the free particles. Applied to the potential $V(r) = -gL^d\delta(r)$, the diagonal $k = 0$ component of the
T-matrix, which translates to the physical scattering length $a$, can be obtained from the self-consistent solution
of the equation as $T(0,0) = 4\pi a/m = [-(1/g) + \sum_k(1/2\epsilon_k)]^{-1}$.

With this interpretation, the saddle-point equation acquires the regularized form

$$-\frac{m}{4\pi a} = \sum_k\left[\frac{\tanh(\xi_k/2T_c)}{2\xi_k} - \frac{1}{2\epsilon_k}\right].$$

When combined with the equation for the particle density
$n(\mu,T) = \frac{1}{L^d}\sum_k[1 - \tanh(\xi_k/2T)]$, valid in the mean-field approximation, this presents an equation
for $T_c$ and $\mu$.

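(c) In the weak coupling BCS limit, $1/k_Fa \to -\infty$, show that the chemical potential is fixed
by the density such that $\mu \simeq \epsilon_F$ and, making use of the identity (valid for $t \to 0$)

$$\int_0^\infty dz\,z^{1/2}\left[\frac{\tanh((z - 1)/2t)}{2(z - 1)} - \frac{1}{2z}\right] \simeq \ln\left(\frac{8\gamma}{\pi e^2t}\right),$$

where $\gamma = e^C$ and $C$ denotes Euler’s constant, show that the saddle-point equation
translates to the condition

$$T_c = \frac{8\gamma}{\pi e^2}\,\epsilon_F\exp\left(-\frac{\pi}{2k_F|a|}\right).$$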
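To get a feeling for the numbers, the weak-coupling estimate $T_c = (8\gamma/\pi e^2)\,\epsilon_F\exp(-\pi/2k_F|a|)$ with $\gamma = e^C$ can be evaluated directly. The sketch below is illustrative only; the function name and the sample values of $k_Fa$ are assumptions.

```python
import math

EULER_C = 0.5772156649015329  # Euler's constant C


def bcs_tc_over_ef(kf_a):
    """Weak-coupling T_c / eps_F = (8 gamma / (pi e^2)) exp(-pi / (2 k_F |a|)).

    gamma = exp(C). The formula is meant for attractive weak coupling, k_F a < 0.
    """
    if kf_a >= 0.0:
        raise ValueError("weak-coupling BCS estimate requires k_F a < 0")
    gamma = math.exp(EULER_C)
    prefactor = 8.0 * gamma / (math.pi * math.e ** 2)
    return prefactor * math.exp(-math.pi / (2.0 * abs(kf_a)))


if __name__ == "__main__":
    for kf_a in (-0.25, -0.5, -1.0):
        print(kf_a, bcs_tc_over_ef(kf_a))
```

The prefactor evaluates to roughly 0.61, so $T_c$ is exponentially small compared with $\epsilon_F$ for $k_F|a| \ll 1$, in line with the separation of scales assumed in the BCS limit.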


(d) In the strong coupling limit $1/k_Fa \to +\infty$, the roles of the saddle-point equation
and number density are reversed. The former now fixes the chemical potential $\mu < 0$
while, superficially, the latter determines $T_c$. In particular, making use of the identity
$\int_0^\infty dz\,z^{1/2}\left[\frac{1}{2(z + 1)} - \frac{1}{2z}\right] = -\frac{\pi}{2}$, show that

$$\mu = -\frac{E_b}{2} = -\frac{1}{2ma^2}, \qquad T_c \simeq \frac{E_b}{2\ln(E_b/\epsilon_F)^{3/2}}.$$

In fact, the apparent divergence of the transition temperature $T_c$ in the strong coupling
limit is a pathology of the mean-field analysis. The inferred value of $T_c$ represents the
dissociation energy of the pairs $T_{\rm dis}$ rather than the temperature scale at which coherence
is established. To understand why, we have to turn to the analysis of fluctuations.
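The identity quoted in part (d), and the resulting chemical potential $\mu = -1/2ma^2$, can be checked with a short numerical sketch. The substitution used to tame the integrable singularity at $z = 0$, the function names, and the step count are choices made here for illustration.

```python
import math


def strong_coupling_identity(n_steps=20000):
    """Estimate int_0^inf dz z^(1/2) [1/(2(z+1)) - 1/(2z)] by the midpoint rule.

    The integrand equals -1 / (2 z^(1/2) (z + 1)). Substituting z = u^2 and then
    u = s / (1 - s) maps the integral to -int_0^1 ds / ((1 - s)^2 + s^2).
    """
    h = 1.0 / n_steps
    total = 0.0
    for k in range(n_steps):
        s = (k + 0.5) * h
        total -= 1.0 / ((1.0 - s) ** 2 + s ** 2)
    return total * h


def strong_coupling_mu(m, a):
    """Chemical potential mu = -E_b / 2 = -1 / (2 m a^2) in the limit 1/(k_F a) -> +infinity."""
    return -1.0 / (2.0 * m * a * a)


if __name__ == "__main__":
    print(strong_coupling_identity(), -math.pi / 2.0)
    print(strong_coupling_mu(1.0, 0.5))
```

The first line of output should reproduce $-\pi/2$ to many digits, since the transformed integrand is smooth on the unit interval.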

EXERCISE Enthusiasts may enjoy exploring the saddle-point analysis of the condensed
phase at zero temperature when the order parameter $\Delta$ acquires a non-zero expectation
value. These results may be compared with the variational analysis based on the ground
state wavefunction $|{\rm g.s.}\rangle = \prod_k(u_k + v_k\,c^\dagger_{k\uparrow}c^\dagger_{-k\downarrow})|\Omega\rangle$ described in Leggett’s original work.
Note that, in the low-density limit, the variational parameter $v_k$ describes the bound state
wavefunction of a single pair.


(e) Developing the action to quartic order in $\Delta$, show that the Ginzburg-Landau expansion
of the action takes the form

$$S = \frac{F_0}{T} + \sum_q\Pi(q)|\Delta(q)|^2 + \int d^dr\,\frac{u}{2}|\Delta(r)|^4 + \dots,$$

where $F_0$ denotes the free energy of the non-interacting Fermi gas,

$$\Pi(q) = \sum_k\left[\frac{1 - n_F(\xi_k) - n_F(\xi_{k+q})}{i\omega_m - \xi_k - \xi_{k+q}} + \frac{1}{2\epsilon_k}\right] - \frac{m}{4\pi a}$$

denotes the pairing susceptibility, and $u$ is a positive constant. In the weak coupling
limit $1/k_Fa \to -\infty$, confirm that the gradient expansion of the susceptibility leads
to the Ginzburg-Landau expansion discussed in Section 6.4. Conversely, in the strong
coupling limit $1/k_Fa \to +\infty$, show that

$$\Pi(q) \simeq \frac{m^{3/2}}{8\pi\sqrt{E_b}}\left(-i\omega_m - 2\mu + \omega_B(q)\right),$$

where $\omega_B(q) = -E_b + q^2/4m$. Absorbing the prefactor into an overall rescaling of the field
$\Delta$, the corresponding action can now be recognized as that of a weakly interacting Bose
gas of composite particles with a mass $2m$ and density $n/2$ - cf. Section 6.3. The
failure of the mean-field theory to infer the correct value of $T_c$ now becomes clear. In
the mean-field description, it is assumed that, at temperatures $T > T_c$, particles exist as
unbound fermions. However, the action above shows that, for temperatures only slightly
in excess of $T_c$, the particles already exist as bound pairs. The bulk transition takes
place when these bound pairs condense.

Once identified as a weakly interacting Bose gas, one can immediately deduce that,
in the strong coupling limit, $T_c$ becomes independent of the scattering length and varies
with density as

$$T_c = \frac{\pi}{m}\left[\frac{n}{2\zeta(3/2)}\right]^{2/3}.$$
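For orientation, the condensation temperature of an ideal Bose gas of pairs with mass $2m$ and density $n/2$, $T_c = (\pi/m)[n/2\zeta(3/2)]^{2/3}$, can be expressed in units of the Fermi energy of the underlying fermions, using $n = k_F^3/3\pi^2$ and $\epsilon_F = k_F^2/2m$. The sketch below is a hedged illustration: the ideal-gas formula is the standard textbook result assumed here, and the function names and the Euler-Maclaurin evaluation of $\zeta(3/2)$ are choices made for this example.

```python
import math


def zeta(s, n_terms=1000):
    """Riemann zeta function for s > 1: partial sum plus Euler-Maclaurin tail."""
    partial = sum(k ** -s for k in range(1, n_terms + 1))
    n = float(n_terms)
    tail = n ** (1.0 - s) / (s - 1.0) - 0.5 * n ** -s + s * n ** (-s - 1.0) / 12.0
    return partial + tail


def bec_tc_over_ef():
    """T_c / eps_F for bosons of mass 2m and density n/2.

    With n = k_F^3 / (3 pi^2) and eps_F = k_F^2 / (2m) the ratio becomes
    2 pi [1 / (6 pi^2 zeta(3/2))]^(2/3), independent of the scattering length.
    """
    return 2.0 * math.pi * (1.0 / (6.0 * math.pi ** 2 * zeta(1.5))) ** (2.0 / 3.0)


if __name__ == "__main__":
    print(zeta(1.5), bec_tc_over_ef())
```

The ratio comes out close to 0.22, i.e. in the strong coupling limit $T_c$ is a fixed fraction of $\epsilon_F$ set by the density alone.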



INFO BEC-BCS transition in fermionic alkali atomic gases and cold exciton liquids:

As mentioned above, in the majority of condensed matter systems the dissociation energy $T_{\rm dis}$ is
well separated from the transition temperature $T_c$, and the majority of condensates can be neatly
classified as being of BCS or BEC type. However, two systems which present the opportunity to
explore crossover phenomena have been the subject of considerable interest in the experimental
and theoretical literature.

At low temperatures a “quasi-equilibrium” degenerate gas of electrons and holes forms a
two-component plasma.93 At low densities, the constituent electrons and holes can bind to form
neutral composite objects known as excitons. Comprising an electron and hole, these objects
transform as bosons and, as such, have the capacity to undergo BEC. At high densities, the
electrons and holes become unbound and exist as a two-component plasma. Yet, by exploiting
the Coulomb interaction, at low enough temperatures the electrons and holes can condense into a
collective BCS-like phase - the exciton insulator in which a quasi-particle energy gap develops at
the Fermi surface. Leaving aside the capacity for other phases to develop (namely electron-hole
droplets or molecules), the particle density can be used as a parameter to mediate a crossover
between a BEC and a BCS-like phase. (In fact, a careful study of the free energy at mean-field
shows the crossover to occur via a phase transition of high order. Whether the weak transition is
smeared into a crossover by thermal fluctuations remains unclear.) To date, experimentalists have
been unable to defeat the problems posed by fast radiative and Auger-assisted recombination
processes to unambiguously realize such a condensate.94

93 Here the term “quasi-equilibrium” is used to acknowledge the fact that, in a conventional direct band-gap
semiconductor, an electron-hole plasma can lower its energy by recombination. If the recombination rate is
slower than the equilibration time, an electron-hole plasma may, in principle, acquire a quasi-equilibrium
distribution.

A second example of a BEC to BCS crossover is presented by the atomic condensates. As
mentioned in the text, the realization of BEC in atom condensates is now almost routine. Lately,
there have been considerable efforts targeted at realizing BCS-like condensates of fermionic atoms
such as $^6$Li and $^{40}$K. Yet, the atoms being charge neutral, the experimental identification of a BCS phase
presents considerable difficulties - a feature shared by the electron-hole condensate. As a result
experimentalists have used “Feshbach resonance” phenomena to tune the atomic pair interaction
from weak to strong coupling, whence the atoms exist as tightly bound pairs. By monitoring the
dynamics of BEC formation, attempts have been made to infer the properties of the ephemeral
BCS-like phase.95

Answer: 


(a) Referring to Section 6.4, a decoupling of the pair interaction by a field $\Delta$ gives the
partition function

$$Z = \int D(\bar\Delta,\Delta)\,\exp\left[-\frac{1}{g}\int d\tau\int d^dr\,|\Delta|^2 + \ln\det\hat G^{-1}\right],$$

where

$$\hat G^{-1} = \begin{pmatrix}-\partial_\tau - \xi_k & \Delta \\ \bar\Delta & -\partial_\tau + \xi_k\end{pmatrix}$$

denotes the matrix Gor’kov Green function.

(b) Once again referring to Section 6.4, a variation of the action with respect to $\Delta$ gives the
saddle-point equation

$$\frac{\Delta(r,\tau)}{g} - \hat G_{12}(r,\tau;r,\tau) = 0.$$

In particular, for $\Delta \to 0$, an integration over the Matsubara frequencies leads to the
required saddle-point equation.

(c) Firstly, in the weak coupling limit, we have that $T_c \ll \mu$. Therefore, to leading order,
one may note that $\mu \simeq \epsilon_F$. Then, in this approximation, applying the regularization
procedure outlined in the question, the saddle-point equation takes the form

$$-\frac{m}{4\pi a} = \int_0^\infty d\epsilon\;\nu(\epsilon)\left[\frac{\tanh[(\epsilon - \epsilon_F)/2T_c]}{2(\epsilon - \epsilon_F)} - \frac{1}{2\epsilon}\right],$$

where $\nu(\epsilon) = m^{3/2}\epsilon^{1/2}/\sqrt{2}\pi^2$ denotes the three-dimensional density of states. Making use
of the given identity, and noting that $m/4\pi a\,\nu(\epsilon_F) = \pi/2k_Fa$, one obtains the required
estimate for $T_c$.

94 For a general review of the field, we refer to Griffin, Snoke, and Stringari, eds., Bose-Einstein Condensation.

95 For a review of this general field, we refer to, e.g., M. Holland, S. J. J. M. F. Kokkelmans, M. L. Chiofalo, and
R. Walser, Resonance superfluidity in a quantum degenerate Fermi gas, Phys. Rev. Lett. 87 (2001), 120406.

(d) In the strong coupling limit, one expects a bound state to develop in advance of the 
transition. In the limit of low density, $\epsilon_F \to 0$, the appearance of the bound state is 
signaled by the chemical potential reversing sign. For $T_c \ll |\mu|$, the hyperbolic tangent in 
the saddle-point equation may be set to unity, so that the equation becomes largely 
temperature-independent and acquires the form 

$$ -\frac{m}{4\pi a} = \int_0^\infty d\epsilon\,\nu(\epsilon)\left[\frac{1}{2(\epsilon+|\mu|)} - \frac{1}{2\epsilon}\right]. $$

Making use of the given identity, one obtains the required formula for the chemical 
potential. In the low-density limit, the latter asymptotes to (one half of) the bound state 
energy $E_b$. Then, when $\mu$ is substituted into the equation for the particle number, one 
obtains 

$$ n \simeq 2\int d\epsilon\,\nu(\epsilon)\exp\left[-\frac{\epsilon + E_b/2}{T}\right]. $$

When equated with the expression for the number density, $n = \int_0^{\epsilon_F} d\epsilon\,\nu(\epsilon)$, a rearrangement 
of the equation yields the required estimate for $T_c$. As mentioned in the question, 
here one must interpret $T_c$ as the dissociation temperature, which may be (and, in 
the physical context, usually is) greatly in excess of the Bose-Einstein condensation 
temperature. 
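As a cross-check of the strong-coupling statements in part (d), the energy integral can be done in closed form. The calculation below is only a sketch: it assumes the three-dimensional density of states $\nu(\epsilon) = m^{3/2}\epsilon^{1/2}/(\sqrt{2}\pi^2)$, units with $\hbar = 1$, and the identification $E_b = 1/ma^2$ for the two-body binding energy, none of which is spelled out in the answer itself.

```latex
% Sketch under the stated assumptions (hbar = 1, E_b = 1/(m a^2)).
\begin{align*}
-\frac{m}{4\pi a}
  &= \int_0^\infty d\epsilon\,\nu(\epsilon)
     \left[\frac{1}{2(\epsilon+|\mu|)}-\frac{1}{2\epsilon}\right]
   = -\frac{|\mu|}{2}\,\frac{m^{3/2}}{\sqrt{2}\,\pi^{2}}
     \int_0^\infty \frac{d\epsilon}{\epsilon^{1/2}(\epsilon+|\mu|)}
   = -\frac{m^{3/2}|\mu|^{1/2}}{2\sqrt{2}\,\pi} \\
  &\Longrightarrow\quad |\mu| = \frac{1}{2ma^{2}} = \frac{E_b}{2}, \\[4pt]
n &\simeq 2\int_0^\infty d\epsilon\,\nu(\epsilon)\,e^{-(\epsilon+E_b/2)/T}
   = \frac{m^{3/2}}{\sqrt{2}\,\pi^{3/2}}\,T^{3/2}\,e^{-E_b/2T}.
\end{align*}
```

Here the first line uses $\int_0^\infty d\epsilon\,\epsilon^{-1/2}(\epsilon+|\mu|)^{-1} = \pi/|\mu|^{1/2}$ and the last uses $\int_0^\infty d\epsilon\,\epsilon^{1/2}e^{-\epsilon/T} = (\sqrt{\pi}/2)\,T^{3/2}$. Since the density scales as $n \propto m^{3/2}\epsilon_F^{3/2}$, equating the two expressions gives $E_b/2T_c = \frac{3}{2}\ln(T_c/\epsilon_F) + O(1)$, i.e. $T_c \simeq E_b/[3\ln(E_b/\epsilon_F)]$ to leading logarithmic accuracy.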

(e) When properly regularized, the expansion of the action to second order follows directly 
the procedure outlined in Section 6.4. In the weak coupling limit, the chemical potential 
asymptotes to the Fermi energy of the non-interacting system, $\epsilon_F$. Here the gradient 
expansion of the pair susceptibility $\Pi(q)$ is strictly equivalent to that discussed 
in Section 6.4. By contrast, the strong coupling expansion requires further consideration. 
In this case, one may expect $\mu < 0$. For temperatures $T \ll |\mu|$, noting that 
$\xi_{\mathbf k+\mathbf q/2} + \xi_{\mathbf k-\mathbf q/2} = 2(\epsilon_{\mathbf k} - \mu) + q^2/4m$, one obtains 

$$ \Pi(q) = \frac{1}{L^d}\sum_{\mathbf k}\left[\frac{1}{i\omega_m - 2(\epsilon_{\mathbf k}-\mu) - q^2/4m} + \frac{1}{2\epsilon_{\mathbf k}}\right] - \frac{m}{4\pi a}. $$

Then, making use of the identity proposed in part (d), one obtains 

$$ \Pi(q) = \frac{m^{3/2}}{4\pi}\sqrt{-2\mu + \frac{q^2}{4m} - i\omega_m} - \frac{m}{4\pi a}. $$

Expanding the argument of the square root in deviations from the bound state energy 
$E_b$, one obtains the required gradient expansion. 
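The final expansion step of part (e) can be made explicit. The lines below are a sketch that assumes the kernel $\frac{m^{3/2}}{4\pi}\sqrt{-2\mu + q^2/4m - i\omega_m} - \frac{m}{4\pi a}$, the identification $E_b = 1/ma^2$ (with $\hbar = 1$), and the shorthand $\mu_B \equiv 2\mu + E_b$, which is introduced here purely for illustration.

```latex
% Sketch: expand the square root around -2\mu = E_b, keeping the linear term only.
\begin{align*}
\frac{m^{3/2}}{4\pi}\sqrt{E_b + x} - \frac{m}{4\pi a}
  &\simeq \frac{m^{3/2}}{4\pi}\left[E_b^{1/2} + \frac{x}{2E_b^{1/2}}\right] - \frac{m}{4\pi a},
  \qquad x \equiv -i\omega_m + \frac{q^2}{4m} - \mu_B, \\
  &= \frac{m^{2}a}{8\pi}\left[-i\omega_m + \frac{q^{2}}{2(2m)} - \mu_B\right].
\end{align*}
```

The constant terms cancel because $m^{3/2}E_b^{1/2}/4\pi = m/4\pi a$. Under these assumptions the quadratic action describes free bosons of mass $2m$ with chemical potential $\mu_B$, which is the sense in which the tightly bound pairs behave as a Bose gas.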


Metallic magnetism 

Previously we have seen that the functional field integral provides a convenient framework in which 
a perturbation theory of the weakly interacting electron gas can be developed. By trading the field 
operators of the electrons for the dressed photon field $\phi$, the diagrammatic series that comprises the 
RPA can be organized into a systematic expansion of the action in the charge, e. However, we have 
also seen that interactions can have a more striking effect on the electron liquid, initiating transitions to 
new electron phases. In the following, we will explore the transition of the electron gas to an itinerant 
magnetic phase - the Stoner transition. 96 

Historically, the Stoner transition has assumed a special place in the theoretical literature. 
Developments in statistical mechanics through the 1950s and 1960s highlighted the impor- 
tance of fluctuation phenomena in the classification and phenomenology of classical phase 
transitions (for more details, see Chapter 8). In the vicinity of a continuous classical phase 
transition, the collective properties of a thermodynamic system are characterized by a set of 
universal critical exponents. In the quantum mechanical system, a phase transition can be 
tuned by a change of an external parameter even at zero temperature - a “quantum phase 
transition.” In a seminal work, it was proposed by John Hertz 97 that the region surrounding 
a quantum critical point was itself characterized by quantum critical phenomena. In 
this context, the problem of metallic magnetism presents a useful prototype - and the one 
used by Hertz to exemplify the phenomenology of quantum criticality. Lately, the class of 
heavy fermion materials has provided a rich experimental arena in which quantum critical 
phenomena have been observed and explored. In the following, we develop a low-energy 
theory of the interacting electron system and discuss the nature of the mean-field transition 
to the itinerant ferromagnetic phase. Later, following our discussion of the renormalization 
group methods in Chapter 8, we use the low-energy theory as a platform to discuss the 
general phenomenology of quantum criticality (see Problem 8.8.2). 

Our starting point is the lattice Hamiltonian for a non-interacting electron gas perturbed 
by a local “on-site” Hubbard interaction, $\hat H = \hat H_0 + \hat H_U$, where 

$$ \hat H_0 = \sum_{\mathbf p\sigma}\epsilon_{\mathbf p}\,c^\dagger_{\mathbf p\sigma}c_{\mathbf p\sigma}, \qquad \hat H_U = U\sum_{i}^{N}\hat n_{i\uparrow}\hat n_{i\downarrow}. $$

Here, the sum runs over the N lattice sites i and, as usual, $\hat n_{i\sigma} = c^\dagger_{i\sigma}c_{i\sigma}$ denotes the number 
operator for spin $\sigma$ on site i. The electron dispersion relation $\epsilon_{\mathbf p}$ (a function of the lattice 
geometry) as well as the dimensionality is, for the present, left unspecified. 

As we have seen in Chapter 2, the phase diagram of the lattice Hubbard Hamiltonian is 
rich, exhibiting a range of correlated ground states depending sensitively on the density and 
strength of interaction. In the lattice system, commensurability effects can initiate charge 
or spin density wave instabilities while, at large U, the electron system can freeze into 
an insulating antiferromagnetic Mott-Hubbard state. Conversely, in the following, we will 
show that, at low densities, the system may assume an itinerant (i.e. mobile) spin-polarized 
phase, the Stoner ferromagnet. 

The capacity of the interacting electron system to form a ferromagnetic phase reflects 
the competition between the kinetic and interaction potential energies. Being forbidden 
by the Pauli exclusion principle to occupy the same site, electrons of the same spin can 
escape the local Hubbard interaction. However, the same exclusion principle requires the 
system to occupy higher-lying single-particle states, raising the kinetic energy. When the 
total reduction in potential energy outweighs the increase in kinetic energy, a transition to 
a spin-polarized or ferromagnetic phase is induced. 

96 E. C. Stoner, Ferromagnetism, Rep. Prog. Phys. 11 (1947), 43-112. 

97 J. A. Hertz, Quantum critical phenomena, Phys. Rev. B 14 (1976), 1165-84. 

Once again, to facilitate the construction of a low-energy field theory of the magnetic 
transition, it is helpful to effect a Hubbard-Stratonovich decoupling of the Hubbard interaction. 
For this purpose it is convenient to first separate the interaction into channels sensitive 
to the charge and spin densities, i.e. 

$$ \hat H_U = \frac{U}{4}\sum_i(\hat n_{i\uparrow} + \hat n_{i\downarrow})^2 - \frac{U}{4}\sum_i(\hat n_{i\uparrow} - \hat n_{i\downarrow})^2. $$

Since we expect that fluctuations in the charge density channel will have little effect on 
the thermodynamic properties of the low-density system, we can neglect their 
influence on the interaction, setting $\hat H_U \simeq -U\sum_i(\hat S_i^z)^2$, where $\hat S_i^z = (\hat n_{i\uparrow} - \hat n_{i\downarrow})/2$. 
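The channel separation can be checked directly on the four occupation states of a single site, where the number operators are diagonal. The following is a minimal sketch; the function names are illustrative and not taken from the text.

```python
from itertools import product


def hubbard_onsite(n_up, n_dn, U=1.0):
    """On-site Hubbard energy U * n_up * n_dn for given occupations."""
    return U * n_up * n_dn


def charge_channel(n_up, n_dn, U=1.0):
    """Charge-density part, (U/4) (n_up + n_dn)^2."""
    return 0.25 * U * (n_up + n_dn) ** 2


def spin_channel(n_up, n_dn, U=1.0):
    """Spin-density part, -(U/4) (n_up - n_dn)^2 = -U (S^z)^2."""
    s_z = 0.5 * (n_up - n_dn)
    return -U * s_z ** 2


if __name__ == "__main__":
    # Enumerate the four occupation states (n_up, n_dn) of one site.
    for n_up, n_dn in product((0, 1), repeat=2):
        total = charge_channel(n_up, n_dn) + spin_channel(n_up, n_dn)
        print(n_up, n_dn, hubbard_onsite(n_up, n_dn), total, spin_channel(n_up, n_dn))
```

The sum of the two channels reproduces the Hubbard term on every state, whereas the spin channel alone differs from it by the charge term. That difference is what is discarded when the charge fluctuations are neglected.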

EXERCISE Here, for simplicity, we have isolated a component of the Hubbard interaction which 
couples to the spin degrees of freedom but violates the spin symmetry of the original interaction. 
How could the local interaction be recast in a manner which makes the spin symmetry explicit 
while isolating the coupling to the spin degrees of freedom? 

(a) Making use of the coherent state path integral, express the quantum partition function 
of the interacting system as a functional field integral. Decoupling the local quartic 
interaction by the introduction of a local scalar magnetization field $m_i(\tau)$, integrate out 
the fermionic degrees of freedom and show that the partition function takes the form 

$$ Z = Z_0\int Dm\,\exp\left[-\frac{U}{4}\int d\tau\sum_i m_i^2(\tau) + \mathrm{tr}\ln\left(1 - \frac{U}{2}\sigma_3\hat m\hat G_0\right)\right], $$

where $Z_0$ and $G_{0,p} = (i\epsilon_n - \xi_{\mathbf p})^{-1}$ denote, respectively, the quantum partition function 
and Green function of the non-interacting electron gas, the matrix $\hat m = \{m_i(\tau)\delta_{ij}\delta(\tau - \tau')\}$, 
and $\sigma_i$ are Pauli spin matrices. Left in this form, the expression for the quantum 
partition function is formally exact, but seemingly unmanageable. 

Yet there is an expectation that the system will undergo a phase transition to a magnetic 
phase at some critical value of the interaction, $U = U_c$, signaled by the appearance 
of a non-zero expectation value of the magnetization field $m_i$. This being so, a mean-field 
description of the transition can be developed along two complementary lines. 
Firstly, when far below the transition, one expects that the quantum partition function 
is characterized by a well-developed saddle-point of the field integral. By minimizing 
the action with respect to $m_i$, the corresponding saddle-point equation can be used to 
track the development of magnetic order (exercise). Alternatively, if the transition to 
the magnetic phase is of second order (i.e. the expectation value of the magnetization 
field grows continuously from zero as the interaction is increased through $U_c$), a field theory 
of the system near the critical point can be developed as a perturbative expansion 
of the action in powers of the magnetization field. It is this program to which we now 
turn. 

(b) Drawing on the RPA expansion of the weakly interacting electron gas discussed in the 
main text, expand the action to fourth order in the magnetization field. Subjecting the 
magnetic susceptibility to a gradient expansion, show that the action takes the general 
form (where $\int dx \equiv \int d\tau\,d^dx$) 

$$ S[m] = \frac{U^2\nu}{4}\sum_q\left[r + \xi^2\mathbf q^2 + \frac{|\omega_n|}{v|\mathbf q|}\right]|m_q|^2 + \frac{u}{4}\int dx\,m^4(x) + \cdots, $$

where $r = 1/(U\nu) - 1$, $v = v_Fc$, and c is some numerical constant. 98 Identify the 
coefficients of the expansion. (Hint: Recall the discussion of the Lindhard function on 
page 218; at fourth order in the expansion you will encounter the product of four fermion 
Green functions. Assuming that the momentum q carried by the magnetization field is 
much smaller than the typical momentum of the electronic single-particle states, you 
may approximate the product of Green functions by a constant whose value you need 
not specify.) 

Finally, rescaling the magnetization field and u, bring the action into the form 

$$ S[m] = \frac{1}{2}\sum_q\left[r + \xi^2\mathbf q^2 + \frac{|\omega_n|}{v|\mathbf q|}\right]|m_q|^2 + \frac{u}{4}\int dx\,m^4(x). $$


(c) In the mean-field approximation, show that the system exhibits a (Stoner) transition to 
a spin-polarized phase when $U_c\nu = 1$. 

INFO Although the mean-field theory provides a good qualitative understanding of the nature of 
the transition, the Stoner criterion itself is unreliable. In the lattice system, the density of states 
is typically set by the bandwidth, i.e. $\nu \sim 1/t$. Therefore, at the Stoner transition where $U/t \sim 1$, 
the system enters the strongly correlated phase where the interaction cannot be considered as a 
small perturbation. In this regime, the electrons experience an effective interaction renormalized 
by the screening effect of the charge redistribution. Following Kanamori, 99 an estimate for the 
effective interaction, $U_{\rm eff} \sim U/(1 + U/t)$, shows that, in the relevant regime, the Stoner criterion 
is replaced by the condition $\nu t > 1$. Typically, for a smoothly varying density of states, $\nu \sim 1/t$ 
and the inequality is difficult to satisfy. In practice, the Stoner transition to ferromagnetism 
tends to appear in materials where there is significant enhancement of the density of states near 
the Fermi energy. 
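The saturation of the screened interaction can be seen in a few lines. The sketch below takes the quoted estimate $U_{\rm eff} = U/(1 + U/t)$ at face value (treating the "∼" as an equality is an assumption made for illustration) and the function names are hypothetical.

```python
def kanamori_u_eff(U, t):
    """Screened interaction U_eff = U / (1 + U/t), as quoted in the text."""
    if t <= 0:
        raise ValueError("bandwidth scale t must be positive")
    return U / (1.0 + U / t)


def stoner_unstable(U, t, nu):
    """Stoner criterion with the bare U replaced by U_eff: nu * U_eff > 1."""
    return nu * kanamori_u_eff(U, t) > 1.0


if __name__ == "__main__":
    t = 1.0
    for U in (0.1, 1.0, 10.0, 1000.0):
        print(f"U/t = {U:8.1f}   U_eff/t = {kanamori_u_eff(U, t) / t:.4f}")
    # Smooth density of states, nu = 1/t: the criterion is never met.
    print(any(stoner_unstable(U, t, nu=1.0 / t) for U in (1.0, 10.0, 1e3, 1e6)))
    # Enhanced density of states, nu = 2/t: the criterion is met once U > t.
    print(stoner_unstable(5.0, t, nu=2.0 / t))
```

Because $U_{\rm eff}$ never exceeds $t$, the product $\nu U_{\rm eff}$ is bounded by $\nu t$. This is why an enhanced density of states at the Fermi energy is needed in this picture.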


98 The validity of the gradient expansion of the action as an approximation relies on the benignancy of transverse 
fluctuations of the magnetization density. Recent results (D. Belitz, T. R. Kirkpatrick, and T. Vojta, Nonanalytic 
behavior of the spin susceptibility in clean Fermi systems, Phys. Rev. B 55 (1997), 9453-62) have cast 
some doubt on the integrity of this approximation. However, we should regard this pathology of the present 
scheme as an idiosyncrasy of the itinerant ferromagnetic system, while the present action provides a sound 
illustration of the guiding principles. 

99 J. Kanamori, Electron correlation and ferromagnetism of transition metals, Prog. Theor. Phys. 30 (1963), 
275-89. 






Answer: 


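(a) When cast as a field integral, the quantum partition function takes the form 

$$ Z = \int D(\bar\psi,\psi)\,\exp\left\{-\int d\tau\left[\sum_{\mathbf p}\bar\psi_{\mathbf p\sigma}(\partial_\tau + \xi_{\mathbf p})\psi_{\mathbf p\sigma} - \frac{U}{4}\sum_i\left(\bar\psi_{i\sigma}(\sigma_3)_{\sigma\sigma'}\psi_{i\sigma'}\right)^2\right]\right\}, $$

where the sum over repeated spin indices is assumed. In this form, the interaction may 
be decoupled with the introduction of a commuting scalar field conjugate to the local 
magnetization density, 

$$ Z = \int Dm\int D(\bar\psi,\psi)\,\exp\left\{-\int d\tau\left[\frac{U}{4}\sum_i m_i^2(\tau) + \sum_{\mathbf p}\bar\psi_{\mathbf p\sigma}(\partial_\tau + \xi_{\mathbf p})\psi_{\mathbf p\sigma} + \frac{U}{2}\sum_i\bar\psi_{i\sigma}(\sigma_3)_{\sigma\sigma'}m_i\psi_{i\sigma'}\right]\right\}. $$

Finally, integrating over the fermionic degrees of freedom, one obtains the required 
partition function. 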

(b) The expansion of the action mirrors closely the RPA of the weakly interacting electron 
gas. Carried out to fourth order, it may be confirmed straightforwardly that terms odd 
in powers of m vanish identically (a property compatible with the symmetry $m \to -m$), 
while the even terms lead to the expression $Z = Z_0\int Dm\,e^{-S[m]}$, where 

$$ S[m] = \frac{1}{2}\sum_q v_2(q)|m_q|^2 + \frac{1}{4N}\sum_{q_i}v_4(\{q_i\})\prod_{i=1}^4 m_{q_i}\,\delta_{\sum_{i=1}^4 q_i,0} + \cdots. $$

Focusing on the second-order contribution, one finds that $v_2(q) = \frac{U}{2}(1 - U\Pi_q)$, where 
$\Pi_q$ represents the familiar fermion bubble or Lindhard function, 

$$ \Pi_q = -\frac{T}{N}\sum_k G_{0,k}G_{0,k+q} = -\frac{1}{N}\sum_{\mathbf k}\frac{n_F(\epsilon_{\mathbf k}) - n_F(\epsilon_{\mathbf k+\mathbf q})}{i\omega_n + \epsilon_{\mathbf k} - \epsilon_{\mathbf k+\mathbf q}}. $$

As one would expect, the existence and nature of the magnetic instability depend sensitively 
on the susceptibility $v_2(q)$, which, in turn, depends on the detailed structure of the 
spectrum $\epsilon_{\mathbf k}$. The static susceptibility is dominated by contributions to the momentum 
sum where $\epsilon_{\mathbf k} \simeq \epsilon_{\mathbf k+\mathbf q}$. This involves the regions of reciprocal space where q is small or 
where, for some non-zero $\mathbf q = \mathbf Q$, $\epsilon_{\mathbf k} \simeq \epsilon_{\mathbf k+\mathbf Q}$ over a wide region of the Brillouin zone. 
The second condition reflects a nesting symmetry where a translation by a constant 
wavevector leaves the spectrum invariant. It is in this situation, where commensurability 
effects become significant, that magnetic or spin density waves develop. If, instead, 
the spectrum varies smoothly, so that nesting symmetry is absent, the susceptibility is 
maximized for $\mathbf q = 0$. Leaving the spin density wave system for “private investigation,” 
we focus here on the channel $\mathbf q = 0$. 
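To illustrate what a nesting vector looks like in practice, the sketch below uses a nearest-neighbour tight-binding band on the square lattice. This model, the hopping t, and the function names are assumptions made for illustration, since the text leaves the dispersion unspecified. For this band $\epsilon_{\mathbf k+\mathbf Q} = -\epsilon_{\mathbf k}$ with $\mathbf Q = (\pi,\pi)$, so at half filling the Fermi surface $\epsilon_{\mathbf k} = 0$ is mapped onto itself and the energy denominator of the static Lindhard function vanishes along all of it.

```python
import math


def epsilon(kx, ky, t=1.0):
    """Nearest-neighbour tight-binding band on the square lattice (assumed model)."""
    return -2.0 * t * (math.cos(kx) + math.cos(ky))


def max_nesting_violation(Q, n_grid=64, t=1.0):
    """Largest |eps(k+Q) + eps(k)| over a uniform grid of the Brillouin zone."""
    worst = 0.0
    for ix in range(n_grid):
        for iy in range(n_grid):
            kx = -math.pi + 2.0 * math.pi * ix / n_grid
            ky = -math.pi + 2.0 * math.pi * iy / n_grid
            dev = abs(epsilon(kx + Q[0], ky + Q[1], t) + epsilon(kx, ky, t))
            worst = max(worst, dev)
    return worst


if __name__ == "__main__":
    # Candidate nesting vector Q = (pi, pi): deviation at rounding level.
    print(max_nesting_violation((math.pi, math.pi)))
    # Generic vector Q = (pi, 0): deviation of order the bandwidth.
    print(max_nesting_violation((math.pi, 0.0)))
```

The first number is zero up to rounding, the second is of the order of the bandwidth, which distinguishes a nesting vector from a generic one for this model.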


EXERCISE Explain why bipartite lattices lead naturally to nesting symmetry. 

While, in general, the Lindhard function assumes a complicated structure reflecting 
the detailed dispersion of the non-interacting system, for the free electron system $\epsilon_{\mathbf k} = \mathbf k^2/2m$ 
an exact expression can be derived explicitly (see Eq. (5.30)). In particular, 
at frequencies $|\omega_n|/|\mathbf q|v_F$ small, this expression has the expansion $\Pi_q \simeq \Pi_{0,\mathbf q} - \nu|\omega_n|/(v|\mathbf q|)$, 
where $v = cv_F$ with a constant c depending on dimensionality. (In the three-dimensional 
system, $c = \pi/2$.) Similarly, for q small, one may expand the static susceptibility in 
gradients, i.e. $\Pi_{0,\mathbf q} \simeq \nu[1 - \xi^2\mathbf q^2 + \cdots]$, where $\xi \sim 1/k_F$. 

Turning now to the quartic interaction, the general structure of $v_4$ again involves a 
complex computation. However, since its effect near the critical point is merely to control 
the strength of the magnetization, in the following we may focus on the frequency- and 
momentum-independent contribution, setting 

$$ v_4(\{q_i\}) \simeq v_4(0) \equiv u \propto \sum_p[G_0(p)]^4 \sim U^4\nu'', $$

where $\nu'' = \partial_\epsilon^2\big|_{\epsilon = E_F}\nu(\epsilon)$. Taken together with the quadratic interaction, the total effective 
action assumes the form given above. 

(c) In the leading approximation, an understanding of the model can be developed by ignoring 
the effect of fluctuations - spatial and temporal. In such a mean-field approximation, 
the fields m are assumed independent of q (or x). In this case, the rescaled action takes 
the form (up to an overall space-time volume factor) 

$$ S[m] = \frac{r}{2}m^2 + \frac{u}{4}m^4, $$

where $r = U(1 - U\nu)/2$. Minimizing the action with respect to m, one finds a Stoner 
transition to a spin-polarized ferromagnetic phase when $r = 0$, i.e. $U_c\nu = 1$. At values 
of U in excess of $U_c$, the (oversimplified) mean-field analysis above predicts that the 
magnetization will grow in a continuous yet non-analytic manner, i.e. $m = \sqrt{-r/u}$. 
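The mean-field statement can be checked numerically. The sketch below assumes a uniform action density of the form $(r/2)m^2 + (u/4)m^4$ with $u > 0$, for which the stationarity condition $rm + um^3 = 0$ gives $m = \sqrt{-r/u}$ for $r < 0$; the function names and the brute-force scan are illustrative only.

```python
import math


def landau_action(m, r, u):
    """Uniform mean-field action density (r/2) m^2 + (u/4) m^4."""
    return 0.5 * r * m ** 2 + 0.25 * u * m ** 4


def mean_field_magnetization(r, u):
    """Non-negative minimizer from the stationarity condition r m + u m^3 = 0."""
    if u <= 0:
        raise ValueError("u must be positive for the action to be bounded below")
    return math.sqrt(-r / u) if r < 0 else 0.0


def grid_minimizer(r, u, m_max=3.0, n_steps=60000):
    """Brute-force check: scan m in [0, m_max] and return the best grid point."""
    best_m, best_s = 0.0, landau_action(0.0, r, u)
    for i in range(1, n_steps + 1):
        m = m_max * i / n_steps
        s = landau_action(m, r, u)
        if s < best_s:
            best_m, best_s = m, s
    return best_m


if __name__ == "__main__":
    u = 2.0
    # r > 0: paramagnet (m = 0); r < 0: magnetization grows as sqrt(|r|).
    for r in (0.5, 0.0, -0.02, -0.5):
        print(r, mean_field_magnetization(r, u), grid_minimizer(r, u))
```

The square-root onset is the "continuous yet non-analytic" growth referred to above: m vanishes continuously at $r = 0$, but its derivative with respect to r diverges there.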
Functional bosonization 


The techniques introduced in this chapter are engaged to develop a functional-integral-oriented scheme 
of bosonizing the one-dimensional electron gas. 

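Consider the interacting one-dimensional electron gas as described by the relativistic action 
Eq. (4.43) and the interaction contribution Eq. (4.44). Throughout, it will be convenient to 
formulate the latter in a matrix representation, 

$$ S_{\rm int}[\bar\psi,\psi] = \frac{1}{2}\int d\tau\,dx\,(\hat\rho_+\ \hat\rho_-)\begin{pmatrix} g_4 & g_2 \\ g_2 & g_4\end{pmatrix}\begin{pmatrix}\hat\rho_+ \\ \hat\rho_-\end{pmatrix} \equiv \frac{1}{2}\int d\tau\,dx\,(\hat\rho_+\ \hat\rho_-)\,g\begin{pmatrix}\hat\rho_+ \\ \hat\rho_-\end{pmatrix}. $$

To probe the response of the system to external perturbations, we add to the action a source 
term $S_{\rm source}[\bar\psi,\psi] = \int d\tau\,dx\sum_{s=\pm}(\bar\psi_sj_s + \bar j_s\psi_s)$. By Grassmann differentiation with 
respect to the source fields $j$ and $\bar j$, we may then generate any correlation function of 
interest. 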

(a) Decouple the four-fermion interaction by introducing a two-component Hubbard-Stratonovich 
field $\varphi^T = (\varphi_+,\varphi_-)$. Show that the action can be written as 

$$ S[\bar\psi,\psi,\varphi] = \frac{1}{2}\int d^2x\,\varphi^Tg^{-1}\varphi - \int d^2x\,\bar\psi(\not\partial - i\not\varphi)\psi, \qquad (6.74) $$

where $\psi^T = (\psi_+,\psi_-)$, $\bar\psi = (\bar\psi_-,\bar\psi_+)$, and we have switched to a covariant notation: $x_1 = \tau$, 
$x_2 = x$, $d^2x = dx_1dx_2$, $\not\partial = \sigma_\mu\partial_\mu$, and $\not\varphi = \sigma_\mu\varphi_\mu$. The space-time components 
of the interaction field are defined by $\varphi_1 = \frac{1}{2}(\varphi_+ + \varphi_-)$ and $\varphi_2 = \frac{1}{2i}(\varphi_+ - \varphi_-)$. 

The interaction field $\varphi$ couples to the fermion action as a two-dimensional vector 
potential. As with any two-component vector, the coefficients of $\varphi$ can be decomposed 
into an irrotational and a divergenceless contribution (the Hodge decomposition): 
$\varphi_\mu = -(\partial_\mu\xi + i\epsilon_{\mu\nu}\partial_\nu\eta)$. This is an interesting decomposition as it suggests that the vector 
potential can be removed from the action by a generalized gauge transformation. 
Indeed, the transformation $\psi \to e^{i(\xi + \eta\sigma_3)}\psi$, $\bar\psi \to \bar\psi e^{-i(\xi - \eta\sigma_3)}$ naively removes the vector 
potential from the action (exercise: check this). A moment’s thought identifies the two 
transformations by $\xi$ and $\eta$ as the vectorial and axial gauge transformations discussed 
in Section 4.3 above. It turns out, however, that the transformation by $\eta$ is actually not 
permissible - a direct manifestation of the chiral anomaly. 

INFO There are different ways to understand the origin of the chiral gauge symmetry 
violation. For instance, one may observe that the transformation leaves the action invariant, 
but not the measure $d\bar\psi\,d\psi$. 100 This means that the functional integral as a whole lacks 
gauge invariance. Alternatively, one may integrate out the fermions and realize that the 
resulting “tr ln” exhibits problematic UV behavior. Applying a UV regularization scheme - 
of which, due to the importance of the Dirac operator in particle physics, there are many - 
one observes that the chiral gauge invariance gets lost. 101 
Notice that both the lack of gauge invariance of the measure and the UV problems manifest 
themselves in an integral over (quantum) fluctuations, i.e. while the symmetry is preserved 
on the classical level, it gets lost in the quantum theory. This is the defining property of an 
anomaly. 

100 K. Fujikawa, Chiral anomaly and the Wess-Zumino condition, Phys. Rev. D 31 (1985), 341-51. 

101 For a detailed discussion of this point, see J. Zinn-Justin, Quantum Field Theory and Critical Phenomena 
(Oxford University Press, 1993). 

(b) To explore the consequences of the anomaly, integrate out the fermions and expand the 
resulting “tr ln” to second order in the fields $\varphi_\pm$. Switching to a frequency-momentum 
representation and approximating the Matsubara sum by an integral, one obtains an 
expression that is formally UV divergent. Regularize the integral by introducing a cut-off 
$\Lambda$ in momentum space. Show that the effective $\varphi$-action reads as 

$$ S[\varphi] = \frac{1}{2}\sum_q\varphi_q^T\left(g^{-1} + \Pi_q\right)\varphi_{-q}, \qquad (6.75) $$

where $q = (\omega, q)$ and $\Pi_q \equiv \left\{\delta_{ss'}\frac{1}{2\pi}\frac{q}{-is\omega + q}\right\}$. Finally, introduce the field doublet $\Gamma^T = (\xi,\eta)$ 
to represent the action as 

$$ S[\Gamma] = \frac{1}{2}\sum_q\Gamma_q^TD_q^T\left(g^{-1} + \Pi_q\right)D_{-q}\Gamma_{-q}, \qquad (6.76) $$

where the transformation matrix 

$$ D_q \equiv \begin{pmatrix} q - i\omega & q - i\omega \\ -q - i\omega & q + i\omega\end{pmatrix} $$

mediates between the field variables $\Gamma$ and $\varphi$ (exercise). 

We next turn our attention to the source terms. The integration over the original 
fermion variables generates a source contribution 

$$ S_{\rm source}[\bar\psi,\psi,\bar j,j] \to S[\bar j,j,\Gamma] = \int d^2x\,d^2x'\,\bar j(x)\,G^{[\Gamma]}(x,x')\,j(x') = \int d^2x\,d^2x'\,\left(\bar je^{-i(\xi + \eta\sigma_3)}\right)(x)\,G(x,x')\,\left(e^{i(\xi + \eta\sigma_3)}j\right)(x'), $$

where x are space/time indices, the superscript $[\Gamma]$ indicates that the fermion Green function 
depends on the Hubbard-Stratonovich interaction fields, and in the last step we have 
applied the generalized gauge transformation above to transfer the $(\xi,\eta)$-dependence 
to the source vectors j. The action above contains the free fermion Green function (a 
matrix in both space-time and ±-space) as an integration kernel. To proceed, notice 
that matrix elements of the fermion Green function can be obtained as correlation functions 
of a free bosonic theory. This connection was introduced in Problem 4.5 on the 
example of a specific free fermion correlation function. Generalizing the results of that 
problem, one may verify that (exercise) 

$$ G_{ss'}(x,x') = (2\pi a)^{-1}\left\langle e^{-i(\phi + s\theta)(x)}\,e^{i(\phi + s'\theta)(x')}\right\rangle, \qquad (6.77) $$

where a is the lattice spacing and the action of the bosonic doublet $\Xi^T = (\phi,\theta)$ is given 
by 

$$ S_0[\Xi] = \frac{1}{2}\sum_q\Xi_q^TK_q\Xi_{-q}, \qquad K_q \equiv \frac{1}{\pi}\left(q^2 - iq\omega\sigma_1\right), \qquad (6.78) $$

i.e. a non-interacting variant of the Luttinger liquid action (cf. Eq. (4.49)). 

(c) Use the Fermi-Bose correspondence to represent the generating function as a double 
field integral over $\Gamma$ and $\Xi$. Next shift the integration variables to remove the field $\Gamma$ 
from the source action and perform the quadratic integral over $\Gamma$. Show that the final 
form of the action is given by (4.51). Summarizing, you have rediscovered the action 
of the interacting Luttinger liquid, and the boson representation of fermion correlation 
functions (the latter obtained by differentiation with respect to the source parameters 
j). While the present derivation is certainly less transparent than the one discussed in 
Section 4.3, it has the advantage of being more “explicit” (inasmuch as we start from 
the standard “tr ln,” which is then subjected to manipulations standard in many-body 
field theory). On the other hand, the authors are not aware of applications where this 
aspect turned out to be of much practical relevance: usually, the standard bosonization 
approach is just fine; and, where it is not, the formalism above would not be any better. 


Answer: 


(a) This is resolved by a straightforward exercise in Gaussian integration and reorganizing 
indices. 

(b) Integrating over fermions and momentarily forgetting about the impurity, we obtain the 
effective action 

$$ \begin{aligned} S[\varphi] &= \frac{1}{2}\int d^2x\,\varphi^Tg^{-1}\varphi - \mathrm{tr}\ln(\not\partial - i\not\varphi) \\ &= \frac{1}{2}\int d^2x\,\varphi^Tg^{-1}\varphi - \frac{1}{2}\mathrm{tr}\left(\not\varphi\,\not\partial^{-1}\not\varphi\,\not\partial^{-1}\right) + O(\varphi^4) \\ &= \frac{1}{2}\sum_q\varphi_{q,s}\left(g^{-1}_{ss'} + \delta_{ss'}\Pi_{s,q}\right)\varphi_{s',-q} + O(\varphi^4), \end{aligned} $$

where $q = (\omega,q)$ and $\Pi_{s,q} = \frac{1}{(2\pi)^2}\int d^2p\,(\epsilon + isp)^{-1}(\epsilon + \omega + is(p + q))^{-1}$. Evidently, the 
structure of this integral poses a problem: while all poles of the integrand appear to be 
on one side of the real axis (so that analyticity arguments might suggest a vanishing of 
the integral), the double integral is manifestly divergent. We are, thus, confronted with 
a $0\cdot\infty$ conflict and a regularization scheme is called for. (At orders $n > 2$ in the 
$\varphi$ expansion, one “fast” momentum integration extends over $n > 2$ energy denominators. 
These terms indeed vanish by analyticity, i.e. the second-order expansion of the 
logarithm is, in fact, exact.) To some extent, the choice of the regularization scheme is 
dictated by the physical context: in particle physics, relativistic covariance is sacred, and 
a rotationally invariant (Euclidean formalism!) regularization is required. However, in 
condensed matter physics, where the effective action is obtained by linearization of some 
band Hamiltonian, frequency and momentum play different roles. While the integration 
domain of the former is infinite, the latter is bounded to values $|p| < \Lambda$, where $\Lambda$ is some 
cutoff. To proceed, we first integrate over frequencies and then do the finite integral over 
momentum: 

$$ \Pi_{s,q} = \int_{-\Lambda}^{\Lambda}\frac{dp}{2\pi}\int\frac{d\epsilon}{2\pi}\,\frac{1}{(\epsilon + isp)(\epsilon + \omega + is(p + q))} = -\frac{i}{2}\int_{-\Lambda}^{\Lambda}\frac{dp}{2\pi}\,\frac{\mathrm{sgn}(sp) - \mathrm{sgn}(s(p + q))}{\omega + isq} = \frac{1}{2\pi}\frac{q}{-is\omega + q}. $$

(Notice that the result is actually independent of the non-universal cutoff $\Lambda$.) Substituting 
this result into the action, we obtain Eq. (6.75). 

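(c) Representing the fermion Green function as in Eq. (6.77), we obtain the local expression 

$$ Z = \int D\Xi\,D\Gamma\;e^{-S_0[\Xi] - S[\Gamma]}\exp\left[\int d^2x\left(\bar j\,e^{-i(\xi + \phi + (\eta + \theta)\sigma_3)} + e^{i(\xi + \phi + (\eta + \theta)\sigma_3)}\,j\right)\right], $$

where the non-universal factor $2\pi a$ has been absorbed in the definition of the source 
fields. (To confirm that the $\Xi$-integral faithfully reproduces the source action, one has 
to take into account the fact that $\exp(i(\phi \pm \theta)) \leftrightarrow \psi$ is a Gaussian correlated variable.) 
The structure of the source term suggests a shift $\phi \to \phi - \xi$, $\theta \to \theta - \eta$, or $\Xi \to \Xi - \Gamma$ 
for short. Denoting the now $\Gamma$-independent source contribution by $\exp(-S_{\rm source}[\Xi])$, the 
partition function assumes the form $Z = \int D\Xi\,D\Gamma\,e^{-S_0[\Xi - \Gamma] - S[\Gamma] - S_{\rm source}[\Xi]}$. We further 
note that $K_q = -D_q^T\Pi_qD_{-q}$ to obtain the integral over $\Gamma$, 

$$ \begin{aligned} S_0[\Xi - \Gamma] + S[\Gamma] &= \frac{1}{2}\sum_q\left[\Xi_q^TK_q\Xi_{-q} + \Gamma_q^T\left(D_q^Tg^{-1}D_{-q}\right)\Gamma_{-q} - \Gamma_q^TK_q\Xi_{-q} - \Xi_q^TK_q\Gamma_{-q}\right] \\ &\xrightarrow{\int D\Gamma}\ \frac{1}{2}\sum_q\Xi_q^T\left[K_q + \frac{q^2}{2\pi^2}(g_4 - g_2\sigma_3)\right]\Xi_{-q} \\ &= \frac{1}{2\pi}\sum_q\Xi_q^T\begin{pmatrix} q^2\left[1 + \frac{1}{2\pi}(g_4 - g_2)\right] & -iq\omega \\ -iq\omega & q^2\left[1 + \frac{1}{2\pi}(g_4 + g_2)\right]\end{pmatrix}\Xi_{-q}, \end{aligned} $$

where we have used the fact that $K_qD_{-q}^{-1} = -\frac{q}{2\pi}(1 - i\sigma_2)$. Transforming back to real 
space/time, we obtain Eq. (4.51). 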
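The matrix identities used in this last step are easy to verify numerically. The sketch below assumes the explicit forms $D_q = \begin{pmatrix} q - i\omega & q - i\omega \\ -q - i\omega & q + i\omega\end{pmatrix}$, $\Pi_q = \mathrm{diag}\left(\frac{q}{2\pi(q - i\omega)}, \frac{q}{2\pi(q + i\omega)}\right)$, $K_q = \frac{1}{\pi}(q^2 - iq\omega\sigma_1)$ and $g = g_4 + g_2\sigma_1$; these forms, the helper names, and the sample values are assumptions of the sketch. It checks that $K_q = -D_q^T\Pi_qD_{-q}$ and that integrating out $\Gamma$ leaves the kernel $K_q + \frac{q^2}{2\pi^2}(g_4 - g_2\sigma_3)$.

```python
import math


def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def transpose(a):
    return [[a[j][i] for j in range(2)] for i in range(2)]


def inverse(a):
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det], [-a[1][0] / det, a[0][0] / det]]


def d_matrix(q, w):
    """Assumed transformation matrix D_q relating (xi, eta) to (phi_+, phi_-)."""
    return [[q - 1j * w, q - 1j * w], [-q - 1j * w, q + 1j * w]]


def pi_matrix(q, w):
    """Assumed polarization, diagonal in the chirality index s = +1, -1."""
    return [[q / (2 * math.pi * (q - 1j * w)), 0], [0, q / (2 * math.pi * (q + 1j * w))]]


def k_matrix(q, w):
    """Assumed free boson kernel K_q = (q^2 - i q w sigma_1) / pi."""
    return [[q * q / math.pi, -1j * q * w / math.pi], [-1j * q * w / math.pi, q * q / math.pi]]


def effective_kernel(q, w, g4, g2):
    """K - K A^{-1} K with A = D_q^T g^{-1} D_{-q}: what remains after the Gamma integral."""
    g = [[g4, g2], [g2, g4]]
    a = matmul(matmul(transpose(d_matrix(q, w)), inverse(g)), d_matrix(-q, -w))
    k = k_matrix(q, w)
    corr = matmul(matmul(k, inverse(a)), k)
    return [[k[i][j] - corr[i][j] for j in range(2)] for i in range(2)]


def expected_kernel(q, w, g4, g2):
    """K + q^2/(2 pi^2) (g4 - g2 sigma_3), the closed form quoted in the answer."""
    k = k_matrix(q, w)
    c = q * q / (2 * math.pi ** 2)
    return [[k[0][0] + c * (g4 - g2), k[0][1]], [k[1][0], k[1][1] + c * (g4 + g2)]]


def max_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


if __name__ == "__main__":
    q, w, g4, g2 = 0.7, 1.3, 0.4, 0.9
    dpd = matmul(matmul(transpose(d_matrix(q, w)), pi_matrix(q, w)), d_matrix(-q, -w))
    minus_dpd = [[-x for x in row] for row in dpd]
    print(max_diff(minus_dpd, k_matrix(q, w)))
    print(max_diff(effective_kernel(q, w, g4, g2), expected_kernel(q, w, g4, g2)))
```

Both printed deviations should be at the level of floating-point rounding for any $q \neq 0$ and $g_4 \neq \pm g_2$, which supports the sign and index conventions assumed above without proving that they coincide with those of Eq. (4.51).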
The chapter begins with a brief survey of concepts and techniques of experimental condensed matter 
physics. It will be shown how correlation functions provide a bridge between concrete experimental 
data and the theoretical formalism developed in previous chapters. Specifically we discuss - an example 
of outstanding practical importance - how the response of many-body systems to various types of 
electromagnetic perturbation can be described in terms of correlation functions and how these functions 
can be computed by field theoretical means. 


In the previous chapters we have introduced important elements of the theory of quan- 
tum many-body systems. Perhaps most importantly, we have learned how to map the basic 
microscopic representations of many-body systems onto effective low-energy models. How- 
ever, to actually test the power of these theories, we need to understand how they can be 
related to experiment. This will be the principal subject of the present chapter. 

Modern condensed matter physics benefits from a plethora of sophisticated and highly 
refined techniques of experimental analysis including the following: electric and thermal 
transport; neutron, electron, Raman, and X-ray scattering; calorimetric measurements; 
induction experiments; and many more (for a short glossary of prominent experimental tech- 
niques, see Section 7.1.2 below). While a comprehensive discussion of modern experimental 
condensed matter would reach well beyond the scope of the present text, it is certainly prof- 
itable to attempt an identification of some structures common to most experimental work 
in many-body physics. Indeed, we will need a discussion of this sort to construct meaningful 
links between the theoretical techniques developed above and experiment. 


7.1 Crash course in modern experimental techniques 
7.1.1 Basic concepts 

Crudely speaking, experimental condensed matter physics can be subdivided into three 1 
broad categories of analytical technique: 

> experiments probing thermodynamic coefficients; 

> transport experiments; 

> spectroscopy. 


1 There are a few classes of experiment that do not fit comfortably into this three-fold scheme. These include 
scanning tunneling microscopy, a technique to be discussed in more detail below. 

A summary of their utility, the basic experimental setup, the principal areas of application, 
and concrete realization of these families is given in the following section. (Readers who are 
totally unfamiliar with the basic notions of experimental many-body physics may find it 
useful to browse through that section before reading further.) The few occasional references 
to experimental data given in previous chapters were all to thermodynamic properties. The 
reason for this restriction was that the extraction of thermodynamic information from the 
theoretical formalism is relatively straightforward: one need only differentiate the partition 
function (alias the field integral) with respect to a few globally defined coefficients (the 
temperature, homogeneous magnetic field, etc.). This simplicity has advantages but also 
limiting aspects: thermodynamic data are highly universal 2 and, therefore, represent an 
important characteristic of a system. On the other hand, they contain information neither 
on spatial structures, nor on dynamical features. This means that thermodynamic data do 
not suffice to fully understand the physics of a system. 

With the other two categories of experiment the situation is different. Transport and 
spectroscopic measurements can be used to probe both static and dynamical features of a 
system; further, fully angle/frequency-resolved spectroscopic data contain detailed informa- 
tion on the spatio-temporal structure of the dominant excitations of a system or, in other 
words, on their dispersion relation. It is for these reasons that the focus in the present 
chapter will be on the last two of the experimental classes mentioned above. 

In spite of the wide diversity of present day analytical techniques, there are a few struc- 
tures common to all experimental probes of condensed matter: 

> Firstly, the interaction of a many-body system with its environment is almost exclusively 
mediated by electromagnetic forces. 3 Accordingly, most experiments subject the 
system under consideration to some external electromagnetic perturbation (a voltage 
drop, an influx of spin magnetic moments carried by a beam of neutrons, the local electric 
field formed at the tip of a scanning tunneling microscope, etc.). In a second step, the 
“response” of the system is then recorded by an appropriate detector or measuring device. 

Formally, the externally imposed perturbation is described by a (time-dependent) contribution 
to the Hamiltonian of the system, 

$$ \hat H_F = \int d^dr\,F'_i(\mathbf r,t)\hat X'_i(\mathbf r). \qquad (7.1) $$

Here, the coefficients $F'_i$, sometimes referred to as generalized “forces,” represent a perturbation 
that couples to the system through some operators $\hat X'_i$. For example, $F'_i(\mathbf r,t) = \phi(\mathbf r,t)$ 
could represent a space- and time-dependent electric voltage coupling to the density 
of the electronic charge carriers in the system, $\hat X'_i = \hat\rho$, etc. 
> The use of the term “perturbation” is appropriate because the forces $\{F'_i\}$ will, in general, 
be much weaker than the internal correlations of the system. 
> The forces perturb the system out of its $F' = 0$ reference state. The measurable effect 
of this perturbation will be that certain operators $\hat X_i$, whose expectation values vanish 
in the unperturbed state, build up non-vanishing expectation values $X_i(\mathbf r,t) = \langle\hat X_i(\mathbf r,t)\rangle$. 
(For example, in response to an external voltage $F' = \phi$, a current might begin to flow, 
$X_i = j_i$, etc.) The ultimate goal of any theory will be to understand and predict the 
functional dependence of the measured values of $X_i$ on the forces $F'$. 

2 Remember that a few thermodynamic variables, i.e. numbers, suffice to unambiguously characterize the state of 
a homogeneous system in equilibrium. 

3 An important exception involves heat conduction. 

> In general, there is nothing one can say about that connection other than that $X_i[F'_j]$ 
will be some functional of the forces. However, for a sufficiently weak force, the situation 
is simpler. In this case, one may expect that the functional relation between forces $F'$ 
and the expectation values $X_i$ is approximately linear, i.e. of the form 

$$ X_i(\mathbf r,t) = \int d^dr'\int dt'\,\chi_{ij}(\mathbf r,t;\mathbf r',t')F'_j(\mathbf r',t') + O(F'^2). \qquad (7.2) $$

While the quantities $\{F'_j\}$ and $\{X_i\}$ are externally adjustable/observable - either as an 
experimental input/output, or as parameters in the theory - the integral kernel $\chi$ represents 
a purely intrinsic property of the system. It describes how the system “responds” 
to the application of an external probe $\{F'_j\}$ (wherefore it is commonly referred to as a 
response function or generalized susceptibility). The functional profile of the response 
kernel is in turn determined by the dominant excitations of a system (notably its long-range 
excitations), i.e. our prime objects of interest. 

These considerations show that response functions play a principal role in promoting the
dialog between experiment and theory. Experimentally, they will be measured by relating
the input $\{F'_j\}$ to the response $\{X_i\}$. Theory will attempt to predict the response
behavior, ideally in a way that conforms with experimental observation.


7.1.2 Experimental methods 

To keep our discussion of the relation “experiment ↔ theory” less abstract, it is instructive
to list a few prominent experimental techniques of condensed matter physics. Of course, the
summary below can be no more than an introduction. 4 Our intention is merely to illustrate
the connection (perturbation → response) through a few examples; indeed readers lacking
a background in experimental many-body physics may welcome some motivation before
plunging into the formalism of correlation functions and response developed below.

Thermodynamic experiments 

Of the thermodynamic properties of a system that can be accessed in experiment, those most
commonly investigated include the following: the specific heat, $c_v = \partial U/\partial T$,
i.e. the rate of change of the internal energy under a change of temperature; the magnetic
susceptibility, $\chi = \partial M/\partial H$, the change of magnetization in response to a
(quasi-)static magnetic field; the (isothermal) compressibility,
$\kappa = -V^{-1}\, \partial V/\partial p$, the volume change in response to the external
pressure, etc. Note that, strictly speaking, the magnetic susceptibility and the
isothermal compressibility are both tensor quantities.

4 For a short and pedagogical (if perhaps a bit outdated) introduction to a number of experimental approaches we
refer to the classic text by N. W. Ashcroft and N. D. Mermin, Solid State Physics (Holt-Saunders International,
1983), while, for an up-to-date and detailed exposition of methods of spectroscopy, we refer to H. Kuzmany,
Solid State Spectroscopy (Springer-Verlag, 1998).

The thermodynamic response functions are highly universal. (Remember that a few
thermodynamic state variables suffice to unambiguously characterize the state of a given
system.) Specifically, for given values of chemical potential, magnetic field, pressure,
etc., a calorimetric experiment will produce a one-dimensional function $c_v(T)$. The
low-temperature profile of that function generally contains important hints as to the nature of
the low-energy excitations of a system. 5 However, the universality of thermodynamic data
also implies a limitation: thermodynamic coefficients contain information neither about the
spatial fluctuations of a given system nor about its dynamics.
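
To make the remark about the low-temperature profile of $c_v(T)$ concrete, the following sketch evaluates the specific heat of a single two-level system with excitation gap `gap`. The two-level model, the units ($k_B = 1$), and all function names are assumptions made here for illustration; they are not taken from the text. The point is only that a gapped spectrum leads to an exponentially small $c_v$ at temperatures well below the gap.

```python
import numpy as np

def internal_energy(T, gap=1.0):
    """U(T) of a two-level system with levels 0 and `gap` (units with k_B = 1)."""
    return gap / (np.exp(gap / T) + 1.0)

def specific_heat(T, gap=1.0, dT=1e-6):
    """c_v = dU/dT, estimated with a central finite difference."""
    return (internal_energy(T + dT, gap) - internal_energy(T - dT, gap)) / (2.0 * dT)

def specific_heat_exact(T, gap=1.0):
    """Closed form of dU/dT for the same toy model, used as a cross-check."""
    x = gap / T
    return x**2 * np.exp(x) / (np.exp(x) + 1.0) ** 2

# Print T, the numerical c_v, and the closed form for a few temperatures.
for T in (0.05, 0.1, 0.2, 0.5):
    print(T, specific_heat(T), specific_heat_exact(T))
```

For temperatures far below the gap the closed form behaves as $(\Delta/T)^2 e^{-\Delta/T}$, which is the exponential suppression expected in the absence of low-lying excitations.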


Transport experiments 

When subject to a gradient of a generalized “voltage” $U$, a current flows through a device
(see figure). Although, typically, the voltage is electrical, $U = V$, one can apply a
temperature gradient $U = \Delta T$ or even attach the sample to two “reservoirs” of different
magnetizations $U = \Delta M$. One then records the current flow induced by $U$. The
corresponding current can be electrical, $I$, carried by the charge of mobile carriers, the
“thermal current” $I_t$ carried by their energy, or a “spin current” $I_s$ carried by their
magnetic moments. Also notice that the current need not necessarily be parallel to the voltage
gradient. For example, in the presence of a perpendicular magnetic field, a voltage gradient
will give rise to a transverse Hall current $I_\perp$. The ratio of a current and a generalized
voltage defines a conductance, $g = I/U$.

Conductance measurements represent the most common way to determine the transport 
behavior of a metal or the thermal conduction properties of insulators. A disadvantage is 
that conductance measurements are invasive, i.e. the system has to be attached to contacts. 
There are situations where the local injection process of charge carriers at the contact 
(rather than the bulk transport behavior in which one is interested) determines the value 
of the conductance. (For a further discussion of this point, we refer to Problem 7.6.1.) 



Spectroscopic experiments 

The general setup of a spectroscopic experiment is shown in Fig. 7.1. A beam of particles $p$ -
either massive (electrons, neutrons, muons, atoms, etc.), or the quanta of electromagnetic
radiation - is generated at a source and then directed onto a sample. The kinematic
information about the source beam is stored in the dispersion relation
$(\mathbf{k}, \omega(\mathbf{k}))$. 6 The particles


5 For example, the specific heat of the Fermi liquid, $c_{v,\text{Fermi liquid}} \sim T$, is linear in temperature, that of phonons,
$c_{v,\text{phonon}} \sim T^3$, is cubic, while in a system without low-lying excitations, it vanishes exponentially (exercise:
consider why).

6 For some sources, e.g. a laser, the preparation of a near-monochromatic source beam is (by now) standard. For
others, such as neutrons, it requires enormous experimental skills (and a lot of money!).






Figure 7.1 Basic setup of a spectroscopic experiment. A beam of electromagnetic radiation (or
massive particles) of frequency-momentum $(\omega(\mathbf{k}), \mathbf{k})$ is emitted by some source (S) and directed
onto a target sample. The sample responds by emitting radiation according to some distribution
$P(\omega'(\mathbf{k}'), \mathbf{k}')$, which is, in turn, recorded by a detector (D). Notice that the emitted radiation can,
but need not, contain the same type of particles as the source radiation. For example, light quanta
may lead to the emission of electrons (photoemission spectroscopy).


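of the source beam then interact with constituents $X$ of the sample to generate a secondary
beam of scattered particles $p'$. Symbolically,

$$\begin{array}{ccccccc}
p & + & X & \longrightarrow & p' & + & X' \\
\updownarrow & & \updownarrow & & \updownarrow & & \updownarrow \\
\mathbf{k}, \omega(\mathbf{k}) & & \mathbf{K}, \Omega(\mathbf{K}) & & \mathbf{k}', \omega(\mathbf{k}') & & \mathbf{K}', \Omega(\mathbf{K}'),
\end{array}$$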


where X' represents the final state of the process inside the sample. Notice that the particles 
p' leaving the sample need not be identical to those incident on the sample. (For example, in 
photoemission spectroscopy, X-ray quanta displace electrons from the core levels of atoms in 
a solid. Here p represent the light quanta, and p' electrons.) Further, the dominant scattering 
process may be elastic (e.g. X-rays scattering off the static lattice structure) or inelastic 
(e.g. neutrons scattering off phononic excitations) . In either case, the accessible information 
about the scattering process is stored in the frequency-momentum distribution
$P(\omega(\mathbf{k}'), \mathbf{k}')$ of the scattered particles, as monitored by a detector.

Sir Chandrasekhara V. Raman 1888-1970 (left), Lord Rayleigh (John William Strutt) 1842-1919
(middle), Max von Laue 1879-1960 (right)
Raman was awarded the Nobel Prize in Physics in 1930 “for his work on the scattering of light
and for the discovery of the effect named after him.” Lord Rayleigh was awarded the Nobel Prize
in Physics in 1904 “for his investigations of the densities of the most important gases and for
his discovery of argon in connection with these studies.” Laue was awarded the Nobel Prize in
Physics in 1914 “for his discovery of the diffraction of X-rays by crystals.” (Image © The Nobel
Foundation.)

From these data, one would like to restore the properties (i.e. the dispersion
$(\Omega(\mathbf{K}), \mathbf{K})$) of the states inside the solid. This is where the detective
work of spectroscopy begins. What we know is that the dispersions of the scattered particles
and of the sample constituents, $(\mathbf{k}, \omega(\mathbf{k}))$ and
$(\mathbf{K}, \Omega(\mathbf{K}))$, respectively, are related through an energy-momentum
conservation law, i.e.

$$\mathbf{k} + \mathbf{K} = \mathbf{k}' + \mathbf{K}',$$
$$\omega(\mathbf{k}) + \Omega(\mathbf{K}) = \omega(\mathbf{k}') + \Omega(\mathbf{K}').$$

According to this relation, a “resonant” peak in the recorded distribution
$P(\omega(\mathbf{k}'), \mathbf{k}')$ signals the existence of an internal structure (for example,
an excitation, or a lattice structure) of momentum
$\Delta\mathbf{K} \equiv \mathbf{K}' - \mathbf{K} = \mathbf{k} - \mathbf{k}'$ and frequency
$\Delta\Omega \equiv \Omega' - \Omega = \omega - \omega'$. However, what
sounds like a straightforward recipe in principle may be quite involved in practice: solid 
state components interact almost exclusively through electromagnetic forces. When charged 
particles are used as scattering probes, these interactions may actually turn out to be too 
strong. For example, a beam of electrons may interact strongly with the surface states of 
a solid (rather than probing its bulk), or the scattering amplitude may be the result of a 
complicated process of large order in the interaction parameters, and therefore difficult to 
interpret. The obvious alternative - scattering of neutral particles - is met with its own 
problems (see below). Notwithstanding these difficulties, spectroscopy is one of the most 
important sources of experimental information in condensed matter physics. A number of 
prominent “sub-disciplines” of solid state spectroscopy are summarized below: 

> Raman spectroscopy: The inelastic scattering of visible light can be used to explore 
the dispersion of optical phonons (magnons, plasmons, or other electronic excitations). 
Such techniques require experimental skill to discriminate the “Raman peak” from the 
much larger “Rayleigh peak” corresponding to the elastic scattering of light quanta. 

> Infrared spectroscopy: The scattering of light in the infrared range can be used to 
explore the vibrational modes in polycrystalline solids and the band-gaps in semiconduc- 
tors. 

> X-ray crystallography: By measuring the angle-resolved intensity profile (the von Laue
pattern), the elastic scattering of X-rays from the lattice ions of a solid can be used
to infer the structure of a crystalline substance. (Notice that the typical wavelength of
X-rays, $\sim 10^{-10}$ m, is of about the size of typical interatomic spacings in solids.)
Such techniques of solid state spectroscopy have already acquired a long history dating back
to 1913.

Sir William H. Bragg 1862-1942 (left) and his son Sir William Lawrence Bragg, 1890-1971 (right)
Awarded the Nobel Prize in Physics in 1915 “for their services in the analysis of crystal
structure by means of X-rays.” (Image © The Nobel Foundation.)

Figure 7.2 The different types of X-ray/electron spectroscopy. (a) X-ray absorption: the loss of
X-ray radiation due to the ionization of core levels. (b) X-ray fluorescence: the recombination of
valence electrons with previously X-ray-emptied core levels leads to the emission of radiation, which
also lies in the X-ray range. The spectral analysis of this radiation contains information about the
level structure of the system. (c) Photoemission spectroscopy (PES): detection of the
frequency-dependence of the electrons kicked out by X-ray core level ionization. (d) Auger spectroscopy:
the energy emitted by a valence electron recombining with a core level is transmitted to a second
valence electron which leaves the solid as a high-energy Auger electron.

> X-ray/electron spectroscopy: A number of spectroscopic techniques are based on the
fact that the ionization energies of atomic core levels lie in the X-ray range. In X-ray
absorption spectroscopy the absorption of X-rays by a solid is measured as a function
of the light frequency (see Fig. 7.2(a)). The absorption cross-section rises in a
quasi-discontinuous manner whenever the energy of the X-rays becomes large enough to ionize
an atomic core level of the atoms of the solid. Due to interatomic correlations, these
energies differ from the ionization energies of gaseous atoms, i.e. information about solid
state binding energies is revealed.

> In X-ray fluorescence spectroscopy the radiation emitted by valence electrons recom- 
bining with core holes created by incident X-ray radiation is measured (Fig. 7.2(b)). This 
type of spectroscopy is frequently used to chemically analyze a sample, or to detect the 
presence of impurity atoms, i.e. different elements have different core/valence excitation 
energies. Peaks in the fluorescence spectrum at frequencies characteristic of individual 
atoms therefore identify the presence of these atoms in the target sample. In
photoemission spectroscopy (PES) core electrons 7 displaced by X-ray radiation are detected
(Fig. 7.2(c)). The fully frequency/angle-resolved measurement of the photo-electron
current, known as angle-resolved photoemission spectroscopy (ARPES), is one of the
most important spectroscopic techniques in the experimental analysis of band structures. 
Auger spectroscopy is based on an interaction process of higher order (Fig. 7.2(d)). In 
this process, a core hole is created by irradiation by either X-rays or high-energy electrons. 
In a secondary process, part of the energy emitted by a recombining valence electron is 
transferred to another valence electron, which then leaves the atom and is detected. 


7 To access valence electrons, soft X-ray radiation or hard UV radiation must be employed.
> Neutron scattering: Thermal neutrons are scattered elastically or inelastically by a
solid state target. Being a neutral particle, the neutron interacts only weakly with solid 
state constituents (i.e. magnetically, through its spin) and hence penetrates deeply into 
the sample. Owing to its particular energy dispersion, the neutron is tailor-made to the 
analysis of low-lying collective excitations (phonons, magnons, etc. - for example the data 
shown on page 78 were obtained by neutron spectroscopy). Just as with X-ray scatter- 
ing, elastic neutron scattering can be employed to obtain crystallographic information. 
Unfortunately, the production of thermal neutrons requires a nuclear reactor, i.e. neutron 
scattering is an extremely expensive experimental enterprise. 

> Magnetic resonance: A sample containing particles of non-vanishing moment is placed
into a static (in practice, a slowly varying) magnetic field, strong enough to cause complete 
magnetic polarization. The sample is then exposed to an AC magnetic field perpendicular 
to the polarizing field. If the AC field frequency is in resonance with the Zeeman energy, 
magnetic transitions are resonantly induced. The observable effect is a strongly enhanced 
radiation loss of the AC field. 

> In nuclear magnetic resonance (NMR) the nuclear spins of the sample are polarized. 
In solid state physics, NMR is applied to obtain information about the magnetic properties 
of the electronic states of the solid. Owing to the hyperfine interaction of the electron spin 
and the nuclear spin, the effective magnetic field seen by the nucleus partially depends on 
the surrounding electron cloud. For example, in metals, the Pauli paramagnetic response 
of the electrons causes a characteristic shift of the spectral lines (as compared with the
NMR spectra of nuclei in uncorrelated environments) known as the Knight shift.
Analysis of this shift yields information about the magnetic properties of the conduction
electrons. Resonance spectroscopy of transitions between spin-polarized electron states,
electron spin resonance (ESR) , is frequently used in chemical analysis and molecular 
physics. 

For discussion of other spectroscopic techniques, e.g. Mössbauer spectroscopy,
positron-electron annihilation spectroscopy (PES), muon scattering, electron
energy loss spectroscopy (EELS), etc., we refer to the literature.
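
Returning to the energy-momentum conservation law stated before the list of techniques, the sketch below shows how a pair of incident and scattered wave vectors translates into the momentum and energy handed over to the sample. It assumes free, non-relativistic neutrons with dispersion $\hbar\omega(\mathbf{k}) = \hbar^2 k^2/2m$; the function names and the example wave vectors are hypothetical and chosen only for illustration.

```python
import numpy as np

HBAR = 1.054571817e-34         # reduced Planck constant, J s
M_NEUTRON = 1.67492749804e-27  # neutron mass, kg
MEV = 1.602176634e-22          # one milli-electronvolt in joules

def neutron_energy(k):
    """Free-neutron energy hbar^2 k^2 / (2 m) in joules; k is a wave vector in 1/m."""
    k = np.asarray(k, dtype=float)
    return HBAR**2 * np.dot(k, k) / (2.0 * M_NEUTRON)

def transfer(k_in, k_out):
    """Momentum transfer k - k' (1/m) and energy transfer hbar(omega - omega') (meV).

    By the conservation law these equal K' - K and hbar(Omega' - Omega),
    i.e. the momentum and energy of the structure excited in the sample.
    """
    dK = np.asarray(k_in, dtype=float) - np.asarray(k_out, dtype=float)
    dE = (neutron_energy(k_in) - neutron_energy(k_out)) / MEV
    return dK, dE

# Inelastic event: the neutron slows down, so the sample gains energy.
dK, dE = transfer([2.0e10, 0.0, 0.0], [1.5e10, 0.5e10, 0.0])
print(dK, dE)

# Elastic event: same |k| before and after, so no energy is transferred.
dK_el, dE_el = transfer([2.0e10, 0.0, 0.0], [0.0, 2.0e10, 0.0])
print(dK_el, dE_el)
```

A wave vector of 2 inverse ångström corresponds to a neutron energy of a few meV, which is the scale of typical phonon and magnon energies; this is the sense in which the neutron dispersion is well matched to low-lying collective excitations.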

Other experimental techniques 

There are a few experimental probes of condensed matter physics that do not fit comfortably 
into the three-fold “transport-thermodynamics-spectroscopy” scheme discussed above. This 
applies, for example, to scanning tunneling microscopy (STM), a technique whose 
development by Binnig and Rohrer in the 1980s has triggered a revolution in the area of 
nanotechnology. 
The basic principle of STM is easily understood: a small tip, kept at a positive
electrostatic potential, is brought in proximity to a surface. When the tip-surface
separation becomes comparable to atomic scales, electrons begin to tunnel from the substrate
onto the tip. The resulting tunnel current is fed into a piezoelectric crystal that in turn
levels the height of the tip. Through this mechanism, the surface-tip separation can be kept
constant, with an accuracy of fractions of typical atomic separations. A horizontal sweep of
the tip then generates an accurate image of the surface profile. For example, Fig. 7.3 shows
an STM image of a carbon nanotube.



Figure 7.3 STM image of a carbon nanotube. (Figure courtesy of C. Dekker.) 


Gerd Binnig 1947- and Heinrich Rohrer 1933-
Awarded one half of the 1986 Nobel Prize (together with E. Ruska, the inventor of the
electron microscope) for “their design of the scanning tunneling microscope.” (Image © The
Nobel Foundation.)


7.2 Linear response theory 

In the previous section, we argued that condensed matter experiments typically probe the
(linear) response of a system to the application of weak perturbations $F'_j$. Such linear
response can be cast in terms of a generalized susceptibility $\chi$, Eq. (7.2). In the following
we try to give the formal expression (7.2) a concrete meaning. Specifically, we relate the
response function $\chi$ to microscopic elements of the theory familiar from previous chapters.
However, before entering this discussion, let us list a few properties of $\chi$ that follow from
common sense reasoning.

> Causality: The generalized forces $F'_j(t')$ cannot cause an effect prior to their action, i.e.
$\chi_{ij}(\mathbf{r},\mathbf{r}';t,t') = 0$ for $t < t'$. Formally, we say that the response is
retarded.

> If the system Hamiltonian does not explicitly depend on time (which will usually be the
case), the response depends only on the difference of the time coordinates,
$\chi_{ij}(\mathbf{r},\mathbf{r}';t,t') = \chi_{ij}(\mathbf{r},\mathbf{r}';t-t')$. In this case
it is convenient to Fourier transform the temporal convolution (7.2), i.e. to express the
response in frequency space:

$$X_i(\mathbf{r},\omega) = \int d^d r' \, \chi_{ij}(\mathbf{r},\mathbf{r}';\omega)\, F'_j(\mathbf{r}',\omega) + O(F'^2). \tag{7.3}$$
The important statement implicit in Eq. (7.3) is that, in the linear response regime, a
(near) monochromatic perturbation acting at a certain frequency $\omega$ will cause a response
of the same frequency. For example, an AC voltage with frequency $\omega$ will drive an AC
current of the same frequency, etc. We can read this statement in reverse to say that, if
the system responds at frequencies $\neq \omega$, we have triggered a strong, nonlinear response.
Indeed, it is straightforward to verify (exercise) that an expansion of the general functional
$R[F']$ to $n$th order in $F'$ generates a response with frequency $n\omega$. 8 According to Eq. (7.3),
a peak in the response $X_i(\omega)$ at a certain frequency $\omega$ indicates a local maximum of the
response function, i.e. the presence of an intrinsic excitation with characteristic frequency
$\omega$.

> For systems that are translationally invariant, the response function depends only on
differences between spatial coordinates,
$\chi_{ij}(\mathbf{r},\mathbf{r}';t-t') = \chi_{ij}(\mathbf{r}-\mathbf{r}';t-t')$. Spatial Fourier
transformation then leads to the relation

$$X_i(\mathbf{q},\omega) = \chi_{ij}(\mathbf{q};\omega)\, F'_j(\mathbf{q},\omega) + O(F'^2). \tag{7.4}$$

By analogy to what was said above about the frequency response, we conclude that a
peak of the function $X_i(\mathbf{q},\omega)$ signals the presence of an excitation with frequency
$\omega$ and momentum $\mathbf{q}$. We thus see that, at least in principle, linear response
measurements are capable of exploring the full dispersion relation of the excitations of a system.
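
The statement that a linear response oscillates at the driving frequency, whereas terms of $n$th order in the force generate the frequency $n\omega$, can be checked numerically. The sketch below is a minimal illustration under simplifying assumptions that are not part of the text: the response is taken to be instantaneous (local in time), and the coefficients `chi1` and `chi2` are hypothetical numbers.

```python
import numpy as np

n = 1024
t = np.arange(n) / n                      # one unit of time, n samples
f_drive = 8                               # drive frequency in cycles per unit time
F = 0.1 * np.cos(2 * np.pi * f_drive * t)  # monochromatic force F'(t)

def spectrum_peaks(signal, threshold=1e-8):
    """Integer frequencies whose Fourier amplitude exceeds `threshold`."""
    amp = np.abs(np.fft.rfft(signal)) / len(signal)
    return [int(k) for k in np.nonzero(amp > threshold)[0]]

chi1, chi2 = 2.0, 5.0                     # hypothetical response coefficients
linear = chi1 * F                         # first order in F'
nonlinear = chi1 * F + chi2 * F**2        # includes a second-order term

print(spectrum_peaks(linear))             # [8]
print(spectrum_peaks(nonlinear))          # [0, 8, 16]
```

The second-order term produces the doubled frequency 16 and, in addition, a static (zero-frequency) component, since $\cos^2(\omega t) = \tfrac{1}{2}[1 + \cos(2\omega t)]$.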

This is as much as one can say on general grounds. In the following we will employ the field 
integral formalism to relate the response function to concrete microscopic properties of the 
system. 

EXERCISE Consider X-ray or neutron radiation probing a crystalline medium whose unit cells
are spanned by vectors $\mathbf{a}_i$, $i = 1, \ldots, d$. Show that the response function $\chi$ shows this periodicity
through the condition $\chi(\mathbf{k},\mathbf{k}';\omega) \propto \delta_{\mathbf{k}-\mathbf{k}'-\mathbf{G}}$, where $\mathbf{G}$ belongs to the reciprocal lattice of the
system. That is, the angle-resolved scattering pattern displays the full periodicity of the reciprocal
lattice and, therefore, of the original lattice. This is, in a nutshell, the main principle behind
spectroscopic crystallography.


7.2.1 Microscopic response theory 

We now set out to relate the response function to the microscopic elements of the theory.
Previously we saw that, in quantum theory, the response signal $X(t)$ should be interpreted
as the expectation value of some (single-particle) operator 9
$\hat{X} = \sum_{aa'} c_a^\dagger X_{aa'} c_{a'}$, where $c_a$


8 While these “side bands” are usually negligible, they may become sizeable in, for example, laser-spectroscopic
experiments. The field intensities reached by laser beams can be so large as to generate frequency-doubled or
-tripled response signals. However, these phenomena, which belong to the realm of nonlinear optics, are beyond
the scope of the present text.

9 For notational clarity, the indices $i$ and $\mathbf{r}$ labeling a multi-component set of local response quantities $X_i(\mathbf{r})$ will
be dropped in this section (similarly with $F'_j(\mathbf{r}')$).
may, as appropriate, represent bosonic or fermionic operators. Within the formalism of the
field integral, the expectation value at imaginary times is thus given by

$$X(\tau) = \sum_{aa'} \langle \bar\psi_a(\tau)\, X_{aa'}\, \psi_{a'}(\tau) \rangle, \tag{7.5}$$


where, as usual, $\langle \cdots \rangle = Z^{-1} \int D(\bar\psi,\psi)\,(\cdots)\exp\{-S[F',\bar\psi,\psi]\}$ is the functional average
over the action describing our system. The action of the system is given by
$S[F',\bar\psi,\psi] = S_0[\bar\psi,\psi] + \delta S'[F',\bar\psi,\psi]$, where $S_0$ is the action of the
unperturbed system and

$$\delta S'[F',\bar\psi,\psi] = \int d\tau\, H_{F'} = \int d\tau\, F'(\tau) \sum_{aa'} \bar\psi_a(\tau)\, X'_{aa'}\, \psi_{a'}(\tau),$$

is the perturbation introduced by the action of the generalized force (cf. Eq. (7.1)).

In practice, it is often convenient (for a better motivation, see the next section) to
represent $X(\tau)$ as a derivative of the free energy functional. To this end, let us formally
couple our operator $\hat{X}$ to a second “generalized force” and define

$$\delta S[F,\bar\psi,\psi] \equiv \int d\tau\, F(\tau)\, \hat{X}(\tau) = \int d\tau\, F(\tau) \sum_{aa'} \bar\psi_a(\tau)\, X_{aa'}\, \psi_{a'}(\tau),$$

as a new element of our action. With
$S[F,F',\bar\psi,\psi] = S_0[\bar\psi,\psi] + \delta S[F,\bar\psi,\psi] + \delta S'[F',\bar\psi,\psi]$,
we then have


$$X(\tau) = -\left.\frac{\delta}{\delta F(\tau)}\right|_{F=0} \ln Z[F,F'],$$

where the notation $Z[F,F'] = \int D(\bar\psi,\psi)\, e^{-S[F,F',\bar\psi,\psi]}$ indicates that the partition function
$Z$ functionally depends on the two generalized forces.

Now, if it were not for the presence of the driving force $F'$, the expectation value $X$
would vanish. On the other hand, we also assume that $F'$ is weak in the sense that a
linear approximation in $F'$ satisfactorily describes the measured value of $X$. Noting that
the formal first-order expansion of a general functional $G[F']$ is given by
$G[F'] \simeq G[0] + \int d\tau' \left.\frac{\delta G[F']}{\delta F'(\tau')}\right|_{F'=0} F'(\tau')$, we can write


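$$X(\tau) \simeq -\int d\tau' \left.\frac{\delta^2}{\delta F(\tau)\,\delta F'(\tau')}\right|_{F=F'=0} \ln Z[F,F']\; F'(\tau').$$

Comparison with Eq. (7.2) then leads to the identification

$$\chi(\tau,\tau') = -\left.\frac{\delta^2}{\delta F(\tau)\,\delta F'(\tau')}\right|_{F=F'=0} \ln Z[F,F'],$$

of the response kernel. Carrying out the derivatives, we find

$$\chi(\tau,\tau') = -Z^{-1} \left.\frac{\delta^2}{\delta F(\tau)\,\delta F'(\tau')}\right|_{F=F'=0} Z[F,F']
+ \left( Z^{-1} \left.\frac{\delta}{\delta F'(\tau')}\right|_{F'=0} Z[0,F'] \right)
\left( Z^{-1} \left.\frac{\delta}{\delta F(\tau)}\right|_{F=0} Z[F,0] \right).$$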
Now, by construction, the second term in large parentheses is the functional expectation
value $\langle \hat{X}(\tau) \rangle$ taken over the unperturbed action. We had assumed that this average vanishes,
so that our preliminary, and still very formal, result for the response function is given by

$$\chi(\tau,\tau') = -Z^{-1} \left.\frac{\delta^2}{\delta F(\tau)\,\delta F'(\tau')}\right|_{F=F'=0} Z[F,F']. \tag{7.6}$$


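Performing the two derivatives, we obtain a more concrete representation of the response
function in terms of a four-point correlation function:

$$\chi(\tau,\tau') = -\langle \hat{X}(\tau)\, \hat{X}'(\tau') \rangle = -\Big\langle \sum_{ab} \bar\psi_a(\tau)\, X_{ab}\, \psi_b(\tau) \sum_{a'b'} \bar\psi_{a'}(\tau')\, X'_{a'b'}\, \psi_{b'}(\tau') \Big\rangle. \tag{7.7}$$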


EXERCISE Directly expand Eq. (7.5) to first order in the generalized force $F'$ to obtain this
expression. As mentioned above, the usefulness of the derivative construction outlined above will
become clear shortly.
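
The derivative construction can be tested in a zero-dimensional toy analog, in which the field integral is replaced by an ordinary integral over two real variables $x$ and $y$ playing the roles of $\hat{X}$ and $\hat{X}'$. The Gaussian weight, the matrix `A`, and the grid parameters below are assumptions made purely for illustration. The sketch compares $-\partial_F \partial_{F'} \ln Z$ at vanishing sources with the correlator $-\langle x y \rangle$, the analogs of Eqs. (7.6) and (7.7).

```python
import numpy as np

A = np.array([[2.0, 0.5],
              [0.5, 1.0]])            # hypothetical positive-definite "action" matrix

grid = np.linspace(-12.0, 12.0, 481)
x, y = np.meshgrid(grid, grid, indexing="ij")
S0 = 0.5 * (A[0, 0] * x**2 + 2 * A[0, 1] * x * y + A[1, 1] * y**2)

def log_Z(F, Fp):
    """ln Z[F, F'] with source F coupled to x and F' coupled to y."""
    weight = np.exp(-S0 - F * x - Fp * y)
    return np.log(weight.sum())       # the grid spacing only adds a constant

def chi_from_derivative(h=1e-2):
    """-d^2 ln Z / dF dF' at F = F' = 0, via a central mixed difference."""
    return -(log_Z(h, h) - log_Z(h, -h) - log_Z(-h, h) + log_Z(-h, -h)) / (4 * h**2)

def chi_from_correlator():
    """-<x y>, averaged with the unperturbed weight exp(-S0)."""
    w = np.exp(-S0)
    return -(w * x * y).sum() / w.sum()

print(chi_from_derivative(), chi_from_correlator(), -np.linalg.inv(A)[0, 1])
```

For a Gaussian weight both expressions reduce to minus the off-diagonal element of the inverse of `A`, which serves as an independent check. Because $\langle x \rangle = \langle y \rangle = 0$ here, the product term that appears when differentiating $\ln Z$ drops out, as in the text.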


INFO Equation (7.7) indicates a connection between two seemingly very different physical
mechanisms. To disclose this relation, let us consider the case where the observed and the driving
operator are equal: $\hat{X}' = \hat{X}$. (Shortly, we will see that the important application to the
electromagnetic response of a system falls into this category.) Using the vanishing of the equilibrium
expectation values, $\langle \hat{X}(\tau) \rangle = 0$, we can then rewrite (7.7) as

$$\chi(\tau,\tau') = -\big\langle \big(\hat{X}(\tau) - \langle \hat{X}(\tau) \rangle\big)\big(\hat{X}(\tau') - \langle \hat{X}(\tau') \rangle\big) \big\rangle. \tag{7.8}$$

This relation is called the fluctuation-dissipation theorem. Indeed, the right-hand side of the
relation clearly describes the quantum thermal fluctuation behavior of (the physical observable
represented through) the operator $\hat{X}$. By contrast, the left-hand side is of dissipative nature. For
example, for the case $\hat{X} = \hat{j}$ (see below), $\chi$ relates to the conductance of the system, i.e. the way
in which the kinetic energy of a charge carrier is dissipated among the intrinsic excitations of
the system.


[Figure: complex time plane; recoverable labels: $i\tau$, cut line.]

We might now proceed to evaluate this function by means of the machinery introduced in the
previous chapters and that is, indeed, how the response function will be computed in practice.
However, before doing this, we must face up to one more conceptual problem. What we are after
is the real-time response $X(t)$ to a real-time dynamical perturbation $F'(t')$. However, our
functional integral formalism produces, naturally, an imaginary-time response $\chi(\tau,\tau')$.
In fact, we have frequently met with this problem before: while we are generally interested
in real-time properties the formalism makes imaginary-time predictions. In previous chapters,
we dealt with this problem by remembering that the imaginary-time setup could be obtained by
analytical continuation $t \to -i\tau$ of the integration contour of a real-time functional
integral. Reversing this “Wick rotation” we argued that a real-time response $X(t)$ can be
extracted from an imaginary-time result $X(\tau)$ by substitution $\tau \to it$.


In the majority of cases, this procedure indeed leads to correct results. However,
sometimes one has to be more careful and that applies, in particular, to our present linear
response calculus. Indeed, the simple substitution $F(\tau) \to F(t) = F(\tau \to it)$ is a crude
shortcut of what mathematically should be a decent analytical continuation. Problems with
this prescription arise when the answer $F(\tau)$ generated by the functional integral contains
singularities in the complex $\tau$-plane. 10 If a fictitious contour interpolating between the
limiting points $\tau$ and $it$ inevitably crosses these singularities (see the figure), the simple
substitution prescription becomes a problem. The upshot of these considerations is that,
before proceeding with the construction of the linear response formalism, we need to develop
a better understanding of the mathematical structure of correlation functions.


7.3 Analytic structure of correlation functions 

It is the purpose of this section to clarify - at last - the connection between imaginary and
real-time correlation functions. Throughout much of this section we will return to the
traditional operator representation, i.e. expressions with circumflexes, $\hat{X}$, represent canonically
quantized operators and $\langle \cdots \rangle = Z^{-1}\, \mathrm{tr}(\cdots \exp\{-\beta[\hat{H} - \mu\hat{N}]\})$ represents the
quantum-thermal expectation value. Restricting ourselves to correlation functions of operators taken
at two different times, 11 the general definition of the imaginary-time correlation
function reads

$$C^\tau_{X_1 X_2}(\tau_1 - \tau_2) \equiv -\langle T_\tau \hat{X}_1(\tau_1)\, \hat{X}_2(\tau_2) \rangle \equiv
-\begin{cases}
\langle \hat{X}_1(\tau_1)\, \hat{X}_2(\tau_2) \rangle, & \tau_1 > \tau_2, \\
\zeta_X \langle \hat{X}_2(\tau_2)\, \hat{X}_1(\tau_1) \rangle, & \tau_2 > \tau_1,
\end{cases} \tag{7.9}$$


where $\hat{X}_i(c, c^\dagger)$ are arbitrary operators represented in terms of either boson or fermion
operators $\{c, c^\dagger\}$. The time-dependence of these operators is defined through the imaginary-time
Heisenberg representation,

$$\hat{X}(\tau) \equiv e^{\tau(\hat{H} - \mu\hat{N})}\, \hat{X}\, e^{-\tau(\hat{H} - \mu\hat{N})}, \tag{7.10}$$

i.e. the imaginary-time analog of the real-time Heisenberg representation familiar from
quantum mechanics. 12 The role of the time-ordering operator $T_\tau$, whose action is defined


10 More precisely, the quantity $F(\tau)$ produced by evaluating the functional integral should be interpreted as a
function defined on the imaginary axis of a complex-time domain. Analytical continuation then leads to a
generalization $F(z)$, where $z$ may take values in some two-dimensional domain of the complex plane. What we
imply when we substitute $F(\tau) \to F(\tau \to it)$ is that the analyticity domain includes the real axis, and that
analytical continuation amounts to a simple re-substitution of the argument. Both assumptions may happen to
be violated.

11 According to our discussion above, correlation functions of this type suffice to explore the linear response of a
system. Although the situation with nonlinear response signals is more complicated, much of what we are going
to say below has general validity.

12 Within imaginary-time theory, it is customary to absorb the chemical potential into the dynamical evolution
of an operator.
through the second equality, 13 is to chronologically order the two operators $\hat{X}_1(\tau_1)$ and
$\hat{X}_2(\tau_2)$.

Notice that for one-body operators $\hat{X}_i = \sum_{aa'} c_a^\dagger X^{(i)}_{aa'} c_{a'}$ the definition above describes
the response functions discussed in the previous section. For $\hat{X}_1 = c_a$, $\hat{X}_2 = c_b^\dagger$, Eq. (7.9)
coincides with the one-body imaginary-time correlation functions (“Green functions”)
discussed earlier in this text, i.e. Eq. (7.9) and its real time descendants to be discussed next
cover most of the correlation functions of relevance in many-body physics.

INFO Although we shall postpone the discussion of the connection to the field integral
formalism for a while, it is instructive to compare the definition (7.9) with the field integral correlation
function (7.7) defined above. Indeed, if the quantities $\hat{X}_i(\tau_i)$ in (7.9) were to be interpreted as
the functional representation of second quantized operators, $\hat{X}(c^\dagger, c) \to X(\bar\psi, \psi)$, the two
correlation functions would coincide. The reason is that the time-ordering operation acting on the
functional representation of an operator pair is redundant,
$T_\tau X_1(\bar\psi(\tau_1), \psi(\tau_1))\, X_2(\bar\psi(\tau_2), \psi(\tau_2)) = X_1(\bar\psi(\tau_1), \psi(\tau_1))\, X_2(\bar\psi(\tau_2), \psi(\tau_2))$
(Exercise: Try to think why). In other words, the correlation
function (7.9) reduces to (7.7) when the operators are represented within the field integral
formalism. (The reason why $T_\tau$ is not redundant for canonically quantized operators is that these
have a non-vanishing commutator/anti-commutator at equal times.)
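
As a small check of the definition (7.9) with the Heisenberg representation (7.10), the sketch below evaluates the one-body correlation function for $\hat{X}_1 = c$, $\hat{X}_2 = c^\dagger$ of a single fermionic level, using explicit $2 \times 2$ matrices. The single-level model and the parameter values `beta` and `xi` (standing for $\epsilon - \mu$) are assumptions chosen here for illustration.

```python
import numpy as np

beta, xi = 2.0, 0.7                  # inverse temperature and eps - mu (hypothetical values)
c = np.array([[0.0, 1.0],
              [0.0, 0.0]])           # annihilation operator in the basis (|0>, |1>)
cdag = c.T
K = xi * (cdag @ c)                  # H - mu N for a single level (diagonal matrix)

def evolve(op, tau):
    """Imaginary-time Heisenberg operator e^{tau K} op e^{-tau K}, cf. Eq. (7.10)."""
    k = np.diag(K)
    return np.diag(np.exp(tau * k)) @ op @ np.diag(np.exp(-tau * k))

def thermal(op):
    """Thermal average tr(op e^{-beta K}) / tr(e^{-beta K})."""
    rho = np.diag(np.exp(-beta * np.diag(K)))
    return np.trace(op @ rho) / np.trace(rho)

def green(tau, zeta=-1.0):
    """C(tau) = -<T_tau c(tau) c^dagger(0)>, cf. Eq. (7.9); zeta = -1 for fermions."""
    if tau > 0:
        return -thermal(evolve(c, tau) @ cdag)
    return -zeta * thermal(cdag @ evolve(c, tau))

tau = 0.5
n_f = 1.0 / (np.exp(beta * xi) + 1.0)            # Fermi function
print(green(tau), -np.exp(-xi * tau) * (1.0 - n_f))
print(green(tau - beta), -green(tau))            # anti-periodicity under tau -> tau - beta
```

The second printed pair illustrates that, for fermionic operators, the time-ordered correlation function changes sign under a shift of its argument by $\beta$.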

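In a manner that is difficult to motivate in advance, we next introduce not one but three
different response functions of a real-time argument. Substituting in Eq. (7.9) real-time
arguments for imaginary times, $\tau \to it$, we obtain the real-time response function

$$C^T_{X_1 X_2}(t_1 - t_2) = -i \langle T_t \hat{X}_1(t_1)\, \hat{X}_2(t_2) \rangle, \tag{7.11}$$

where the factor of $i$ has been introduced for later convenience, $T_t$ chronologically orders
real times, and $\hat{X}(t) \equiv e^{it(\hat{H} - \mu\hat{N})}\, \hat{X}\, e^{-it(\hat{H} - \mu\hat{N})}$ are real-time Heisenberg operators. While
this expression appears to be the “natural” generalization of (7.9), it is not our prime object
of interest. Much more physical significance is carried by the retarded response function

$$C^+_{X_1 X_2}(t_1 - t_2) = -i\, \Theta(t_1 - t_2)\, \langle [\hat{X}_1(t_1), \hat{X}_2(t_2)]_{\zeta_X} \rangle, \tag{7.12}$$

i.e. an object that exists only for times $t_1 > t_2$ (hence the attribute “retarded”); here
$[\hat{A}, \hat{B}]_\zeta \equiv \hat{A}\hat{B} - \zeta \hat{B}\hat{A}$. The
complementary time domain, $t_1 < t_2$, is described by the advanced response function

$$C^-_{X_1 X_2}(t_1 - t_2) = +i\, \Theta(t_2 - t_1)\, \langle [\hat{X}_1(t_1), \hat{X}_2(t_2)]_{\zeta_X} \rangle. \tag{7.13}$$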

13 The sign factor $\zeta_X = \pm 1$ depends on the statistics of $\hat{X}_i$, i.e. $\zeta_X = 1$ if $\hat{X}_i$ are bosonic. Note that the operator
$\hat{X}_i$ is bosonic if $\{c, c^\dagger\}$ are Bose operators or if it is of even order in a fermion algebra. Conversely, $\zeta_X = -1$ if
they are fermionic.

INFO What is the physical meaning of the real-time retarded response function? To
address this question we need to reformulate the linear response arguments given above in the
language of the canonical operator formalism. What we would like to compute is the expectation
value

$$X(t) = \langle \hat{X}^{F'}(t) \rangle, \tag{7.14}$$

building up in response to the presence of a weak perturbation $F'(t)\hat X'$ in the Hamiltonian
$\hat H = \hat H_0 + F'(t)\hat X'$. In Eq. (7.14), the superscript “$F'$” indicates that the time
evolution of $\hat X$ follows the full Hamiltonian (including the perturbation $F'(t)\hat X'$). In
contrast, the angular brackets represent a thermal average
$\langle\cdots\rangle = Z^{-1}\,\mathrm{tr}[(\cdots)\exp\{-\beta\hat H_0\}]$ that does not include the
perturbation.$^{14}$ The philosophy behind this convention is that, somewhere in the distant past,
$t \to -\infty$, the system was prepared in a thermal equilibrium distribution of the unperturbed
Hamiltonian $\hat H_0$. As time evolved, a perturbation $\propto F'(t)$ was gradually switched on
until it began to affect the expectation value of the dynamically evolved operator $\hat X$. (One
need not be irritated by the somewhat artificial definition of the switching on procedure; all it
tells us is that $\hat X'$ acts on a system in thermal equilibrium, an assumption that is not
problematic if the perturbation is sufficiently weak.) To compute the expectation value, it is
convenient to switch to a representation wherein the evolutionary changes due to the action of the
perturbation are separated:

$$ X(t) = \langle (\hat U^{F'})^{-1}(t)\,\hat X(t)\,\hat U^{F'}(t)\rangle, \tag{7.15} $$

where $\hat U^{F'}(t) = \hat U_0^{-1}(t)\,\hat U(t)$, the evolution of $\hat X$ follows the standard
Heisenberg dynamics $\hat X(t) = \hat U_0^{-1}(t)\,\hat X\,\hat U_0(t)$, and $\hat U$ ($\hat U_0$)
generates the time evolution of the full (unperturbed) Hamiltonian. Using the defining equations of
these evolution operators, it is straightforward to verify that $\hat U^{F'}$ obeys the differential
equation $d_t \hat U^{F'}(t) = -iF'(t)\,\hat X'(t)\,\hat U^{F'}(t)$, i.e. the time evolution of
$\hat U^{F'}$ is controlled by the (Heisenberg representation of the) perturbation $\hat X'$.
According to conventional time-dependent quantum mechanical perturbation theory, the solution of
this differential equation (with boundary condition $\hat U^{F'}(t \to -\infty) \to 1$) is given by

$$ \hat U^{F'}(t) = T_t \exp\left(-i\int_{-\infty}^{t} dt'\,F'(t')\,\hat X'(t')\right)
\simeq 1 - i\int_{-\infty}^{t} dt'\,F'(t')\,\hat X'(t') + \cdots. $$

Substituting this result into Eq. (7.15) we obtain

$$ X(t) = -i\int dt'\,\Theta(t-t')\,F'(t')\,\langle[\hat X(t), \hat X'(t')]\rangle
= \int dt'\, C^+_{XX'}(t-t')\,F'(t'), $$

i.e. the retarded response function turns out to generate the linear response of $\hat X$ to the
presence of the perturbation. In other words, the function $C^+$ is our prime object of interest,
while all other correlation functions defined above play the (potentially important) role of
supernumeraries.

We next set out to explore the connection between the different correlation functions defined
above. In doing so, the principal question that should be at the back of our minds is “How
do we obtain the retarded real time function $C^+$ provided we know the imaginary-time
correlation function $C^\tau$?” The key to the answer of this question lies in a highly formal
representation of the correlation functions $C^{\tau,T,+,-}$, known as the Lehmann representation.
This representation is obtained by representing the correlation functions in terms of an exact
eigenbasis $\{|\Psi_\alpha\rangle\}$ of the system: representing the trace entering the thermal
expectation values as $\mathrm{tr}(\cdots) = \sum_\alpha \langle\Psi_\alpha|\cdots|\Psi_\alpha\rangle$,
and inserting a resolution of unity $1 = \sum_\beta |\Psi_\beta\rangle\langle\Psi_\beta|$ between the
two operators appearing in the definition of the correlation function, it is straightforward to
show that, e.g.,$^{15}$

$$ C^T(t) = -iZ^{-1}\sum_{\alpha\beta} X_{1\alpha\beta}X_{2\beta\alpha}\, e^{it\Xi_{\alpha\beta}}
\left(\Theta(t)\,e^{-\beta\Xi_\alpha} + \zeta_X\,\Theta(-t)\,e^{-\beta\Xi_\beta}\right). \tag{7.16} $$

Here, $E_\alpha$ is the eigenvalue corresponding to a state $\Psi_\alpha$ and we have introduced the
shorthand notations $\Xi_\alpha \equiv E_\alpha - \mu N_\alpha$,
$\Xi_{\alpha\beta} \equiv \Xi_\alpha - \Xi_\beta$, and
$X_{\alpha\beta} \equiv \langle\Psi_\alpha|\hat X|\Psi_\beta\rangle$. We next Fourier transform
$C^T$ to find

$$ C^T(\omega) = \int_{-\infty}^{\infty} dt\, C^T(t)\,e^{i\omega t - \eta|t|}
= Z^{-1}\sum_{\alpha\beta} X_{1\alpha\beta}X_{2\beta\alpha}
\left[\frac{e^{-\beta\Xi_\alpha}}{\omega + \Xi_{\alpha\beta} + i\eta}
- \zeta_X\frac{e^{-\beta\Xi_\beta}}{\omega + \Xi_{\alpha\beta} - i\eta}\right], $$

where the convergence-generating factor $\eta$ - which will play an important role throughout! -
has been introduced to make the Fourier representation well-defined.$^{16}$

14 To simplify the notation, the chemical potential has been absorbed into the definition of $\hat H$.

Equation (7.16) is the Lehmann representation of the real time correlation function. What 
is the use of this representation? Clearly, Eq. (7.16) will be of little help for any practical 
purposes; the equation makes explicit reference to the exact eigenfunctions/states of the 
system. Should we have access to these objects, we would have a full solution of the problem 
anyway. Rather, the principal purpose of spectral resolutions such as (7.16) is to reveal exact 
connections between different types of correlation functions and the analytical structure of 
these objects in general. To do so, we first need to compute the Lehmann representation 
of the other correlation functions. Proceeding as with the real time function above, it is 
straightforward to show that 


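$$ \left.\begin{matrix} C^T(\omega) \\ C^+(\omega) \\ C^-(\omega) \end{matrix}\right\}
= Z^{-1}\sum_{\alpha\beta} X_{1\alpha\beta}X_{2\beta\alpha}
\left[\frac{e^{-\beta\Xi_\alpha}}{\omega + \Xi_{\alpha\beta}
\left\{\begin{matrix} + \\ + \\ - \end{matrix}\right\} i\eta}
- \zeta_X\frac{e^{-\beta\Xi_\beta}}{\omega + \Xi_{\alpha\beta}
\left\{\begin{matrix} - \\ + \\ - \end{matrix}\right\} i\eta}\right]. \tag{7.17} $$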


From this result, a number of important features of the correlation functions can be
readily inferred. Anticipating the analytical structures alluded to above, we should think of
$C^{T,+,-}(z)$ as functions of a complex variable $z$. (The representations above apply to
$C^{T,+,-}(z = \omega)$ where $\omega$ is restricted to the real axis.) This extended interpretation
allows us to view $C^{T,+,-}$ as complex functions with singularities in the immediate vicinity of
the real axis. More specifically:

> The retarded correlation function $C^+$ has singularities for $z = -\Xi_{\alpha\beta} - i\eta$
slightly below the real axis. It is, however, analytic in the entire upper complex half plane
$\mathrm{Im}(z) > 0$.

> Conversely, the advanced correlation function $C^-$ has singularities above the real axis.
It is analytic in the lower half plane $\mathrm{Im}(z) < 0$. Notice that $C^+$ and $C^-$ are
connected through complex conjugation,

$$ C^+(\omega) = [C^-(\omega)]^*. \tag{7.18} $$

> The time-ordered correlation function has singularities on either side of the real axis
(which makes it harder to analyze).

> The position of the singularities in the vicinity of the real axis contains important
information about the fundamental excitations of the system (see Fig. 7.4). For example,
consider the case where $\hat X_1 = c_a^\dagger$ and $\hat X_2 = c_b$ are one-particle creation and
annihilation operators (no matter whether bosonic or fermionic). In this case,
$N_\alpha - N_\beta \equiv \Delta N = 1$ (independent of the state indices $\alpha, \beta$) and
$E_\alpha - E_\beta$ is of the order of the single-particle energies of the system. (For a
non-interacting system, $E_\alpha - E_\beta$ strictly coincides with the single-particle energies -
exercise: why?) That is, the singularities of $C^{T,+,-}$ map out the single-particle spectrum of
the system. This can be understood intuitively by remembering the meaning of the one-particle
correlation function as the amplitude for creation of a state $|a\rangle$ followed by the
annihilation of a state $|b\rangle$ at some later time. It is clear that the time Fourier transform
of the amplitude, $|a\rangle \to |b\rangle$, becomes “large” when the phase ($\sim \omega t$) of the
Fourier argument is in resonance with an eigenphase $\sim (E_\alpha - E_\beta)t$ supported by the
system. (If you do not find this statement plausible, explore the simple example of a plane wave
Hamiltonian.) Similarly, for a two-particle correlation function,
$\hat X_1 \sim c_a^\dagger c_b$, the energies $E_\alpha - E_\beta$ describe the spectrum (the
“energy cost”) of two-particle excitations, etc. Notice that the single-particle spectrum can be
continuous (in which case the functions $C^{T,+,-}$ have cuts parallel to the real axis), or
discrete (isolated poles).

> Once one of the correlation functions is known, all others follow straightforwardly from
a simple recipe: using the familiar Dirac identity,

$$ \lim_{\eta \searrow 0} \frac{1}{x \pm i\eta} = \mp i\pi\,\delta(x) + P\,\frac{1}{x}, \tag{7.19} $$

where $P$ is the principal part, it is a straightforward matter to show that (exercise!)

$$ \mathrm{Re}\, C^T(\omega) = \mathrm{Re}\, C^+(\omega) = \mathrm{Re}\, C^-(\omega) \tag{7.20} $$

and

$$ \mathrm{Im}\, C^T(\omega) = \pm\,\mathrm{Im}\, C^\pm(\omega) \times
\begin{cases} \coth(\beta\omega/2), & \text{bosons}, \\ \tanh(\beta\omega/2), & \text{fermions}, \end{cases} \tag{7.21} $$

i.e. the information stored in the three different functions is essentially equivalent.

Figure 7.4 Illustrating the singularities of advanced and retarded correlation functions in the
complex plane. The points denote poles and the lines branch cuts.

15 Wherever no confusion may arise, we omit the operator subscript $C_{XX'}$ carried by the
correlation functions.

16 Indeed, we can attach physical significance to this factor. The switching on procedure outlined
above can be implemented by attaching a small damping term $\exp(-|t|\eta)$ to an otherwise purely
oscillatory force. If we absorb this factor into the definition of all Fourier integrals,
$\int dt\,(F(t)e^{-|t|\eta})\,e^{i\omega t}(\cdots) \to \int dt\, F(t)\,(e^{-|t|\eta}e^{i\omega t}\cdots)$,
we arrive at the Fourier regularization mentioned above.

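After our discussion of the real-time correlation functions, the analysis of the imaginary-time
function $C^\tau$ is straightforward. The imaginary-time analog of Eq. (7.16) reads

$$ C^\tau(\tau) = -Z^{-1}\sum_{\alpha\beta} X_{1\alpha\beta}X_{2\beta\alpha}\, e^{\Xi_{\alpha\beta}\tau}
\left(\Theta(\tau)\,e^{-\beta\Xi_\alpha} + \zeta_X\,\Theta(-\tau)\,e^{-\beta\Xi_\beta}\right). \tag{7.22} $$

Inspection of this representation for positive and negative times shows that $C^\tau$ acquires the
periodicity properties of the operators $\hat X_{1,2}$:

$$ C^\tau(\tau) = \zeta_X\, C^\tau(\tau + \beta), \qquad \tau < 0. \tag{7.23} $$

Consequently $C^\tau$ can be expanded in a Matsubara Fourier representation just like a
conventional operator, $C^\tau(i\omega_n) = \int_0^\beta d\tau\, C^\tau(\tau)\,e^{i\omega_n\tau}$,
where, depending on the nature of the operators $\hat X_{1,2}$, $\omega_n$ may be a bosonic or a
fermionic Matsubara frequency. Applying this transformation to the Lehmann representation (7.22),
we obtain

$$ C^\tau(i\omega_n) = Z^{-1}\sum_{\alpha\beta} \frac{X_{1\alpha\beta}X_{2\beta\alpha}}{i\omega_n + \Xi_{\alpha\beta}}
\left[e^{-\beta\Xi_\alpha} - \zeta_X\,e^{-\beta\Xi_\beta}\right]. \tag{7.24} $$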

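Our final task is to relate the four correlation functions defined through Eqs. (7.17) and
(7.24) to each other. To this end, we define the “master function”

$$ C(z) = Z^{-1}\sum_{\alpha\beta} \frac{X_{1\alpha\beta}X_{2\beta\alpha}}{z + \Xi_{\alpha\beta}}
\left[e^{-\beta\Xi_\alpha} - \zeta_X\,e^{-\beta\Xi_\beta}\right], \tag{7.25} $$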

depending on a complex argument $z$. When evaluated for $z = \omega^+, \omega^-, i\omega_n$,
respectively (with $\omega^\pm \equiv \omega \pm i0$), the function $C(z)$ coincides with
$C^+, C^-, C^\tau$. Further, $C(z)$ is analytic everywhere except for the real axis. This knowledge
suffices to construct the relation between different correlation functions that was sought. Suppose
then we had succeeded in computing $C^\tau(i\omega_n) = C(z = i\omega_n)$ for all positive Matsubara
frequencies.$^{17}$ Further, let us assume that we had managed to find an analytic extension of
$C(z = i\omega_n) \to C(z)$ into the entire upper complex half plane $\mathrm{Im}\, z > 0$. The
evaluation of this extension on the infinitesimally shifted real axis $z = \omega + i0$ then
coincides with the retarded Green function $C^+(\omega)$ (see figure below). In other words,

To find $C^+(\omega)$ we need to (i) compute $C^\tau(i\omega_n)$ for all positive Matsubara
frequencies (e.g. by means of the thermal field integral) and then (ii) continue
the result down to the real axis, $i\omega_n \to \omega + i0$.

17 Keep in mind that, in practical computations of this type, we will not proceed through the
Lehmann representation.


The advanced Green function $C^-$ is obtained analogously, by analytic extension of the thermal
correlation function $C^\tau(i\omega_n)$, $\omega_n < 0$, to frequencies with a negative offset,
$\omega - i0$.

These statements follow from a theorem of complex function theory stating that two analytic
functions $F_1(z)$ and $F_2(z)$ coincide if $F_1(z_n) = F_2(z_n)$ on a sequence $\{z_n\}$ with a
limit point in the domain of analyticity. (In our case $i\omega_n \to i\infty$ is the limit point.)
From inspection of (7.25) we already know that $F_1(i\omega_n) \equiv C^+(\omega \to i\omega_n)$
coincides with $F_2(i\omega_n) = C^\tau(i\omega_n)$. Thus any analytic extension of $C^\tau$ must
coincide with $C^+$ everywhere in the upper complex half plane, including the infinitesimally
shifted real axis.
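The two-step recipe can be tried out on the simplest conceivable case. The following is only a minimal numerical sketch, assuming a single fermionic level with $\hat X_1 = c$, $\hat X_2 = c^\dagger$ and energy $\xi$ (measured from the chemical potential), for which $C^\tau(\tau) = -(1 - n_F(\xi))\,e^{-\xi\tau}$ on $0 < \tau < \beta$. The function names and parameter values are illustrative choices, not part of the text. The code integrates $C^\tau$ numerically, compares the result with the master function $C(z) = (z - \xi)^{-1}$ at $z = i\omega_n$, and then evaluates the same function at $z = \omega + i\eta$.

```python
import numpy as np

def matsubara_from_tau(xi, beta, n, num=20001):
    """Numerically integrate C^tau(tau) e^{i w_n tau} over [0, beta] for one fermionic level."""
    tau = np.linspace(0.0, beta, num)
    w_n = (2 * n + 1) * np.pi / beta           # fermionic Matsubara frequency
    n_f = 1.0 / (np.exp(beta * xi) + 1.0)      # Fermi occupation of the level
    c_tau = -(1.0 - n_f) * np.exp(-xi * tau)   # C^tau(tau) = -<c(tau) c^dagger(0)>, 0 < tau < beta
    integrand = c_tau * np.exp(1j * w_n * tau)
    h = tau[1] - tau[0]
    # composite trapezoidal rule, written out explicitly
    value = h * (integrand.sum() - 0.5 * (integrand[0] + integrand[-1]))
    return value, w_n

def master(z, xi):
    """Master function C(z) of Eq. (7.25) for the single level: one pole at z = xi."""
    return 1.0 / (z - xi)

xi, beta, eta = 0.3, 2.0, 0.05
c_iwn, w_n = matsubara_from_tau(xi, beta, n=0)
print(abs(c_iwn - master(1j * w_n, xi)))       # small: C^tau(i w_n) agrees with C(z = i w_n)
# step (ii): continue i w_n -> omega + i*eta and read off -2 Im C^+ at omega = xi
print(-2.0 * master(xi + 1j * eta, xi).imag)   # equals 2 / eta, the peak of a Lorentzian
```

Here a finite $\eta$ stands in for the infinitesimal offset $i0$; the continuation itself is trivial only because the closed form of $C^\tau(i\omega_n)$ is known for all frequencies.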

EXERCISE Writing $z = \omega \pm i\eta$, transform the spectral representation (7.25) back to the
time domain: $C(t) = \frac{1}{2\pi}\int d\omega\, e^{-i\omega t}\, C(\omega \pm i\eta)$. Convince
yourself that, for $\mathrm{Im}(z)$ positive (negative), the temporal correlation function $C(t)$
contains a $\Theta$-function $\Theta(t)$ ($\Theta(-t)$). (Hint: Make use of Cauchy’s theorem.)
Importantly, the presence of this constraint does not hinge on $\eta$ being infinitesimal. It even
survives generalization to a frequency-dependent function $\eta(\omega) > 0$. (For the physical
relevance of this statement, see below.) All that matters is that, for $\eta > 0$, the function
$C(\omega \pm i\eta)$ is analytic in the upper (lower) complex half plane. This observation implies
a very important connection between analyticity and causality: temporal correlation functions
whose frequency representation is analytic in the upper (lower) complex half plane are causal
(anticausal). (A time-dependent function is called “(anti)causal” if it vanishes for (positive)
negative times.)
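The statement of the exercise can also be checked numerically before it is proven. The sketch below is an illustration under simplifying assumptions: a single pole term $C(z) = (z - \epsilon)^{-1}$ of the spectral representation, a finite $\eta$, and a frequency integral truncated at a large cutoff (all parameter values are arbitrary choices). For the upper sign the result should approach $-i\,\Theta(t)\,e^{-i\epsilon t - \eta t}$; for the lower sign it should vanish at positive times.

```python
import numpy as np

def time_domain(t, eps=1.0, eta=0.5, sign=+1, cutoff=2000.0, step=0.01):
    """C(t) = (1/2pi) int dw e^{-i w t} / (w + sign*i*eta - eps), truncated to |w| < cutoff."""
    w = np.arange(-cutoff, cutoff + step, step)
    f = np.exp(-1j * w * t) / (w + sign * 1j * eta - eps)
    # trapezoidal rule; the slowly decaying 1/w tail limits the accuracy to about 1e-4 here
    integral = step * (f.sum() - 0.5 * (f[0] + f[-1]))
    return integral / (2.0 * np.pi)

exact_retarded = -1j * np.exp(-1j * 1.0 - 0.5)   # -i e^{-i eps t - eta t} at t = 1
print(abs(time_domain(+1.0) - exact_retarded))   # small: causal function, non-zero for t > 0
print(abs(time_domain(-1.0)))                    # small: vanishes for t < 0
print(abs(time_domain(+1.0, sign=-1)))           # small: the advanced counterpart vanishes for t > 0
```

Nothing in the check relies on $\eta$ being small, in line with the remark made in the exercise.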

How is the continuation process, required to find the retarded correlation function, carried
out in practice? Basically, the answer follows from what was said above. If we know the
correlation function $C^\tau(i\omega_n)$ for all positive Matsubara frequencies, and if that
function remains analytic upon substitution $C^\tau(i\omega_n \to z)$ of a general element of the
full complex half plane, the answer is simple: we merely have to substitute
$i\omega_n \to \omega + i0$ into our result to obtain the retarded correlation function. Sometimes,
however, we simply do not know $C^\tau(i\omega_n)$ for all positive frequencies. (For example, we
may be working within an effective low-energy theory whose regime of validity is restricted to
frequencies $\omega_n < \omega^*$ smaller than some cut-off frequency.) In this case, we are in
serious trouble. Everything then hinges on finding a “meaningful” model function that can be
extended to infinity and whose evaluation for small frequencies $\omega_n < \omega^*$ coincides
with our result; there are no generally applicable recipes for how to deal with such situations.

INFO As a special case of great practical importance, let us briefly explore the non-interacting
single-particle Green function, i.e. the single-particle correlation function $\hat X_1 = c_a$,
$\hat X_2 = c_a^\dagger$ for a non-interacting system. (We assume that $\{|a\rangle\}$ is an
eigenbasis of the one-particle Hamiltonian.) As expected, these correlation functions assume a
particularly simple form. In the non-interacting case, the eigenstates
$|\alpha\rangle = |a_1, a_2, \ldots\rangle$ are symmetric or antisymmetric combinations of
single-particle eigenstates $|a_i\rangle$. Their energies are sums of the single-particle energies,
$E_\alpha = \epsilon_{a_1} + \epsilon_{a_2} + \cdots$. Using the fact that
$E_\beta = E_\alpha + \epsilon_a$, one finds that the correlation function acquires the simple form

$$ C(z)\big|_{\hat X_1 = c_a,\, \hat X_2 = c_a^\dagger} \equiv G_a(z) = \frac{1}{z - \xi_a},
\qquad \xi_a \equiv \epsilon_a - \mu, $$

i.e. the partition function entering the definition of the general correlation function cancels
against the thermally weighted summation over $|\alpha\rangle$ (check!). Notice that the thermal
version of this Green function, $G_a(i\omega_n) = (i\omega_n - \xi_a)^{-1}$, appeared previously as
the fundamental building block of perturbation theory. This is, of course, no coincidence: within
the formalism of the field integral, the Green function appeared as the functional expectation
value $\langle\psi_{a,n}\bar\psi_{a,n}\rangle_0$ taken with respect to the Gaussian non-interacting
action. But this object is just the functional representation of the operator correlation function
considered above.

Building on this representation, it is customary to introduce a Green function operator
through the definition

$$ \hat G(z) \equiv \frac{1}{z + \mu - \hat H}. $$

By design, the eigenvalues of this operator - which are still functions of $z$ - are given by the
correlation function $G_a(z)$ above. Numerous physical observables can be compactly represented
in terms of the operator Green function. For example, using Eq. (7.19), it is straightforward to
verify that the single-particle density of states of a non-interacting system is obtained as

$$ \rho(\epsilon) = -\frac{1}{\pi}\,\mathrm{Im}\,\mathrm{tr}\,\hat G^+(\epsilon), \tag{7.26} $$

by taking the trace of the retarded Green function (operator).

To illustrate the procedure of analytic continuation, let us consider a few elementary
examples.

1. For the single-particle Green function ($\hat X_1 = c_a$, $\hat X_2 = c_a^\dagger$) of an
elementary excitation with energy $\epsilon_a$, $G_a(\omega_n) = (i\omega_n - \xi_a)^{-1}$, the
continuation amounts to a mere substitution,

$$ G_a^+(\omega) = \frac{1}{\omega + i0 - \xi_a}. $$

2. We have seen that quasi-particle interactions lead to the appearance of a - generally
complex - self-energy $\Sigma(z)$: $G_a(\omega_n) \to (i\omega_n - \xi_a - \Sigma(i\omega_n))^{-1}$,
where we have simplified the notation by suppressing the potential dependence of the self-energy
on the Hilbert space index $a$. Extension down to the real axis leads to the relation

$$ G_a^+(\omega) = \frac{1}{\omega^+ - \xi_a - \Sigma(\omega^+)}, \tag{7.27} $$

where $\omega^+ \equiv \omega + i0$ and $\Sigma(\omega^+)$ is the analytic continuation of the
function $\Sigma(z)$ to the real axis. Although the specific structure of the self-energy depends
on the problem under consideration, a few statements can be made in general. Specifically,
decomposing the self-energy into real and imaginary parts, we have

$$ \mathrm{Re}\,\Sigma(\omega^+) = +\mathrm{Re}\,\Sigma(\omega^-), \qquad
\mathrm{Im}\,\Sigma(\omega^+) = -\mathrm{Im}\,\Sigma(\omega^-) < 0. \tag{7.28} $$

In words, the self-energy function has a cut on the real axis. Upon crossing the cut, its
imaginary part changes sign. This important feature of the self-energy can be understood
from different perspectives. Formally, it follows from Eq. (7.18) relating the retarded
and advanced Green functions through complex conjugation. More intuitively, the sign
dependence of the imaginary part can be understood as follows. Suppose we start from
a non-interacting imaginary-time formalism and gradually switch on interactions. The
(Landau) principle of adiabatic continuity implies that nowhere in this process must the
Green function - alias the propagator of the theory - become singular. This implies,
in particular, that the combination $i(\omega_n - \mathrm{Im}\,\Sigma(i\omega_n))$ must not become
zero, lest the dangerous real axis of the energy denominator be touched. The safeguard preventing
the vanishing of the imaginary part of the energy denominator is that $-\mathrm{Im}\,\Sigma$ and
$\omega_n$ have the same sign. Of course, this feature can be checked order by order in
perturbation theory. Decomposing the self-energy into real and imaginary parts,
$\Sigma = \Sigma' + i\Sigma''$, and transforming $G^+(\omega)$ back to the time domain we obtain

$$ G^+(t) = \int\frac{d\omega}{2\pi}\, e^{-i\omega t}\, G^+(\omega)
\approx -i\,e^{-it(\xi_a + \Sigma') + t\Sigma''}\,\Theta(t), $$

where we have made the (over)simplifying assumption that the dependence of the self-energy
operator on $\omega$ is negligible: $\Sigma(\omega) \approx \Sigma$.


EXERCISE Check the second equality above.


If we interpret $G^+(t)$ as the amplitude for propagation in the state $|a\rangle$ during a time
interval $t$, and $|G^+|^2$ as the associated probability density, we observe that the probability
to stay in state $|a\rangle$ decays exponentially, $|G^+|^2 \propto e^{2t\Sigma''}$, i.e.
$-2\Sigma'' \equiv \tau^{-1}$ can be identified as the inverse of the effective lifetime $\tau$ of
state $|a\rangle$. The appearance of a finite lifetime expresses the fact that, in the presence of
interactions, single-particle states no longer represent stable objects, but rather tend to decay
into the continuum of correlated many-body states. This picture will be substantiated in
Section 7.3.1 below.

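3. Let us apply Eq. (7.26) to compute the BCS quasi-particle DoS of a superconductor.
In Section 6.4 we saw that the thermal Gor’kov Green function of a superconductor with
spatially constant real order parameter is given by
$\hat G(i\omega_n) = [i\omega_n - (\hat H - \mu)\sigma_3 - \Delta\sigma_1]^{-1}$.
Switching to an eigenrepresentation and inverting the Pauli matrix structure, we obtain

$$ -\frac{1}{\pi}\,\mathrm{tr}\,\hat G(i\omega_n)
= \frac{1}{\pi}\sum_a \mathrm{tr}\left[\frac{i\omega_n + \xi_a\sigma_3 + \Delta\sigma_1}{\omega_n^2 + \xi_a^2 + \Delta^2}\right]
= \frac{2i\omega_n}{\pi}\sum_a \frac{1}{\omega_n^2 + \xi_a^2 + \Delta^2}. $$

Next, performing our standard change from a summation over eigenenergies to an integral, we
arrive at

$$ -\frac{1}{\pi}\,\mathrm{tr}\,\hat G(i\omega_n)
\simeq \frac{2i\omega_n\nu}{\pi}\int d\xi\,\frac{1}{\omega_n^2 + \xi^2 + \Delta^2}
= \frac{2i\omega_n\nu}{\sqrt{\omega_n^2 + \Delta^2}}, $$

where, as usual, $\nu$ denotes the normal density of states at the Fermi level. This is the
quantity we need to continue to real frequencies. To this end we adopt the standard
convention whereby the cut of the square root function is on the negative real axis, i.e.
$\sqrt{-r + i0} = -\sqrt{-r - i0} = i\sqrt{|r|}$ for $r$ positive real. Then,

$$ -\frac{1}{\pi}\,\mathrm{tr}\,\hat G(i\omega_n \to \epsilon^+)
= \frac{2\epsilon^+\nu}{\sqrt{-(\epsilon^+)^2 + \Delta^2}}
\simeq \frac{2\epsilon\nu}{\sqrt{-\epsilon^2 - i0\,\mathrm{sgn}(\epsilon) + \Delta^2}}, $$

where we anticipate that the infinitesimal offset of $\epsilon$ in the numerator is irrelevant
(trace its fate!) and make use of the fact that, for $\epsilon$ approaching the real axis, only the
sign of the imaginary offset matters:
$(\epsilon + i0)^2 \simeq \epsilon^2 + 2i0\,\epsilon \simeq \epsilon^2 + i0\,\mathrm{sgn}(\epsilon)$.
Finally, taking the imaginary part of that expression, we arrive at the standard BCS form

$$ \rho(\epsilon) = \mathrm{Im}\,\frac{2\epsilon\nu}{\sqrt{-\epsilon^2 - i0\,\mathrm{sgn}(\epsilon) + \Delta^2}}
= \begin{cases} 0, & |\epsilon| < \Delta, \\[4pt]
\dfrac{2|\epsilon|\nu}{\sqrt{\epsilon^2 - \Delta^2}}, & |\epsilon| > \Delta. \end{cases} $$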
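The continuation of the third example can be mimicked numerically by replacing the infinitesimal offset with a small but finite number. The following sketch is an illustration, not the procedure of the text: the function name `bcs_dos` and the values of `nu`, `delta` and `eta` are assumptions made for the example. It relies on the fact that NumPy's complex square root uses the same branch convention as above, $\sqrt{-r \pm i0} = \pm i\sqrt{r}$.

```python
import numpy as np

def bcs_dos(eps, delta, nu=1.0, eta=1e-9):
    """Im[ 2 z nu / sqrt(delta^2 - z^2) ] at z = eps + i*eta, i.e. the continued trace formula."""
    z = np.asarray(eps, dtype=float) + 1j * eta   # shifted real axis, eta stands in for i0
    # np.sqrt on complex input takes the principal branch (cut along the negative real axis)
    return (2.0 * nu * z / np.sqrt(delta**2 - z**2)).imag

energies = np.array([-2.0, 0.5, 2.0])
print(bcs_dos(energies, delta=1.0))   # gap for |eps| < delta, 2 nu |eps| / sqrt(eps^2 - delta^2) outside
```

Increasing `eta` rounds off the square-root singularity at $|\epsilon| = \Delta$, which is a common phenomenological way of modelling a finite quasi-particle lifetime.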


7.3.1 Sum rules and other exact identities 

In the next section we apply the analytical structures discussed above to construct a powerful 
theory of real time linear response. However, before doing so, let us stay for a moment 
with the formal Lehmann representation to disclose a number of exact identities, or “sum 
rules,” obeyed by the correlation functions introduced in the previous section. Admittedly, 
this addition to our discussion above does not directly relate to the formalism of linear 
response, and readers wishing to proceed in a more streamlined manner are invited to skip 
this section at first reading. 

In fact, the formulae we are going to collect are not specific to any particular context and 
that is precisely their merit: identities based on the analytical structure of the Lehmann 
representation are exact and enjoy general applicability. They can be used (a) to obtain full 
knowledge of a correlation function from fragmented information - e.g. we saw in Eq. (7.20) 
and (7.21) how all three real time correlation functions can be deduced once any one of them 
is known - and, equally important, (b) to gauge the validity of approximate calculations. 
The violation of an exact identity within an approximate analysis is usually an indication 
of serious trouble, i.e. such deviations mostly lead to physically meaningless results. 


The spectral (density) function 

We begin by considering an object that carries profound physical significance in its own 
right, especially in the area of strongly correlated fermion physics, i.e. (—2 times) the imag- 
inary part of the retarded correlation function, 


$$ A(\omega) \equiv -2\,\mathrm{Im}\, C^+(\omega). \tag{7.29} $$

Equation (7.29) defines the spectral function or spectral density function. Using
Eq. (7.19) and the Lehmann representation (7.17) it is straightforward to verify that it has
the spectral decomposition

$$ A(\omega) = 2\pi Z^{-1}\sum_{\alpha\beta} X_{1\alpha\beta}X_{2\beta\alpha}
\left[e^{-\beta\Xi_\alpha} - \zeta_X\,e^{-\beta\Xi_\beta}\right]\delta(\omega + \Xi_{\alpha\beta}). \tag{7.30} $$

INFO To understand the physical meaning of the spectral function let us consider the
case where $\hat X_{1,2}$ are single-particle creation/annihilation operators, $\hat X_1 = c_a$,
$\hat X_2 = c_a^\dagger$. (We assume that the non-interacting part $\hat H_0$ of the Hamiltonian
$\hat H_0 + \hat V$ is diagonalized in the basis $\{|a\rangle\}$.) For a non-interacting problem,
$\hat V = 0$, it is then straightforward to show that
$A_a(\omega) = 2\pi\,\delta(\omega - \xi_a)$, i.e. the spectral function is singularly peaked at the
single-particle energy (measured from the chemical potential) of the state $|a\rangle$. Here, the
subscript “$a$” indicates that we are dealing with a spectral function defined for a pair of
one-body operators $c_a, c_a^\dagger$.

Heuristically, the singular structure of $A_a(\omega)$ can be understood by observing that, in the
non-interacting case, the state $c_a^\dagger|\alpha\rangle$ obtained by adding a single particle
$|a\rangle$ to the many-particle state $|\alpha\rangle$ is, again, an eigenstate of the system. In
particular, it is orthogonal to all states but itself, whence the eigenstate summation over
$|\beta\rangle$ contains only a single non-vanishing term. We say that the “spectral weight”
carried by the (unit-normalized) state $c_a^\dagger|\alpha\rangle$ is concentrated on a single
eigenstate of the system.

What happens to this picture if interactions are restored? In this case, the addition
$c_a^\dagger|\alpha\rangle$ of a single-particle state to a many-body eigenstate will, in general,
no longer be an eigenstate of the system. In particular, there is no reason to believe that this
state is orthogonal to all but one $(N+1)$-particle states $|\beta\rangle$. We have to expect that
the spectral weight carried by the (still unit-normalized) state $c_a^\dagger|\alpha\rangle$ gets
distributed over many, potentially a continuum, of states $|\beta\rangle$.

It is instructive to explore the consequences of this phenomenon in the representation (7.27)
where the effect of interactions has been lumped into a self-energy operator $\Sigma$. Taking the
imaginary part of this expression, we find

$$ A_a(\omega) = -2\,\frac{\Sigma''(\omega)}{(\omega - \xi_a - \Sigma'(\omega))^2 + (\Sigma''(\omega))^2}, $$

where $\Sigma'$ and $\Sigma''$ denote the real and imaginary part of the self-energy, respectively,
and we have neglected the infinitesimal imaginary offset of $\omega^+$ in comparison with the
finite imaginary contribution $i\Sigma''$. The result above suggests that the net effect of
interactions is an effective shift of the single-particle energy
$\epsilon_a \to \epsilon_a + \Sigma'(\omega)$ by the real part of the self-energy operator;
interactions lead to a distortion of the single-particle energy spectrum, an effect that follows,
for example, from straightforward perturbative reasoning. More importantly, the $\delta$-function
obtained in the non-interacting case gets smeared into a Lorentzian (see Fig. 7.5). In a sense, the
spectral weight carried by the many-body states $c_a^\dagger|\alpha\rangle$ is distributed over a
continuum of neighboring states, wherefore the spectral function loses its singular character. The
width of the smearing interval is proportional to the imaginary part of the self-energy and,
therefore, to the inverse of the lifetime $\tau$ discussed in the previous section.

Notice that the smeared spectral function still obeys the normalization condition,
$\int\frac{d\omega}{2\pi}\, A_a(\omega) = 1$, as in the non-interacting case.$^{18}$ This suggests
an interpretation of $A$ as a probability measure describing in what way the spectral weight
carried by the state $c_a^\dagger|\alpha\rangle$ is spread out over the continuum of many-body
states $|\beta\rangle$. To put that interpretation onto a firm basis, not bound to the self-energy
representation above, we consider the spectral decomposition (7.30). The positivity of all terms
contributing to the right-hand side of the equation implies that $A_a(\omega) > 0$, a condition
necessarily obeyed by any probability measure. To verify the general validity of the
unit-normalization condition one may integrate Eq. (7.30) over $\omega$:

$$ \begin{aligned}
\int\frac{d\omega}{2\pi}\, A_a(\omega)
&= Z^{-1}\sum_{\alpha\beta} c_{a\,\alpha\beta}\, c^\dagger_{a\,\beta\alpha}
\left[e^{-\beta\Xi_\alpha} - \zeta_c\, e^{-\beta\Xi_\beta}\right] \\
&= Z^{-1}\Big(\sum_\alpha \langle\alpha|c_a c_a^\dagger|\alpha\rangle\, e^{-\beta\Xi_\alpha}
- \zeta_c \sum_\beta \langle\beta|c_a^\dagger c_a|\beta\rangle\, e^{-\beta\Xi_\beta}\Big) \\
&= Z^{-1}\sum_\alpha e^{-\beta\Xi_\alpha}\,
\langle\alpha|\underbrace{c_a c_a^\dagger - \zeta_c\, c_a^\dagger c_a}_{[c_a, c_a^\dagger]_{\zeta_c} = 1}|\alpha\rangle
= Z^{-1}\underbrace{\sum_\alpha e^{-\beta\Xi_\alpha}}_{Z} = 1.
\end{aligned} $$

Positivity and unit-normalization of $A_a(\omega)$ indeed suggest that this function measures the
distribution of spectral weight over the many-body continuum.

18 Strictly speaking, we can integrate $A$ only if the variance of $\Sigma(\omega)$ over the interval
$[\xi_a + \Sigma' - \Sigma'', \xi_a + \Sigma' + \Sigma'']$ in which the Lorentzian is peaked is
negligible (but see below).

Figure 7.5 Illustrating the meaning of the spectral function. For zero interaction,
$\hat V = 0$, the spectral weight is singularly concentrated on the single-particle energies
of the system. As interactions are switched on, the spectral weight is distributed
among a continuum of states concentrated around a shifted center point. The width
of the distribution increases with growing interaction strength. Dark solid curve:
(schematic) shape of the spectral function if the self-energy carries a pronounced $\omega$
dependence.

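To further substantiate this interpretation, consider the integral of the spectral function
weighted by the Fermi- or Bose-distribution function
$n_{F/B}(\omega) = (e^{\beta\omega} - \zeta_c)^{-1}$:

$$ \begin{aligned}
\int\frac{d\omega}{2\pi}\, n_{F/B}(\omega)\, A_a(\omega)
&= Z^{-1}\sum_{\alpha\beta} c_{a\,\alpha\beta}\, c^\dagger_{a\,\beta\alpha}
\left[e^{-\beta\Xi_\alpha} - \zeta_c\, e^{-\beta\Xi_\beta}\right]
\int d\omega\, \delta(\omega + \Xi_{\alpha\beta})\, n_{F/B}(\omega) \\
&= Z^{-1}\sum_{\alpha\beta} c_{a\,\alpha\beta}\, c^\dagger_{a\,\beta\alpha}\, e^{-\beta\Xi_\beta}
= Z^{-1}\sum_\beta e^{-\beta\Xi_\beta}\,\langle\beta|c_a^\dagger c_a|\beta\rangle
= \langle\hat n_a\rangle,
\end{aligned} $$

which tells us that, if we weight the spectral density function of a single-particle state
$|a\rangle$ with the thermal distribution function, and integrate over all frequencies, we obtain
the total occupation of that state. The relation

$$ \int\frac{d\omega}{2\pi}\, n_{F/B}(\omega)\, A_a(\omega) = \langle\hat n_a\rangle \tag{7.31} $$

indeed states that $A_a$ is a distribution function describing in what way the spectral weight of
the state $c_a^\dagger|\alpha\rangle$ spreads over the continuum of exact eigenstates.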
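Both sum rules can be verified to machine precision on a small interacting system, where the exact eigenbasis entering the Lehmann representation is available by direct diagonalization. The sketch below uses a toy model chosen purely for illustration (it is not a model discussed in the text): two fermionic orbitals with on-site energies, a hopping amplitude, and a density-density interaction $U$, built with a Jordan-Wigner construction. All parameter names and values are assumptions. It returns the pole positions $\omega = \Xi_\beta - \Xi_\alpha$ and the weights multiplying $2\pi\delta(\omega + \Xi_{\alpha\beta})$ in Eq. (7.30) for $\hat X_1 = c_1$, $\hat X_2 = c_1^\dagger$.

```python
import numpy as np

def two_site_spectral_weights(eps1=-0.4, eps2=0.3, hop=0.5, U=1.2, mu=0.1, beta=2.0):
    """Lehmann weights of A(omega)/(2 pi) for X1 = c_1, X2 = c_1^dagger in a two-orbital toy model."""
    a = np.array([[0.0, 1.0], [0.0, 0.0]])   # single-mode annihilator in the basis (|0>, |1>)
    sz = np.diag([1.0, -1.0])                # Jordan-Wigner string
    one = np.eye(2)
    c1 = np.kron(a, one)
    c2 = np.kron(sz, a)
    n1, n2 = c1.T @ c1, c2.T @ c2            # real matrices: transpose equals adjoint
    # K = H - mu N, so its eigenvalues are the Xi_alpha of the text
    K = (eps1 - mu) * n1 + (eps2 - mu) * n2 - hop * (c1.T @ c2 + c2.T @ c1) + U * (n1 @ n2)
    xi, vecs = np.linalg.eigh(K)
    boltz = np.exp(-beta * xi)
    Z = boltz.sum()
    c_eig = vecs.T @ c1 @ vecs               # matrix elements <alpha| c_1 |beta>
    # fermions: zeta = -1, so the bracket of Eq. (7.30) is a sum of Boltzmann factors
    weights = np.abs(c_eig) ** 2 * (boltz[:, None] + boltz[None, :]) / Z
    freqs = xi[None, :] - xi[:, None]        # pole position omega = Xi_beta - Xi_alpha
    occupation = np.sum(boltz * np.diag(vecs.T @ n1 @ vecs)) / Z   # thermal <n_1>
    return freqs, weights, occupation

beta = 2.0
freqs, weights, occ = two_site_spectral_weights(beta=beta)
n_fermi = 1.0 / (np.exp(beta * freqs) + 1.0)
print(weights.sum())                  # normalization: integral of A over d(omega)/(2 pi)
print((weights * n_fermi).sum(), occ) # Eq. (7.31): both numbers agree
```

With $U \neq 0$ the weight is spread over several poles rather than concentrated on a single one, which is the discrete counterpart of the smearing sketched in Fig. 7.5.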


A number of exact identities involving correlation functions are formulated in terms of
the spectral function. We begin by showing that the spectral function carries the same
information as the correlation function itself. (In view of the fact that $A$ is obtained by
removing the real part of $C$, this result might come as a surprise.) Indeed, starting from
$A(\omega) = i(C^+(\omega) - C^-(\omega))$, one may confirm the relation

$$ C(z) = \int_{-\infty}^{\infty}\frac{d\omega}{2\pi}\,\frac{A(\omega)}{z - \omega}. \tag{7.32} $$

Note that the first (second) term contributing to the right-hand side of the definition (a) is
analytic in the upper (lower) half of the complex $\omega$-plane and (b) decays faster than
$\omega^{-1}$ for $|\omega| \to \infty$. For $\mathrm{Im}\, z > 0$, the theorem of residues then
implies that the $C^-$-term does not contribute to the integral. (Without enclosing singularities,
the integration contour can be closed in the lower half plane.) As for the $C^+$ contribution, one
may integrate over an infinite semicircle $\gamma$ closing in the upper half plane to obtain

$$ \int_{-\infty}^{\infty}\frac{d\omega}{2\pi}\,\frac{A(\omega)}{z - \omega}
\;\overset{\mathrm{Im}\, z > 0}{=}\;
-\frac{1}{2\pi i}\oint_\gamma d\omega\,\frac{C^+(\omega)}{z - \omega} = C(z), \tag{7.33} $$

where the second identity relies on the analyticity of $C^+$ in the upper half plane. The case
$\mathrm{Im}\, z < 0$ is treated analogously. We thus find that knowledge of the imaginary part
suffices to reconstruct the full correlation function. (Notice, however, that the identity (7.32)
is heavily “non-local”; i.e. we need to know the spectral function for all $\omega$, including
$\omega \to \pm\infty$, to reconstruct the correlation function at a given value of $z$.)

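Considering the second equality in Eq. (7.33) and setting $z = \omega^+$, one obtains

$$ C^+(\omega) = -\frac{1}{2\pi i}\int d\omega'\,\frac{C^+(\omega')}{\omega - \omega' + i0}. $$

Representing the denominator under the integral in terms of the Dirac identity (7.19), and
collecting terms, the identity assumes the form

$$ C^+(\omega) = \frac{1}{\pi i}\int d\omega'\, C^+(\omega')\, P\,\frac{1}{\omega' - \omega}. $$

It is customary to consider real and imaginary parts of this relation separately, whence one
arrives at the celebrated Kramers-Kronig or dispersion relations:

$$ \begin{aligned}
\mathrm{Re}\, C^+(\omega) &= \frac{1}{\pi}\int d\omega'\, \mathrm{Im}\, C^+(\omega')\, P\,\frac{1}{\omega' - \omega}, \\
\mathrm{Im}\, C^+(\omega) &= -\frac{1}{\pi}\int d\omega'\, \mathrm{Re}\, C^+(\omega')\, P\,\frac{1}{\omega' - \omega}.
\end{aligned} \tag{7.34} $$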
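As a quick plausibility check of Eq. (7.34), one may take a single damped pole, $C^+(\omega) = (\omega - \epsilon + i\gamma)^{-1}$, which is analytic in the upper half plane, and evaluate the principal value integrals numerically. The sketch below is a hedged illustration: the model function, the parameter values, and the discretization (a midpoint grid placed symmetrically around the singular point, so that the divergent odd part cancels pairwise) are choices made for this example only.

```python
import numpy as np

def retarded(w, eps=0.5, gamma=0.3):
    """Model retarded function with a single pole at eps - i*gamma (lower half plane)."""
    return 1.0 / (w - eps + 1j * gamma)

def principal_value_transform(func, w, half_width=2000.0, step=0.01):
    """(1/pi) P int dw' func(w') / (w' - w), using midpoints symmetric about w' = w."""
    n = int(half_width / step)
    k = np.arange(-n, n)
    wp = w + (k + 0.5) * step             # never hits w' = w; singular parts cancel in pairs
    return step * np.sum(func(wp) / (wp - w)) / np.pi

w0 = 1.2
re_from_im = principal_value_transform(lambda x: retarded(x).imag, w0)
im_from_re = -principal_value_transform(lambda x: retarded(x).real, w0)
print(re_from_im, retarded(w0).real)      # first relation of (7.34)
print(im_from_re, retarded(w0).imag)      # second relation; accuracy limited by the 1/w' tail
```

The second relation converges more slowly because $\mathrm{Re}\, C^+$ decays only as $1/\omega'$, a reminder of the “non-local” character of these identities.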


INFO To appreciate the physical content of the Kramers-Kronig relations, let us anticipate
our discussion below and note that the scattering amplitude of particles incident on a medium at
energy $\omega$ is proportional to the retarded Green function $C^+(\omega)$. Taking as an example
the scattering of electromagnetic radiation from a solid state sample, the Kramers-Kronig relations
then tell us that the real part of the scattering amplitude - the index of refraction - is
proportional to the imaginary part - the index of absorption - integrated over all energies, i.e.
the Kramers-Kronig relations establish a connection between the two seemingly unrelated physical
mechanisms of absorption and refraction. Other areas of application of dispersion relations include
high energy physics, optics (both classical and quantum), and many more.

In the next section we discuss some concrete applications of the Kramers-Kronig relations
in many-body physics.


The dielectric function: a case study 

Equation (7.34) represents a “master identity” from which numerous other exact relations can be obtained. As an example of the derivation and usefulness of these identities, let us consider the frequency- and momentum-dependent dielectric function $\epsilon(\mathbf{q},\omega)$, i.e. the object describing the polarization properties of a medium in the presence of an electromagnetic field. In Section 5.2 we explored the dielectric function within the framework of the RPA. However, as we are about to discuss exact relations, we should now be a bit more ambitious than that, i.e. we should base our discussion on a more generally valid representation of the dielectric function. Indeed, it is a straightforward exercise in linear response to show that (see Problem 7.6.2)


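\[
\epsilon(\mathbf{q},\omega)^{-1} = 1 - \frac{V_0(\mathbf{q})}{L^d}\int d\tau\, e^{i\omega_m\tau}\,\langle \hat n(\mathbf{q},\tau)\,\hat n(-\mathbf{q},0)\rangle_c\Big|_{i\omega_m\to\omega^+}, \tag{7.35}
\]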


where $V_0(\mathbf{q}) = 4\pi e^2/q^2$ is the bare Coulomb potential, $\langle \hat n\hat n\rangle_c$ denotes the connected thermal average of two density operators $\hat n(\mathbf{q},\tau) = \sum_{\mathbf{p}} c^\dagger_{\mathbf{p}}(\tau)\, c_{\mathbf{p}+\mathbf{q}}(\tau)$, and $i\omega_m \to \omega^+$ indicates symbolically the analytical continuation to real frequencies. Heuristically, Eq. (7.35) can be understood by noting that $1/\epsilon = V_{\mathrm{eff}}/V_0$ measures the ratio between the effective potential felt by a test charge in a medium and the vacuum potential. The difference between these two quantities is due to the polarizability of the medium which, in turn, is a measure of its inclination to build up charge distortions $\delta\langle \hat n\rangle$ in response to the action of the potential operator $\sim \int V_0\, \hat n$. In linear response theory,[19] $\delta\langle \hat n\rangle$ is given by the kernel $\int \langle \hat n\hat n\rangle V_0$, i.e. the second term in Eq. (7.35).

We thus observe that the (inverse of the) dielectric function is determined by the retarded correlation function $C^+(\mathbf{q},\omega)$ with $\hat X_1 = \hat X_2 = \hat n$. (For obvious reasons, this function is called the retarded density-density response function. It appears as an important building block in many areas of many-body physics.) Building on the relation

\[ \epsilon(\mathbf{q},\omega)^{-1} = 1 - \frac{V_0(\mathbf{q})}{L^d}\, C^+(\mathbf{q},\omega), \tag{7.36} \]

we next show how analyticity arguments and certain limit relations can be employed to derive strong test criteria for the dielectric function. We first apply the Kramers-Kronig relation to $\epsilon^{-1} - 1 \propto C^+$ to obtain

\[ \mathrm{Re}\,\epsilon(\mathbf{q},\omega)^{-1} - 1 = \frac{1}{\pi}\int_{-\infty}^{\infty} d\omega'\, \mathrm{Im}\,\epsilon(\mathbf{q},\omega')^{-1}\, \frac{1}{\omega' - \omega}. \]

[19] The linear response approximation is quite appropriate here because the standard definition of the dielectric function, $\epsilon = \lim_{V_0\to 0}(V_0/V_{\mathrm{eff}})$, implies an infinitesimally weak external perturbation.

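Using the fact that $\mathrm{Im}\,\epsilon(\mathbf{q},\omega')^{-1} = -\mathrm{Im}\,\epsilon(-\mathbf{q},-\omega')^{-1} = -\mathrm{Im}\,\epsilon(\mathbf{q},-\omega')^{-1}$, where the first identity holds for the Fourier transform of arbitrary real-valued functions and the second follows from real space symmetry, the integral can be brought to the compact form

\[ \mathrm{Re}\,\epsilon(\mathbf{q},\omega)^{-1} = 1 + \frac{2}{\pi}\int_0^{\infty} d\omega'\, \mathrm{Im}\,\epsilon(\mathbf{q},\omega')^{-1}\, \frac{\omega'}{\omega'^2 - \omega^2}. \tag{7.37} \]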


We next consider this relation for the special case $\omega = 0$ and $|\mathbf{q}| \to 0$. The probing of this limit is motivated by the fact that the behavior of the static dielectric function $\epsilon(\mathbf{q},\omega = 0)$ is significantly simpler to analyze than that of the generic function $\epsilon(\mathbf{q},\omega)$. The reason is, of course, that a static external field does not prompt any dynamical response of the system. Indeed, one can show on general grounds (see the Info block below) that, in the static limit,

\[ \epsilon(\mathbf{q},0)^{-1} = \left(1 + 4\pi\nu|\mathbf{q}|^{-2}\right)^{-1} \xrightarrow{|\mathbf{q}|\to 0} 0. \tag{7.38} \]


Substitution of this result into Eq. (7.37) then leads to the identity

\[ \lim_{\mathbf{q}\to 0}\int_0^{\infty} d\omega\, \frac{\mathrm{Im}\,\epsilon(\mathbf{q},\omega)^{-1}}{\omega} = -\frac{\pi}{2}. \tag{7.39} \]

Equation (7.39) is a typical example of a sum rule. For the derivation of a few more sum rules to be obeyed by the dielectric function see, e.g., the text by Mahan.[20]
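As an illustration of how a sum rule of this kind serves as a test criterion, the sketch below checks Eq. (7.39) numerically for one specific model. The model is an assumption introduced here and not taken from the text: a long-wavelength Drude form $\epsilon(\omega) = 1 - \omega_p^2/(\omega(\omega + i\gamma))$, whose inverse vanishes in the static limit as required by Eq. (7.38). Function names and parameter values are hypothetical.

```python
import math

def inverse_epsilon(omega, omega_p=1.0, gamma=0.5):
    """1/epsilon for the assumed Drude model epsilon = 1 - wp^2 / (w (w + i*gamma))."""
    return 1.0 + omega_p**2 / (omega**2 + 1j * gamma * omega - omega_p**2)

def sum_rule_integral(cutoff=500.0, step=0.005):
    """Midpoint-rule estimate of int_0^cutoff dw Im(1/epsilon(w)) / w."""
    n = int(cutoff / step)
    total = 0.0
    for k in range(n):
        w = (k + 0.5) * step          # midpoints avoid w = 0 exactly
        total += inverse_epsilon(w).imag / w
    return total * step

value = sum_rule_integral()
print(value, -math.pi / 2)
```

An approximation scheme for the dielectric function that fails this check by more than numerical error would violate the exact relation.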


INFO To understand the behavior of the dielectric function in the static limit, imagine a system of charged particles subject to a potential $V_0(\mathbf{r})$. Since $V_0$ is static, it does not drive the system out of equilibrium. In particular, all fermionic quasi-particle states[21] are filled up to a uniform chemical potential $\mu$. This, however, necessitates a redistribution of charge. The reason is that the potential shifts the energies $\epsilon_a$ of quasi-particle states $|a\rangle$ according to the relation $\epsilon_a \to \epsilon_a - V_{\mathrm{eff}}(\mathbf{r})$, where $V_{\mathrm{eff}}$ is the effective potential seen by the particles. (In order for this relation to make sense, the typical spatial extent of a quasi-particle must, of course, be smaller than the modulation range of the potential.) For low temperatures, states will be filled up to an energy $\mu - V_{\mathrm{eff}}(\mathbf{r})$ (see the figure). We can make this argument quantitative by introducing a distribution function $n_{\mathrm{eff}}(\epsilon,\mathbf{r}) \equiv n_F(\epsilon - V_{\mathrm{eff}}(\mathbf{r}))$ locally controlling the occupation of quasi-particle states. In a linear approximation, the induced charge density screening the external potential is then given by

\[
\begin{aligned}
\rho_{\mathrm{ind}}(\mathbf{r}) &= -\frac{1}{L^d}\sum_a \big(n_{\mathrm{eff}}(\epsilon_a,\mathbf{r}) - n_F(\epsilon_a)\big) = -\nu\int d\epsilon\,\big(n_{\mathrm{eff}}(\epsilon,\mathbf{r}) - n_F(\epsilon)\big) \\
&= -\nu\int d\epsilon\,\big(n_F(\epsilon - V_{\mathrm{eff}}(\mathbf{r})) - n_F(\epsilon)\big) \\
&\approx \nu\int d\epsilon\,\partial_\epsilon n_F(\epsilon)\,V_{\mathrm{eff}}(\mathbf{r}) = -\nu V_{\mathrm{eff}}(\mathbf{r}),
\end{aligned}
\]

with $\nu$ the density of states per unit volume. To compute the difference between the external and the effective potential, respectively, we assume that the former had been generated through some charge density $\rho_0$: $-\partial^2 V_0 = 4\pi\rho_0$, or $V_0(\mathbf{q}) = 4\pi q^{-2}\rho_0(\mathbf{q})$. In contrast, the full potential $V_{\mathrm{eff}}$ will be generated by a charge distribution $\rho_{\mathrm{eff}}$ comprising the external charge and the screening charge, $V_{\mathrm{eff}}(\mathbf{q}) = 4\pi q^{-2}(\rho_0(\mathbf{q}) + \rho_{\mathrm{ind}}(\mathbf{q})) = V_0(\mathbf{q}) - 4\pi\nu q^{-2} V_{\mathrm{eff}}(\mathbf{q})$. We thus find that

\[ \epsilon(\mathbf{q},0) = \frac{V_0(\mathbf{q})}{V_{\mathrm{eff}}(\mathbf{q})} = 1 + 4\pi\nu|\mathbf{q}|^{-2}, \]

as stated above. Indeed, we had obtained this result, generally known as Thomas-Fermi screening, long before within the more microscopic framework of the RPA. In contrast, the merit of the present argument is that it is not based on any specific approximation scheme.[22]

[20] G. Mahan, Many Particle Physics (Plenum Press, 1981).

[21] We assume that we are dealing with a Fermi liquid, i.e. that we can think of the constituents of the system as fermionic quasi-particles, in the spirit of Landau’s theory.
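The screening argument above can be condensed into a few lines of code. This is a minimal sketch in the units of the text ($e = 1$); the function names and the numerical values of $\nu$ and $\rho_0$ are hypothetical choices made only for illustration.

```python
import math

def epsilon_static(q, nu):
    """Static dielectric function epsilon(q, 0) = 1 + 4*pi*nu / q^2."""
    return 1.0 + 4.0 * math.pi * nu / q**2

def bare_potential(q, rho0=1.0):
    """V_0(q) = 4*pi*rho0 / q^2 generated by an external charge rho0(q)."""
    return 4.0 * math.pi * rho0 / q**2

def screened_potential(q, nu, rho0=1.0):
    """V_eff(q) = V_0(q) / epsilon(q, 0) = 4*pi*rho0 / (q^2 + 4*pi*nu)."""
    return bare_potential(q, rho0) / epsilon_static(q, nu)

nu = 0.3   # assumed density of states, arbitrary units
for q in (2.0, 0.5, 1e-4):
    print(q, bare_potential(q), screened_potential(q, nu))
```

While the bare potential diverges for $q \to 0$, the screened potential saturates at $\rho_0/\nu$, which is the momentum-space signature of Thomas-Fermi screening.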


While Eq. (7.39) was derived for the specific example of the dielectric function, the general construction recipe has much wider applicability: (a) a quantity of physical interest (here, the dielectric function) is represented in terms of a retarded response function which then (b) is substituted into a Kramers-Kronig-type relation. This produces a frequency non-local connection between the response function at a given frequency and an integral (“sum”) over all other frequencies. (c) This integral is evaluated for a reference frequency for which our reference quantity is generally known (here, $\omega \to 0$). This produces an integral relation which should hold under very general conditions. As mentioned at the beginning of this section, sum rules play an important role as test criteria for physical approximation schemes.


Experimental access to the spectral density function

Earlier, we have seen that the spectral density function contains highly resolved microscopic information about a many-body system. But how do we access this information other than by theoretical model calculation? Interestingly, it turns out that the spectral function is not only central to theoretical analysis, but also directly related to a key experimental observable, the inelastic scattering cross-section. To appreciate this connection, consider again the prototypical setup of a scattering experiment shown in Fig. 7.1. The frequency- and angle-resolved scattering cross-section is a measure of the rate of transitions from the incoming state $(\epsilon,\mathbf{k})$ into an outgoing state $(\epsilon',\mathbf{k}')$. To give the problem a quantum mechanical formulation, we first note that the full Hilbert space $\mathcal{H}$ of the system is the direct product of the Fock space, $\mathcal{F}$, of the target system and the single-particle space $\mathcal{H}_1$ of the incoming particle species, $\mathcal{H} = \mathcal{F}\otimes\mathcal{H}_1$. We assume that the interaction between the incoming particle and the constituent particles of the system is governed by an interaction Hamiltonian whose conventional (first quantized) real space representation reads $\hat H_{\mathrm{int}} = \sum_i V(\hat{\mathbf{r}}_i - \hat{\mathbf{r}})$. Here, $\hat{\mathbf{r}}_i$ are the positions of the particles of the system and $\hat{\mathbf{r}}$ is the position of the incoming particle. For simplicity we assume that the interaction is point-like, i.e. $V(\mathbf{r} - \mathbf{r}') = C\delta(\mathbf{r} - \mathbf{r}')$, where $C$ is some constant. (The generalization to more general interaction potentials is straightforward.) The second quantized representation of the interaction operator then reads

\[ \hat H_{\mathrm{int}} = C\int d^dr\, \delta(\hat{\mathbf{r}} - \mathbf{r})\, c^\dagger(\mathbf{r})c(\mathbf{r}), \tag{7.40} \]

where $\hat{\mathbf{r}}$ retains its significance as a single-particle operator acting in the space $\mathcal{H}_1$. An alternative and, for all that follows, more convenient representation is given by

\[ \hat H_{\mathrm{int}} = C\int d^dr \int\frac{d^dq}{(2\pi)^d}\, e^{i\mathbf{q}\cdot(\hat{\mathbf{r}} - \mathbf{r})}\, c^\dagger(\mathbf{r})c(\mathbf{r}) = C\int\frac{d^dq}{(2\pi)^d}\, e^{i\mathbf{q}\cdot\hat{\mathbf{r}}}\,\hat\rho(\mathbf{q}), \]

[22] However, the argument is a bit phenomenological and does rely on the Fermi liquid doctrine.


where we have made use of a plane wave representation of the $\delta$-function, $\delta(\mathbf{r}) = \int (d^dq/(2\pi)^d)\exp(i\mathbf{q}\cdot\mathbf{r})$, and $\hat\rho(\mathbf{q}) \equiv \int d^dr\, e^{-i\mathbf{q}\cdot\mathbf{r}}c^\dagger(\mathbf{r})c(\mathbf{r})$ describes density modulations in the target system of characteristic momentum $\mathbf{q}$. Assuming, for simplicity, that the sample is kept at zero temperature (i.e. it is in its ground state), a scattering process of first order in the interaction Hamiltonian is described by the transition amplitude $\mathcal{A}(\mathbf{q}) = \langle\beta,\mathbf{k}-\mathbf{q}|\hat H_{\mathrm{int}}|0,\mathbf{k}\rangle$, where $|0\rangle$ represents the ground state of the system, and $|\beta\rangle$ may be any exact eigenstate of the target system.[23] Substitution of the representation of the interaction Hamiltonian above brings the transition amplitude to the form

\[ \mathcal{A}(\mathbf{q}) = \langle\beta,\mathbf{k}-\mathbf{q}|\hat H_{\mathrm{int}}|0,\mathbf{k}\rangle \propto \langle\beta|\hat\rho(\mathbf{q})|0\rangle. \tag{7.41} \]

A first conclusion to be drawn from Eq. (7.41) is that the scattering amplitude probes density modulations in the bulk system. According to Fermi’s Golden Rule, the transition rate associated with the scattering amplitude $\mathcal{A}(\mathbf{q})$ is given by

\[ P(\mathbf{q}) = 2\pi\sum_\beta |\langle\beta|\hat\rho(\mathbf{q})|0\rangle|^2\, \delta(\omega - \Xi_{\beta 0}), \tag{7.42} \]

where $\Xi_{\beta 0} = \Xi_\beta - \Xi_0 > 0$ is the excitation energy of $|\beta\rangle$ above the ground state and the $\delta$-function enforces energy conservation. The summation over $\beta$ reflects the fact that only the beam of scattered particles is observed while the final state of the target remains unspecified.


[23] Using the nomenclature of Fig. 7.1, one might identify $|0\rangle$ with a ground state of zero collective momentum $\mathbf{K} = 0$ and $|\beta\rangle = |\mathbf{K} = \mathbf{q}\rangle$ with a state that has absorbed the momentum of the scattered particle. The present discussion is more general in that it does not assume that the target eigenstates carry definite momentum.

It is instructive to reformulate Eq. (7.42) in a number of different ways. Representing the $\delta$-function as a time integral, $2\pi\delta(\omega) = \int dt\,\exp(+i\omega t)$, one obtains

\[
\begin{aligned}
P(\mathbf{q}) &= \int dt\, e^{+i\omega t}\sum_\beta \langle 0|e^{i(\hat H - \mu\hat N)t}\hat\rho(-\mathbf{q})e^{-i(\hat H - \mu\hat N)t}|\beta\rangle\langle\beta|\hat\rho(\mathbf{q})|0\rangle \\
&= \int dt\, e^{+i\omega t}\langle 0|e^{i(\hat H - \mu\hat N)t}\hat\rho(-\mathbf{q})e^{-i(\hat H - \mu\hat N)t}\hat\rho(\mathbf{q})|0\rangle = \int dt\, e^{+i\omega t}\langle 0|\hat\rho(-\mathbf{q},t)\hat\rho(\mathbf{q},0)|0\rangle.
\end{aligned}
\]

This clearly illustrates the connection between the observable scattering rate and the microscopic characteristics of the system, i.e. the rate $P(\mathbf{q},\omega)$ is a measure of the dynamical propagation of density modulations of wave vector $\mathbf{q}$ at time scales $\sim\omega^{-1}$.

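To establish the connection to the previously developed apparatus of response functions, we introduce the abbreviation $\hat\rho(\mathbf{q})_{\alpha\beta} \equiv \langle\alpha|\hat\rho(\mathbf{q})|\beta\rangle$ and reformulate Eq. (7.42) as

\[
\begin{aligned}
P(\mathbf{q}) &= -2\,\mathrm{Im}\sum_\beta \frac{\hat\rho(\mathbf{q})_{\beta 0}\,\hat\rho(-\mathbf{q})_{0\beta}}{\omega^+ + \Xi_{0\beta}} = -2\lim_{T\to 0}\mathrm{Im}\,\frac{1}{Z}\sum_{\alpha\beta}\frac{\hat\rho(\mathbf{q})_{\beta\alpha}\,\hat\rho(-\mathbf{q})_{\alpha\beta}\, e^{-\beta\Xi_\alpha}}{\omega^+ + \Xi_{\alpha\beta}} \\
&= -2\lim_{T\to 0}\mathrm{Im}\,\frac{1}{Z}\sum_{\alpha\beta}\frac{\hat\rho(\mathbf{q})_{\beta\alpha}\,\hat\rho(-\mathbf{q})_{\alpha\beta}}{\omega^+ + \Xi_{\alpha\beta}}\left(e^{-\beta\Xi_\alpha} - e^{-\beta\Xi_\beta}\right) \\
&= -2\lim_{T\to 0}\mathrm{Im}\, C^+(\omega) = A(\mathbf{q},\omega),
\end{aligned}
\]

where $A(\mathbf{q},\omega)$ denotes the spectral density function evaluated for the density operators $\hat X_1 = \hat\rho(\mathbf{q})$, $\hat X_2 = \hat\rho(-\mathbf{q})$. Here, we have made use of the fact that, for $T \to 0$, the Boltzmann weight $\exp(-\beta\Xi_\alpha)$ projects onto the ground state. Similarly, for $\omega > 0$, the contribution $\exp(-\beta\Xi_\beta)$ vanishes (exercise: why?).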

We thus note that information about the scattering cross-sections is also carried by a 
retarded real time response function. More specifically: 


The inelastic scattering cross-section for momentum transfer $\mathbf{q}$ and energy exchange $\omega$ is a direct probe of the spectral density function $A(\mathbf{q},\omega)$.
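The zero-temperature relation between the golden-rule rate and the spectral function can be made concrete with a toy spectrum. The sketch below is illustrative only: the list of excitation energies $\Xi_{\beta 0}$ and weights $|\langle\beta|\hat\rho(\mathbf{q})|0\rangle|^2$ is invented, and the infinitesimal $i0$ is replaced by a small finite broadening `eta`, so that each $\delta$-peak of Eq. (7.42) becomes a narrow Lorentzian.

```python
import math

def spectral_function(omega, excitations, eta=0.05):
    """A(omega) = -2 Im sum_b |rho_b0|^2 / (omega + i*eta - Xi_b0).

    `excitations` is a list of (Xi_b0, |rho_b0|^2) pairs (hypothetical data);
    eta > 0 stands in for the infinitesimal i0.
    """
    c_plus = sum(w / (omega + 1j * eta - xi) for xi, w in excitations)
    return -2.0 * c_plus.imag

def integrated_weight(excitations, eta=0.05, half_width=200.0, step=0.005):
    """int dw A(w) / (2*pi), which should approach the total weight sum_b |rho_b0|^2."""
    n = int(2 * half_width / step)
    total = 0.0
    for k in range(n):
        w = -half_width + (k + 0.5) * step
        total += spectral_function(w, excitations, eta)
    return total * step / (2.0 * math.pi)

toy = [(1.0, 0.7), (2.5, 0.3)]   # assumed excitation energies and weights
weight = integrated_weight(toy)
print(weight)
```

The function is non-negative and peaked at the excitation energies, and its frequency integral recovers the total weight of the density matrix elements, as expected from Eq. (7.42).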


Although the derivation above did not exactly follow the linear response scheme, it had 
the same weak coupling perturbative flavor. (The golden rule is a first-order perturbative 
approximation!) While we derived our formula for the particular case of a short-range density 
coupling, it is clear that a more general beam-target coupling mechanism would lead to an 
expression of the same architecture, i.e. a suitably defined retarded response function. 


7.4 Electromagnetic linear response 

In the previous two sections we have assembled everything needed to compute the response 
of physical systems to moderately weak perturbations. We have learned how to linearize 
the response in the strength of the generalized force and to extract real time dynamical 
information from imaginary time data. In this section we will illustrate the functioning of
this formalism on undoubtedly its most important application, the response to a general 
electromagnetic field. 

The general setup of the problem is easily formulated: suppose a system of charged particles has been subjected to an electromagnetic signal represented through a scalar potential $\phi(\mathbf{r},t)$ and/or a vector potential $\mathbf{A}(\mathbf{r},t)$. To simplify the notation, let us represent the perturbation through a $(1+d)$-dimensional[24] potential $A_\mu(x) = (\phi(x),\mathbf{A}(x))$, where $x \equiv (t,\mathbf{r})$ is a $(1+d)$-dimensional space-time argument vector. The system will respond to this perturbation by a redistribution of charge, $\rho(x) = \langle\hat\rho(x)\rangle$, and/or the onset of current flow $\mathbf{j}(x) = \langle\hat{\mathbf{j}}(x)\rangle$. Confining charge and vectorial current into a $(1+d)$-dimensional generalized current vector $j_\mu = (\rho,\mathbf{j})$, our task is to identify the linear functional $j = K[A] + \mathcal{O}(A^2)$ relating the current to its driving potential. Written more explicitly,

\[ j_\mu(x) = \int_{t'<t} dx'\, K_{\mu\nu}(x,x')\,A_\nu(x'), \tag{7.43} \]

where the condition $t' < t$ indicates that the response is retarded. To compute the elements of the response tensor, we proceed according to the general recipe constructed in the preceding sections: we first employ the general formalism to derive the imaginary-time response $K(\mathbf{r},\mathbf{r}';i\omega_n)$, and then continue to real frequencies, $i\omega_n \to \omega + i0$.

However, before formulating this programme in detail, let us derive two fundamental constraints to be fulfilled by the kernel $K$. Firstly, a gauge potential $A_\mu = \partial_\mu f$ ($f$ is a function) cannot drive a physical current. Substituting this condition into Eq. (7.43), we find

\[ 0 = \int_{t'<t} dx'\, K_{\mu\nu}(x,x')\,\partial_{\nu'} f(x') = -\int_{t'<t} dx'\, \big(\partial_{\nu'}K_{\mu\nu}(x,x')\big)\, f(x'). \]

Since $f$ is arbitrary one may conclude that $0 = K_{\mu\nu}\overleftarrow{\partial}_\nu$, where the arrow indicates that the derivative acts from the right. Secondly, current conservation demands $\partial_\mu j_\mu = 0$ or, transcribed to Eq. (7.43), $0 = \int_{t'<t} dx'\, \partial_\mu K_{\mu\nu}(x,x')A_\nu(x')$. Due to the arbitrariness of $A_\nu$, this condition can only be generally valid if $\partial_\mu K_{\mu\nu} = 0$. Summarizing,

Gauge invariance and particle number conservation demand the identity $\partial_\mu K_{\mu\nu} = K_{\mu\nu}\overleftarrow{\partial}_\nu = 0$.


We next turn to the derivation of the linear response kernel. Our starting point is an observation made already in Chapter 1 (and several times thereafter), namely the fact that the coupling of matter to the classical electromagnetic field is described by the relation[25]

\[ j_\mu = \frac{\delta S_c[A]}{\delta A_\mu}, \tag{7.44} \]

[24] To not unnecessarily exclude important applications of the formalism to effectively two- or one-dimensional problems, the dimensionality of space is left unspecified.


where the subscript “c” indicates that only the part of the action that couples matter to the field (but not the background action of the field) is differentiated. In this expression we are back to a Euclidean metric $v_\mu w_\mu = v_0w_0 + \mathbf{v}\cdot\mathbf{w}$ (on account of the definition, $j_0 = i\rho$, $A_0 = i\phi$ of imaginary time current and vector potential, respectively). The additional $i$ appearing in this convention mirrors the $i$ of the temporal component of the imaginary-time vector $(t,\mathbf{r}) \to (-i\tau,\mathbf{r})$. Notice, however, that we are merely changing conventions, i.e. both the imaginary- and the real-time action contain the invariant contribution $-\rho\phi + \mathbf{j}\cdot\mathbf{A}$. To keep the notation simple, we will not indicate the difference between imaginary-time and real-time vectors explicitly; the appropriate convention is determined by the context in which we are operating.

This appealingly simple structure (7.44) is all we need to write down a general expression for the linear response kernel. Comparison with Section 7.2.1 shows that $F = F' = A_\mu$, i.e. the generalized force, $F'$, and the auxiliary “force” $F$ used to generate the expectation value of $j$ by differentiation are both given by $A_\mu$. Equation (7.6) then tells us that

\[ K_{\mu\nu}(x,x') = Z^{-1}\frac{\delta^2}{\delta A_\mu(x)\,\delta A_\nu(x')}\bigg|_{A=0} Z[A], \tag{7.45} \]

which is one of the most important relations of microscopic response theory. Notice that, owing to the interchangeability of the two derivatives, $K_{\mu\nu}(x,x') = K_{\nu\mu}(x',x)$. This symmetry in turn implies that gauge invariance, $K_{\mu\nu}\overleftarrow{\partial}_\nu = 0$, and particle number conservation, $\partial_\mu K_{\mu\nu} = 0$, represent different sides of the same coin, an observation repeatedly made before. Equation (7.45) is a very general, but also very formal, result. To give it some meaning, it is instructive to evaluate the two derivatives for a few specific functionals $Z[A]$, as described in the next section.


Electromagnetic response of the microscopic theory 
As a particularly important example, let us consider the microscopic action of a Fermi or 
Bose system in the presence of an electromagnetic field. Ignoring the coupling of the field 
to the spin degrees of freedom, 

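\[ S[\bar\psi,\psi,A] = \int dx\, \bar\psi\Big(\partial_\tau + i\phi + \frac{1}{2m}(-i\nabla - \mathbf{A})^2 - \mu + V_0\Big)\psi + S_{\mathrm{int}}[\bar\psi,\psi], \]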

[25] You may ask yourself why we use the relation (a) $j_\mu = \delta_{A_\mu}S_c[A]$ instead of a direct representation of linear coupling (b) $S_c[A] = \int j_\mu A_\mu + \dots$ The reason is that in some prominent cases (the microscopic theory of charged particles discussed in Section 7.4 being the most prominent example) the current $j_\mu[A]$ depends on $A$ by itself. This means that $A$ will enter the action non-linearly (viz. by a “diamagnetic term” $\sim A^2$) and this in turn implies that (b) ceases to be a faithful representation of the more general (a). However, our theory of linear response is based on derivative operations such as (a) anyway and the above complication does not pose a significant problem.

where $V_0$ denotes a (field-independent) potential. With $A_0 = i\phi$, $A_i = (\mathbf{A})_i$, and

Jo =ip= 3? = (j)» = di (7.46) 


where $f\overleftrightarrow{\partial}_i g \equiv f\partial_i g - (\partial_i f)g$, we have $j^A_\mu = \frac{\delta}{\delta A_\mu}S[A]$, and the notation $j^A$ hints at the important fact that the current itself depends on the vector potential. Application of Eq. (7.45) then gives


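\[
\begin{aligned}
K_{\mu\nu}(x,x') &= Z^{-1}\frac{\delta^2}{\delta A_\mu(x)\,\delta A_\nu(x')}\bigg|_{A=0}\int D(\bar\psi,\psi)\, e^{-S[\bar\psi,\psi,A]} \\
&= -Z^{-1}\frac{\delta}{\delta A_\nu(x')}\bigg|_{A=0}\int D(\bar\psi,\psi)\, j^A_\mu(x)\, e^{-S[\bar\psi,\psi,A]} \\
&= \Big\langle -\delta(x-x')\,\delta_{\mu\nu}\,\partial_{A_\mu(x)}j^A_\mu(x) + j^A_\mu(x)\,j^A_\nu(x')\Big\rangle,
\end{aligned}
\]

where the angular brackets denote functional averaging. Setting $A = 0$, one obtains

\[ K_{\mu\nu}(x,x') = -\frac{\delta(x-x')}{m}\,\delta_{\mu\nu}(1 - \delta_{\mu 0})\,\langle\hat\rho(x)\rangle + \langle\hat j^p_\mu(x)\,\hat j^p_\nu(x')\rangle. \tag{7.47} \]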


The first term contributing to the response kernel is known as the diamagnetic term. Indeed, tracing back through the derivation, one discovers that this term originates from the diamagnetic contribution $\sim\mathbf{A}^2$ to the Hamiltonian. Conversely, $\hat j^p \equiv \hat j^A|_{A=0}$ is called the paramagnetic contribution to the current operator. The functional expectation value $\sim\langle\hat j^p\hat j^p\rangle$ defines the paramagnetic term.

Notwithstanding its unappealing structure, Eq. (7.47) represents a strong result which encompasses virtually all aspects of electromagnetic linear response.[26] For example, the diagonal vectorial components $K_{ii}$, $i = 1,\dots,d$, of the response tensor describe the longitudinal conductivity of the system (see Section 7.4.1). In cases where a magnetic field is present, the off-diagonal components $K_{i\neq j}$ measure the Hall conductivity. The temporal components $K_{00}$ describe the density response of the system, a feature that is of importance, for example, in the analysis of scattering data, etc.


Electromagnetic response of effective theories

In cases where we are operating on a level beyond the microscopic description, i.e. within the framework of an effective low-energy theory, the structure of the response tensor may differ from Eq. (7.47). To identify the “effective” response tensor, we need to carry out a canonical two-step programme: (i) identify the coupling of the electromagnetic potential to the relevant degrees of freedom (in cases where one has not kept track of the field dependence from the very beginning, it usually suffices to minimally couple the field, i.e. to introduce the components $A_\mu$ so as to make the action gauge invariant); (ii) perform the two-fold derivative Eq. (7.45).


[26] Except for spin-related phenomena.

We illustrate the procedure on a familiar example, the phase action of the BCS superconductor,

\[ S[\theta] = \int dx\,\Big[\nu(\partial_\tau\theta)^2 + \frac{n_s}{2m}(\nabla\theta)^2\Big]. \tag{7.48} \]

Under a gauge transformation induced by some function $f(x)$, $\partial_\mu\theta \to \partial_\mu\theta + \partial_\mu f$, i.e. the gauge invariant extension of the action reads (cf. Eq. (6.34)) $S[\theta,A] = \int dx\,[\nu(\partial_\tau\theta - \phi)^2 + \frac{n_s}{2m}(\nabla\theta - \mathbf{A})^2]$. Differentiating this result with respect to the components $A_i$, one obtains

\[ K_{ij}(x,x') = \frac{n_s}{m}\Big[\delta_{ij}\,\delta(x-x') - \frac{n_s}{m}\langle\partial_i\theta(x)\,\partial_j\theta(x')\rangle\Big]. \tag{7.49} \]

EXERCISE Evaluate the correlator $\langle\partial_i\theta\,\partial_j\theta\rangle$ to show that the conductivity $\sigma_{ii}(\mathbf{q} = 0,\omega)$ diverges in the limit $\omega \to 0$. The divergence of the conductivity can be traced back to the fact that, in a superconductor, there is no cancellation between diamagnetic and paramagnetic responses. Compute the response of the system to a static magnetic field. Show that, in the London gauge $\nabla\cdot\mathbf{A} = 0$, current and vector potential are proportional to each other, $\mathbf{j}\propto\mathbf{A}$. Recall (cf. Section 6.4) that this result, in combination with the Maxwell equation $\nabla\times\mathbf{H} = 4\pi\mathbf{j}$, implies the Meissner effect.


7.4.1 Longitudinal conductivity of the disordered electron gas

Equations (7.47) and (7.49) exemplify the structure of the response tensor after the master formula (7.45) has been applied. However, to obtain a concrete result for the conductivity, say, one must evaluate the two-point ($\langle\hat\rho\rangle$) or four-point ($\langle\hat j\hat j\rangle$) correlation functions appearing in these expressions. Of course, the details of this last step of the programme depend sensitively on the microscopic structure of the theory under consideration, i.e. no universally applicable computational recipe can be formulated. Notwithstanding these differences in details, a number of large-scale structural elements are recurrent in the mathematical analysis of the current-current correlation functions of microscopic response theory. To illustrate these common structures, let us consider an example that is of considerable importance in its own right, namely the longitudinal AC conductivity of the electron gas.

The AC conductivity is defined through the relation

\[ \mathbf{j}(\omega) = \sigma(\omega)\,\mathbf{E}(\omega), \tag{7.50} \]

i.e. as the coefficient relating the current density to a homogeneous external field oscillating with frequency $\omega$. In the absence of symmetry-breaking perturbations such as a strong magnetic field or system-intrinsic anisotropies, the current flow will be in line with the field gradient. This justifies the scalar, rather than a more complex tensorial, ansatz for the conductivity in Eq. (7.50). (For a more complex situation, see Problem 7.6.4.)

INFO Before entering the details of the linear response calculation, it is instructive to recapitulate how the conductivity can be obtained from common sense reasoning. To this end, let us look at the world from the perspective of an individual conduction electron. In the presence of an electric field, the electron will be subject to two different forces: the force of the field, $-e\mathbf{E}$, and a dissipative friction force $-\frac{m}{\tau}\dot{\mathbf{r}}$ inhibiting its free acceleration (deviating from our convention, $e = 1$, we temporarily reintroduce the electron charge). The damping rate is essentially set by the scattering rate off static impurities, viz. $1/\tau$. To a first approximation, the dynamics of the electron is thus described by the equation of motion $m\ddot{\mathbf{r}} = -e\mathbf{E}(t) - \frac{m}{\tau}\dot{\mathbf{r}}$ or, in Fourier representation, $-im\omega\,\mathbf{v}(\omega) = -e\mathbf{E}(\omega) - \frac{m}{\tau}\mathbf{v}(\omega)$. Solving for $\mathbf{v}$ one finds that the current density $\mathbf{j} = -ne\mathbf{v}$ ($n$ is the particle density) is given by $\mathbf{j}(\omega) = \sigma(\omega)\mathbf{E}(\omega)$ with

\[ \sigma(\omega) = \frac{ne^2}{m}\,\frac{1}{\tau^{-1} - i\omega}. \tag{7.51} \]

In the limit $\omega \gg \tau^{-1}$, $\sigma(\omega) \simeq i\frac{ne^2}{m\omega}$, i.e. the electron current is determined by the ballistic motion of the electrons in the electric force field. Conversely, in the DC limit $\omega \to 0$, one obtains the Drude formula $\sigma = \frac{ne^2\tau}{m}$, i.e. the field drives a steady current density whose value is limited by the amount of momentum relaxation.
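The elementary argument of the Info block translates directly into code. The following minimal sketch solves the Fourier-transformed equation of motion for the velocity, forms the current $\mathbf{j} = -ne\mathbf{v}$, and compares the result with the closed form (7.51) and its two limits. Function names and the parameter values ($n = e = m = 1$, $\tau = 2$) are hypothetical choices for illustration.

```python
def drude_conductivity(omega, n=1.0, e=1.0, m=1.0, tau=2.0):
    """sigma(omega) = (n e^2 / m) / (1/tau - i*omega), cf. Eq. (7.51)."""
    return (n * e**2 / m) / (1.0 / tau - 1j * omega)

def current_from_equation_of_motion(omega, field, n=1.0, e=1.0, m=1.0, tau=2.0):
    """Solve -i m w v = -e E - (m/tau) v for v and return j = -n e v."""
    velocity = -e * field / (m * (1.0 / tau - 1j * omega))
    return -n * e * velocity

field = 0.3
for omega in (0.0, 0.5, 1e4):
    j = current_from_equation_of_motion(omega, field)
    print(omega, j, drude_conductivity(omega) * field)
```

At $\omega = 0$ the conductivity is real and equal to $ne^2\tau/m$, while for $\omega \gg \tau^{-1}$ it approaches the purely imaginary ballistic value $i\,ne^2/(m\omega)$.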


Below we compute the conductivity under the simplifying assumption that Coulomb or other types of many-body interaction are negligible.[27] By contrast, the discussion of the Info block above shows that any meaningful analysis of the conductivity must take account of the presence of static disorder; without disorder, the field would make the electrons freely accelerate, i.e. there would be no such thing as a steady current flow in metals. Thus, what we should have in mind when we think about the conductivity is $\langle\sigma\rangle$, the conductivity averaged over all realizations of a microscopic disorder potential.[28] To obtain the conductivity, we will explore the current flow in response to a vector potential $\mathbf{A}$, where $\partial_t\mathbf{A} = -\mathbf{E}$ generates the electric field. Anticipating that, after averaging over impurities, the system will be effectively translationally invariant, the analysis is most economically carried out in Fourier space, i.e. we compute the expectation value $\mathbf{j}(\mathbf{q}\to 0,\omega)$ in response to a potential $\mathbf{A}(\mathbf{q}\to 0,\omega)$, where the limit $\mathbf{q}\to 0$ indicates that we are interested in a spatially homogeneous external field. Fourier transforming Eq. (7.47) in the difference of coordinates, $x - x'$ (cf. the remarks made in connection with Eq. (7.4)), one verifies that the appropriate response kernel is given by

K (o) = . ( 7 - 52 ) 


where the vectorial index $i$ is arbitrary and the double brackets indicate the two-fold average over the quantum thermal distribution and the disorder potential. Also notice that the connection $\omega_m A(q) = E(q)$ implies

\[ \sigma(\omega) = -\lim_{\mathbf{q}\to 0}\frac{1}{\omega_m}K(q)\Big|_{i\omega_m\to\omega + i0}. \tag{7.53} \]


[27] Nonetheless, the calculation will still be fairly technical. If you find the Drude conductivity of the electron gas too elementary an observable to justify these efforts, please keep in mind that our prime motivation is methodological: technical operations very similar to those detailed here appear in practically every quantum response analysis.

[28] A good question to ask is whether the conductivity of any particular realization of a disordered metal will differ from the averaged conductivity, i.e. whether the conductivity is a “self-averaging” quantity. This leads one to the interesting problem of conductance statistics, a topic beyond the scope of the current text.
To process this expression, we substitute the Fourier transform of Eq. (7.46) (exercise),


P = E Vv.pVv.p, % q = E J^( 2p + QOiVv.pV’ 


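into (7.52) and apply Wick’s theorem to compute the thermal expectation value. This leads to the expression

\[ K(q) = -\frac{2T}{L^d}\sum_{n,\mathbf{p},\mathbf{p}'}\Big\langle G_{n,\mathbf{p}',\mathbf{p}}\,\frac{\delta_{\mathbf{p}\mathbf{p}'}}{m} + \frac{(2\mathbf{p}+\mathbf{q})_i\,(2\mathbf{p}'+\mathbf{q})_i}{(2m)^2}\, G_{n,\mathbf{p}',\mathbf{p}}\, G_{n+m,\mathbf{p}+\mathbf{q},\mathbf{p}'+\mathbf{q}}\Big\rangle, \tag{7.54} \]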


where the factor of two accounts for the spin summation (for simplicity we assume that the Hamiltonian is spin-independent), $G_{n,\mathbf{p},\mathbf{p}'} = \langle\bar\psi_{n,\mathbf{p}}\psi_{n,\mathbf{p}'}\rangle$ is the thermal electron Green function for a particular realization of the disordered background, and the remaining bracket stands for the disorder average. (Notice that, before disorder averaging, the system lacks translational invariance, i.e. the single-particle Green function is not diagonal in momentum space.) A diagrammatic representation of the paramagnetic response kernel in terms of Green functions is shown in the upper part of the figure below, where the wavy lines denote the current operator and the dashed lines symbolically represent the scattering off the static impurity potential (cf. our discussion of the impurity scattering and Green functions in Section 6.5).

To make further progress with this expression, we shall adopt an approximation that critical readers may find questionable: we will replace the average of the two-Green-function correlator by an impurity average of the individual Green functions, $\langle GG\rangle \to \langle G\rangle\langle G\rangle$.[29] In Section 6.5 it has been shown that these averaged Green functions are given by

\[ \langle G_{n,\mathbf{p},\mathbf{p}'}\rangle = \delta_{\mathbf{p},\mathbf{p}'}\,G_p, \qquad G_p = \frac{1}{i\omega_n - \xi_{\mathbf{p}} + \frac{i}{2\tau}\,\mathrm{sgn}(\omega_n)}, \]

where $\tau$ defines the mean impurity scattering time. (Recall what has been said above about the meaning of the imaginary part of the Green function denominator as an effective inverse lifetime. In the present context, $\tau$ measures the time after which a particle with initial momentum $\mathbf{p}$ gets scattered into states of different momentum.) Substituting this result into Eq. (7.54), we obtain


\[ K(q) = -\frac{2T}{L^d}\sum_{p}\Big[\frac{G_p}{m} + \frac{(2\mathbf{p}+\mathbf{q})_i^2}{(2m)^2}\, G_{p+q}\,G_p\Big]. \tag{7.55} \]


[29] Readers who have navigated through the disorder section in Chapter 6 are invited to critically assess the validity of this approximation. The result of this analysis is that, for hard impurity scattering potentials, i.e. for impurity potentials $\langle\mathbf{p}|V|\mathbf{p}'\rangle = \mathrm{const.}$ that scatter isotropically over the entire momentum shell, the approximation above becomes justifiable in the limit $\mathbf{q}\to 0$. To understand why, construct any diagram involving impurity lines connecting the two Green functions. You will notice that, in this case, the two fast momenta carried by the current operators are no longer locked to each other. The subsequent angular integration over these momentum vectors then leads to a vanishing of the diagram. The physical mechanism behind this vanishing is that, in a disordered medium, the velocity vector of an electron changes rapidly, i.e. a velocity-velocity (or current-current) correlation function is highly susceptible to large-angle impurity scattering.
Looking at the structure of Eq. (7.54), one might be tempted to perform the frequency summation by one of the standard formulae. However, a moment’s thought shows that this strategy will not work. The reason is that, in the present case, the extension of the Green function to the complex plane, $G_p \to G_{\mathbf{p}}(z) = (z - \xi_{\mathbf{p}} + \frac{i}{2\tau}\,\mathrm{sgn}\,\mathrm{Im}\,z)^{-1}$, has a cut along the real axis; this, in turn, means that the product $G_{\mathbf{p}+\mathbf{q}}(z + i\omega_m)\,G_{\mathbf{p}}(z)$ has cuts at $\mathrm{Im}(z) = 0$ and $\mathrm{Im}(z + i\omega_m) = 0$ (see the figure). To circumvent this difficulty, we need to integrate over a contour that (a) encompasses all Matsubara frequencies and (b) avoids the cut lines. While there is no simple contour that does the job, a joint integration over the three contours $\gamma_{1,2,3}$ shown in the figure faithfully represents the Matsubara sum:





dz riY{z)Gp{z)Gp +tl {z + i 


The three integrals contributing to the sum lead to very different results: as shown in the exercise below, the two integrals over the contours $\gamma_1$ and $\gamma_3$ cancel against the diamagnetic contribution to the response tensor, i.e. the first term in Eq. (7.55).


INFO To appreciate why the contribution from the contours $\gamma_1$ and $\gamma_3$ is unphysical, notice that, e.g.

\[
\frac{1}{2\pi i}\oint_{\gamma_1} dz\, n_F(z)\,G_{\mathbf{p}}(z)\,G_{\mathbf{p}+\mathbf{q}}(z + i\omega_m) = \frac{1}{2\pi i}\int d\epsilon\, n_F(\epsilon)\,G_{\mathbf{p}}(\epsilon + i0)\,G_{\mathbf{p}+\mathbf{q}}(\epsilon + i\omega_m)
\xrightarrow{i\omega_m\to\omega + i0} \frac{1}{2\pi i}\int d\epsilon\, n_F(\epsilon)\,G_{\mathbf{p}}(\epsilon + i0)\,G_{\mathbf{p}+\mathbf{q}}(\epsilon + \omega + i0),
\]

i.e. the integral extends over a product of two retarded single-particle Green functions. Now, remember that at the end of the day we want to compute the conductance, i.e. a quantity that resembles a transition probability (namely the probability for electrons propagating under the influence of an applied electric field). In quantum mechanics, probabilities $\sigma \sim GG^*$ appear as absolute squares of amplitudes, $G$. Indeed, we have seen before that the retarded Green function can be interpreted as a transition amplitude of quantum mechanical particles. We also saw that its complex conjugate, $G^*$, is an advanced Green function. Put differently, our admittedly very hand-waving argument indicates that the quantum dynamics of conduction should be described in terms of products of advanced and retarded Green functions. However, a product of two retarded (or two advanced) Green functions lacks an obvious physical interpretation. Indeed, these contributions cancel against the diamagnetic term.
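We thus focus on the integral over $\gamma_2$:

$$ \frac{1}{2\pi i}\oint_{\gamma_2} dz\, n_F(z)\,G_p(z)\,G_{p+q}(z + i\omega_m)
 = \frac{1}{2\pi i}\int d\epsilon\, n_F(\epsilon)\left[-G_p(\epsilon - i0)\,G_{p+q}(\epsilon + i\omega_m) + G_p(\epsilon - i\omega_m)\,G_{p+q}(\epsilon + i0)\right] $$
$$ \xrightarrow{\; i\omega_m \to \omega + i0 \;}\;
 \frac{1}{2\pi i}\int d\epsilon\, n_F(\epsilon)\left[-G_p(\epsilon^-)\,G_{p+q}(\epsilon^+ + \omega) + G_p(\epsilon^- - \omega)\,G_{p+q}(\epsilon^+)\right] $$
$$ = -\frac{1}{2\pi i}\int d\epsilon\,\left[n_F(\epsilon) - n_F(\epsilon + \omega)\right]G_p(\epsilon^-)\,G_{p+q}(\epsilon^+ + \omega), $$

where $\epsilon^\pm \equiv \epsilon \pm i0$ and, in the first equality, we have used the symmetry
$n_F(\epsilon + i\omega_m) = n_F(\epsilon)$ of the distribution function under translation by bosonic
frequencies; the last equality follows from the shift $\epsilon \to \epsilon + \omega$ in the second term.
Substituting this result into Eq. (7.55) and (7.53), we arrive at the intermediate result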


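$$ \sigma(\omega) = \frac{1}{2\pi}\int d\epsilon\, \frac{n_F(\epsilon) - n_F(\epsilon + \omega)}{\omega}\; \frac{1}{L^d m^2}\sum_p p_i^2\, G^-_p(\epsilon)\,G^+_p(\epsilon + \omega), \qquad (7.56) $$

where $G^\pm_p(\epsilon) = (\epsilon - \xi_p \pm \frac{i}{2\tau})^{-1}$.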
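
The symmetry $n_F(\epsilon + i\omega_m) = n_F(\epsilon)$ used in the contour manipulation is easy to confirm numerically. The following is only a minimal sketch: the function names `fermi` and `bosonic_matsubara` and the parameter values are illustrative choices, not part of the text.

```python
import cmath
import math


def fermi(z, temperature):
    """Fermi function n_F(z) = 1 / (exp(z/T) + 1), evaluated for complex z."""
    return 1.0 / (cmath.exp(z / temperature) + 1.0)


def bosonic_matsubara(m, temperature):
    """Bosonic Matsubara frequency omega_m = 2*pi*m*T."""
    return 2.0 * math.pi * m * temperature


temperature = 0.1   # illustrative value
eps = 0.37          # illustrative real energy

# Largest deviation of n_F(eps + i*omega_m) from n_F(eps) over a few m.
max_deviation = max(
    abs(fermi(eps + 1j * bosonic_matsubara(m, temperature), temperature)
        - fermi(eps, temperature))
    for m in range(-3, 4)
)
print(max_deviation)
```

The deviation is at the level of floating-point rounding, reflecting $e^{i\omega_m/T} = e^{2\pi i m} = 1$.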


EXERCISE To complete our analysis, we need to understand better the cancellation of the
diamagnetic term and, relatedly, the role played by the integration contours $\gamma_{1,3}$. We first
write $K(q) = K(0) + [K(q) - K(0)]$ and observe that (i) the contribution of $\gamma_2$ discussed above
sits solely in the finite-$q$ term, $K(q) - K(0)$ (the contour $\gamma_2$ collapses to zero at $q = 0$), while (ii)
the diamagnetic term and the contribution of the curves $\gamma_{1,3}$ are in $K(0)$. (Finite-$q$ corrections to
the contour integrals along $\gamma_{1,3}$ vanish due to the strong convergence of the integrands and the
fact that all singularities lie on one side of the real axis: think about this point.) The cancellation
of the diamagnetic term against the contribution of $\gamma_{1,3}$ thus amounts to showing $K(0) = 0$.

Introducing the notation $G_{n,\epsilon} \equiv G_p(i\omega_n)|_{\epsilon_p = \epsilon}$, show that

$$ K_{ij}(0) = -\delta_{ij}\,\frac{T}{m}\sum_n \int d\epsilon\, \nu(\epsilon)\, G_{n,\epsilon}\left(1 + \frac{2\epsilon}{d}\, G_{n,\epsilon}\right), $$

where $\nu(\epsilon) = L^{-d}\sum_p \delta(\epsilon - \epsilon_p)$, and we used
$L^{-d}\sum_p F(\epsilon_p)\,p_i p_j = \frac{2m\,\delta_{ij}}{d}\, L^{-d}\sum_p F(\epsilon_p)\,\epsilon_p$ by the isotropy of
space. Integrating by parts (why do the contributions of the integration boundaries vanish?),
and using the fact that $\nu(\epsilon) = \mathrm{const.} \times \epsilon^{(d-2)/2}$, show that $K(0) = 0$.


The analytical structure of this result actually reveals a number of important elements of
the electron conduction process: an AC field of frequency $\omega$ creates electron-hole pairs of
excitation energy $\omega$. The phase volume accessible to these processes is measured by the
$\epsilon$-integral, weighted by the difference of two Fermi functions, i.e. Fermi statistics demands
that the energy of the electron to be excited lies within a shell $[\mu - \omega, \mu]$ at the Fermi surface
(see the figure below). The dynamics of the excited electron is described by a retarded Green
function of energy $\epsilon + \omega$. Conversely, the dynamics of the hole is described by the advanced
Green function. Heuristically, this can be understood by noting that the advanced Green function
describes the (fictitious) propagation of an electron backwards in time. However, an electron
propagating in a chronologically reversed direction can be interpreted as a hole moving forward
in time. Thus, the advanced electronic Green function effectively represents a descriptor of hole
dynamics. That the product of two Green functions is weighted by two operators $p_i/m$ reflects
the fact that current transport essentially depends on the velocity of the electrons.
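
The statement that the Fermi functions select a shell of width $\omega$ can be checked numerically: the weight $\int d\epsilon\,[n_F(\epsilon) - n_F(\epsilon + \omega)]/\omega$ equals unity at any temperature. The sketch below assumes energies measured from the chemical potential; the helper names and the grid parameters are illustrative.

```python
import numpy as np


def fermi(eps, temperature):
    """Fermi function written via tanh to avoid overflow at large |eps|/T."""
    return 0.5 * (1.0 - np.tanh(eps / (2.0 * temperature)))


def window_weight(omega, temperature, half_width=60.0, n_points=400_001):
    """Trapezoidal estimate of the integral of [n_F(e) - n_F(e + omega)] / omega."""
    eps = np.linspace(-half_width, half_width, n_points)
    f = (fermi(eps, temperature) - fermi(eps + omega, temperature)) / omega
    return float(np.sum(0.5 * (f[1:] + f[:-1]) * np.diff(eps)))


# Same frequency, three illustrative temperatures.
weights = {t: window_weight(omega=0.5, temperature=t) for t in (0.01, 0.1, 1.0)}
print(weights)
```

Temperature only smears the edges of the shell; its total weight is unchanged.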

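To perform the sum over $p$ we note that, for a radially symmetric function $F(\xi_p)$,

$$ \int \frac{d^dp}{(2\pi)^d}\, p_i^2\, F(\xi_p) = \frac{1}{d}\int \frac{d^dp}{(2\pi)^d}\, p^2\, F(\xi_p) = \frac{2m}{d}\int d\xi\, (\xi + \mu)\,\nu(\xi)\,F(\xi), $$

where $\nu(\xi)$ denotes the density of states per unit volume and $p^2 = 2m(\xi_p + \mu)$.
Identifying the function $F$ with the product of our two Green functions, we obtain

$$ \frac{1}{L^d}\sum_p p_i^2\, G^-_p(\epsilon)\,G^+_p(\epsilon + \omega)
 = \frac{2m}{d}\int d\xi\, (\xi + \mu)\,\nu(\xi)\,G^-(\epsilon)\,G^+(\epsilon + \omega)
 \simeq \frac{2m\mu\nu}{d}\int d\xi\, G^-(\epsilon)\,G^+(\epsilon + \omega)
 = \frac{2m\mu\nu}{d}\,\frac{2\pi\tau}{1 - i\omega\tau}, $$
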

where, in the approximate step, we have used the fact that the Green functions are strongly
peaked on scales $\tau^{-1} \ll \mu$ around $\xi = 0$, implying that, to a very good approximation, the
$\xi$-integration can be extended to the entire real axis and the energy variation of the density
of states $\nu(\xi)$ on the effective interval of integration is negligible. The last integral can be
performed by elementary means or, more elegantly, by closing the $\xi$-integration contour
in the upper or lower half plane and using the analytic structure of the Green functions.
Substituting the result into our formula for $\sigma(\omega)$, and using the fact that
$\int d\epsilon\, (n_F(\epsilon) - n_F(\epsilon + \omega))\,\omega^{-1} \simeq 1$, we arrive at the final result Eq. (7.51).
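
The value of the last $\xi$-integral, $\int d\xi\, G^-(\epsilon)G^+(\epsilon + \omega) = 2\pi\tau/(1 - i\omega\tau)$ with $G^\pm(\epsilon) = (\epsilon - \xi \pm i/2\tau)^{-1}$, can be cross-checked by brute-force quadrature. This is a sketch under stated assumptions: the finite cutoff, the grid and the parameter values are arbitrary illustrative choices, and the truncated tails limit the accuracy to roughly 2/cutoff.

```python
import numpy as np


def green(eps, xi, tau, sign):
    """Disorder-broadened Green function (eps - xi + sign * i/(2 tau))^(-1)."""
    return 1.0 / (eps - xi + sign * 0.5j / tau)


def xi_integral(eps, omega, tau, cutoff=2000.0, n_points=800_001):
    """Trapezoidal estimate of the xi-integral of G^-(eps) G^+(eps + omega)."""
    xi = np.linspace(-cutoff, cutoff, n_points)
    f = green(eps, xi, tau, -1) * green(eps + omega, xi, tau, +1)
    return complex(np.sum(0.5 * (f[1:] + f[:-1]) * np.diff(xi)))


tau, omega, eps = 1.0, 0.5, 0.0            # illustrative values
numeric = xi_integral(eps, omega, tau)
closed_form = 2.0 * np.pi * tau / (1.0 - 1j * omega * tau)
print(numeric, closed_form)
```

For $\omega = 0$ the integrand reduces to a Lorentzian of half-width $1/2\tau$, whose area $2\pi\tau$ is the DC limit of the closed form.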

At this point it is rewarding to pause for a moment to look back at a number of large-scale 
structures of the analysis: 


▷ On the microscopic level, the two-point function $\langle jj\rangle$ of current operators is represented
in terms of a four-point function $\sim \langle\bar\psi\psi\bar\psi\psi\rangle$ of field operators. A moment's thought shows
that four-point correlation functions generically appear as microscopic descriptors
of the response functions of quantum single-particle operators (simply because, in second
quantization, a single-particle operator maps onto a bilinear of field operators) and,
therefore, of the vast majority of response functions.
▷ This observation implies a hierarchy of correlation functions. For example, in our
non-interacting example, the retarded response correlation function was described in
terms of a product of an advanced and a retarded microscopic single-particle correlation
function.30 This almost trivial observation is of some importance because it entails the next point.
▷ A separation of energy scales. The argument $(\omega, q)$ of the response function is a
measure of the frequency/momentum scale of the external perturbation. These parameters
are to be distinguished from the argument $(\epsilon, p)$ of the single-particle propagators
microscopically describing the response process. In many cases, the two sets of scales are
parametrically different. For example, in the problem discussed above, $(\omega, q)$ is set by
the resolution of an external electronic apparatus and, therefore, much slower than the
frequency/momentum scale $(\epsilon, p)$ of electronic charge carriers at the Fermi surface. In
these circumstances, the ratios $\omega/\epsilon$ and $|q|/|p|$ define small expansion parameters which
can be used in the approximate evaluation of the response kernel. (In other cases, e.g.
spectroscopic analysis of microscopic single-particle excitations, it is essential that $(\omega, q)$
and $(\epsilon, p)$ be of the same order.)

30 For interacting problems, the response functions no longer neatly split into a product of two single-particle
Green functions. Instead, the four field operators describing the response functions can be connected by an
arbitrarily complicated network of interaction vertices. However, ultimately, the propagators connecting this
network are again the single-particle Green functions of the problem.


7.5 Summary and outlook 

This completes our preliminary survey of theory and application of correlation functions 
in many-body physics. We have seen that correlation functions, notably real-time retarded 
response functions, represent a principal interface between experiment and theory. The 
connection between these objects and experimental data - the latter represented through 
expectation values of certain operators - was established by a formalism known as linear 
response: assuming an experimentally imposed perturbation of a many-body system to be 
weak, its response can be analyzed within a controlled expansion scheme whose leading- 
order term is identified as a retarded correlation function. 

A number of general properties of the response function - e.g. causality - could be identi- 
fied from common sense reasoning. However, to develop a full understanding of this object, 
notably the connection between the imaginary time response function (the quantity pro- 
duced by evaluation of the field integral) and its retarded real time counterpart (the quantity 
we are interested in), we had to go beyond that level. Indeed, we saw that the relation 
between various types of correlation function could be revealed by means of a powerful spec- 
tral representation that was simple, exact, and not dependent on any particular physical 
context (while not being particularly useful for practical computational purposes). 

The structural elements developed early in the chapter were finally bundled into a power- 
ful theory of electromagnetic linear response. We illustrated the application of this machin- 
ery on the example of the longitudinal conductivity of the electron gas and pointed out a 
number of computational elements common to most concrete microscopic linear response 
calculations. 

In the next chapter we return to the intrinsic development of the theory. We become 
acquainted with the renormalization group, a versatile analytical tool designed to analyze 
correlation functions (and other objects of physical interest) within the vast space left 
open between straightforward perturbation theory at one end and the few available exact 
evaluation schemes at the other. 
7.6 Problems

7.6.1 Orthogonality catastrophe

This problem describes a scenario realized frequently in condensed matter in which a spatially localized
quasi-particle is immersed in a large host system (the latter described by some $N$-body ground state
wavefunction). The resulting wavefunction, “single-particle state $\otimes$ $N$-particle ground state,” turns out
to be largely orthogonal to the true ground state of the now $(N + 1)$-particle system. In practice, this
means that it takes a very long time for the intruding particle to become accommodated to its new
environment, a retardation mechanism known as the orthogonality catastrophe. Following Anderson's
original line of argument we here explore the basic physical mechanism underlying the phenomenon.

In the thermodynamic limit, Anderson31 has shown that the ground state $|\Psi'\rangle$ of a system
in the presence of a local perturbation $V(r)$ is orthogonal to the ground state $|\Psi\rangle$ of the
unperturbed system:

$$ |\langle\Psi|\Psi'\rangle|^2 \;\xrightarrow{\; N\to\infty \;}\; 0. $$

(Here, the local perturbation serves as a caricature of the interaction potential created
by a new particle in the system.) This mechanism implies, for example, that transport
coefficients measured in setups where a sample is connected to external leads at local
contacts often determine a “contact resistance” (determined by the resistance of the sample to
accommodate external charge carriers) rather than the bulk conduction properties of the
system.


(a) Consider a system of non-interacting fermions. Show that the overlap between the
many-particle ground states of the unperturbed system and of the perturbed system is
bounded from above:

$$ \chi \equiv |\langle\Psi|\Psi'\rangle|^2 \le \exp\Big[-\sum_{\epsilon_n < E_F,\ \epsilon_{m'} > E_F} |\langle n|m'\rangle|^2\Big] \equiv \exp[-2I], $$

where $|n\rangle$ and $|m'\rangle$ are single-particle states of the unperturbed and perturbed system,
respectively, and $E_F$ is the Fermi energy. To obtain this inequality, recall that the
non-interacting many-particle states are Slater determinants. What is the overlap between
ground states? To simplify the resulting expression, use the fact that $|\det A| \le 1$ if $A$ has
normalized, but non-orthogonal rows. Here one may assume that, if the perturbation is
small, $\sum_{\epsilon_{m'} > E_F} |\langle n|m'\rangle|^2 \ll 1$ for $\epsilon_n < E_F$.

(b) We next compute the exponent $I$ for the particularly simple case of a spherically
symmetric system of radius $R$. Focusing on the sector of lowest angular momentum $l = 0$,
the unperturbed states are given by $\phi_n(r) = N_n \sin(k_n r)/(k_n r)$, where $k_n = \pi n/R$ and
the normalization $N_n = k_n/\sqrt{2\pi R}$. The asymptotic profile of the perturbed wavefunctions
can be approximated as $\phi'_n(r) = N_n \sin(k_n r + \delta_n(1 - r/R))/(k_n r)$, where $\delta_n$ is the
s-wave scattering phase shift. Use this ansatz to compute first the overlap matrix $A$ and
then the exponent $I$.

31 P. W. Anderson, Infrared catastrophe in Fermi gases with local scattering potentials, Phys. Rev. Lett. 18
(1967), 1049-51.

Answer: 


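(a) In the ground state, all single-particle levels with energies below the Fermi energy $E_F$
are filled while all levels with higher energies are empty. Thus, the overlap of the ground
states is given by $\langle\Psi|\Psi'\rangle = \det A$, i.e. $\chi = |\det A|^2$, where $A_{nm} = \langle n|m'\rangle$ and
$\epsilon_n, \epsilon_{m'} < E_F$. Defining a row-normalized matrix $\tilde A$ by $\tilde A_{nm} = A_{nm}/\mathcal{N}_n$ with
$\mathcal{N}_n = \big[\sum_{\epsilon_{m'} < E_F} |\langle n|m'\rangle|^2\big]^{1/2}$, we conclude that
$\chi = |\det\tilde A|^2 \prod_n \mathcal{N}_n^2 \le \prod_n \mathcal{N}_n^2$. We next take the logarithm of this relation to obtain

$$ \ln\chi \le \ln\prod_{\epsilon_n < E_F}\mathcal{N}_n^2 = 2\sum_{\epsilon_n < E_F}\ln\mathcal{N}_n. $$

Using the closure relation $\sum_{m'} |m'\rangle\langle m'| = 1$ and expanding the logarithm in
$x = \sum_{\epsilon_{m'} > E_F} |\langle n|m'\rangle|^2 \ll 1$, we arrive at the approximation

$$ \ln\chi \le \sum_{\epsilon_n < E_F} \ln\Big(\sum_{\epsilon_{m'} < E_F} |\langle n|m'\rangle|^2\Big)
 = \sum_{\epsilon_n < E_F} \ln\Big(1 - \sum_{\epsilon_{m'} > E_F} |\langle n|m'\rangle|^2\Big)
 \simeq -\sum_{\epsilon_n < E_F,\ \epsilon_{m'} > E_F} |\langle n|m'\rangle|^2 = -2I. $$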
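
The inequality used above, $|\det\tilde A| \le 1$ for a matrix with normalized rows (Hadamard's inequality), can be illustrated on a random example. This is a minimal sketch; the random matrix merely stands in for an overlap matrix, and the helper name `row_normalised` is illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)


def row_normalised(a):
    """Return (a with unit-norm rows, the original row norms)."""
    norms = np.linalg.norm(a, axis=1, keepdims=True)
    return a / norms, norms.ravel()


a = rng.normal(size=(6, 6))            # stand-in for an overlap matrix
a_tilde, norms = row_normalised(a)

det_a = np.linalg.det(a)
det_tilde = np.linalg.det(a_tilde)
print(abs(det_tilde), abs(det_a), np.prod(norms))
```

The determinant factorizes as $\det A = \det\tilde A\,\prod_n \mathcal{N}_n$, so the bound on $|\det\tilde A|$ translates directly into $|\det A| \le \prod_n \mathcal{N}_n$.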


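(b) The overlap matrix $A$ is given by

$$ A_{nm} = \langle n|m'\rangle = 4\pi N_n N_m \int_0^R dr\, r^2\, \frac{\sin(k_n r)}{k_n r}\, \frac{\sin\big(k_m r + \delta_m(1 - \frac{r}{R})\big)}{k_m r}
 \simeq \frac{2\pi N_n N_m}{k_n k_m}\, \frac{\sin\delta_m}{k_n - k_m + \frac{\delta_m}{R}}
 = \frac{\sin\delta_m}{\pi(n - m) + \delta_m}, $$

where a contribution smaller by a factor of order $|k_n - k_m|/(k_n + k_m)$ has been neglected.
Performing the summation over level indices $n$ below and $m$ above the Fermi wave
number $n_F = k_F R/\pi$, we obtain
$I = \frac{1}{2}\sum_{n \le n_F < m} |A_{nm}|^2 \simeq \frac{1}{2\pi^2}\sin^2\delta\, \ln M$, where
$E_M = (\pi M/R)^2/(2m)$ is a cut-off energy at which the phase shift $\delta = \delta_M$ has become
negligibly small. Since $E_M$ is some fixed (independent of the system size) energy scale,
$M = R\,(2mE_M)^{1/2}/\pi$ grows as a power of the system size. Thus $I \sim \ln N$, which in turn implies
that the overlap $|\langle\Psi|\Psi'\rangle|^2$ vanishes as some negative power of $N$.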
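
The power-law decay can be made visible by evaluating the determinant of the overlap matrix directly. The sketch below is an idealization, not the calculation of the text: it assumes a constant phase shift $\delta$ for all levels and takes $A_{nm} = \sin\delta/(\pi(n - m) + \delta)$ for the $N$ lowest levels. For this Cauchy-type matrix the squared determinant should fall off approximately as $N^{-2(\delta/\pi)^2}$.

```python
import numpy as np


def overlap_matrix(n_levels, delta):
    """A_nm = sin(delta) / (pi (n - m) + delta) for n, m = 1..n_levels."""
    n = np.arange(1, n_levels + 1)
    return np.sin(delta) / (np.pi * (n[:, None] - n[None, :]) + delta)


def ground_state_overlap_sq(n_levels, delta):
    """|det A|^2, computed through the log-determinant for numerical stability."""
    _, logdet = np.linalg.slogdet(overlap_matrix(n_levels, delta))
    return float(np.exp(2.0 * logdet))


delta = np.pi / 4                      # illustrative constant phase shift
sizes = [50, 100, 200, 400]
chi = [ground_state_overlap_sq(n, delta) for n in sizes]

# Slope of log(chi) versus log(N): an estimate of the decay exponent.
slope = np.polyfit(np.log(sizes), np.log(chi), 1)[0]
print(chi, slope)
```

The fitted exponent is compatible with the bound $\chi \le \exp[-2I]$ derived above, since $\sin^2\delta \le 2\delta^2$.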


7.6.2 RPA dielectric function 

Much of the response of a system of charged fermions to an external electromagnetic perturbation is
encoded in the dielectric function $\epsilon_q$. While the dielectric function cannot be computed rigorously, the
most common approximation scheme - asymptotically exact in the limit of the infinitely dense gas -
is the RPA. It is the purpose of this problem to elaborate upon the various sporadic encounters with
the RPA dielectric function we had in the text. Specifically, we will derive Eq. (7.35) and discuss the
generalization to a disordered electron gas. The problem is physically instructive and a good exercise in
diagrammatic calculus.

Figure 7.6 Representation of the dielectric function in terms of the full density-density correlation
function (the dark-shaded bubble) and the interaction-irreducible density-density correlation
function (light-shaded bubble).

(a) Derive Eq. (7.35). To this end, consider two infinitesimally weak (yet generally
time-dependent) test charges immersed in a system of charged fermions. Omitting the
self-interaction of the particles, determine the interaction correction to the free energy and
use your result to compute the ratio between the vacuum and the actual interaction
potential. Show that

$$ \epsilon_q^{-1} = 1 - \frac{T}{L^d}\, V_0(q)\, \langle \hat n_q\, \hat n_{-q}\rangle_c, \qquad (7.57) $$

where the subscript “c” indicates that only connected diagrams contribute. Analytic
continuation to real frequencies obtains Eq. (7.35).

(b) Equation (7.57) represents a rigorous yet quite formal result for the dielectric function;
it involves the exact density-density correlation function which is generally unknown.
As a first step towards a more manageable expression, show that Eq. (7.35) is equivalent
to

$$ \epsilon_q = 1 + \frac{T}{L^d}\, V_0(q)\, \langle \hat n_q\, \hat n_{-q}\rangle_{\rm irr}, \qquad (7.58) $$

where $\langle \hat n\hat n\rangle_{\rm irr}$ is the interaction-irreducible density-density response function, i.e. the
sum of all diagrams that cannot be cut into two just by cutting one interaction line
(i.e. $\langle \hat n\hat n\rangle_{\rm irr}$ is to the effective interaction what the self-energy is to the Green function).
In order to show that Eq. (7.57) and (7.58) are equivalent, find a series expansion of
$C(q, i\omega_m)$ in terms of $V_0$ and $\Pi(q, i\omega_m)$. The series is very similar to the RPA series.
Note, however, that, in contrast to RPA, the bubble $\langle \hat n\hat n\rangle_{\rm irr}$ still contains interaction
lines. A graphical representation might be helpful.


Answer: 
(a) If we represent the two test charges with two charge distributions $\rho_{1,2}(q)$, the Coulomb
interaction becomes $S_{\rm int} = \frac{T}{2L^d}\sum_q [\hat n_q + \rho_{1,q} + \rho_{2,q}]\, V_0(q)\, [\hat n_{-q} + \rho_{1,-q} + \rho_{2,-q}]$.
An expansion of the field integral (or, equivalently, the partition function) to lowest order in $\rho_{1,2}$
then produces the interaction contribution to the free energy,

$$ F_{\rm int}[\rho_1, \rho_2] = \frac{T^2}{L^d}\sum_q \rho_{1,q}\left[ V_0(q) - \frac{T}{L^d}\, V_0^2(q)\, \langle \hat n_q\, \hat n_{-q}\rangle_c \right]\rho_{2,-q}. $$

The term in rectangular brackets determines the effective interaction potential $V_{\rm eff}(q)$.
Remembering that $\epsilon_q = V_0(q)/V_{\rm eff}(q)$, we obtain Eq. (7.57).

(b) For a diagrammatic representation of the solution, see Fig. 7.6. 
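
The equivalence of Eq. (7.57) and (7.58) rests on a geometric series: chaining irreducible bubbles by single interaction lines gives $C = \Pi - \Pi g \Pi + \Pi g \Pi g \Pi - \dots$, so that $1 - gC = 1/(1 + g\Pi)$. A minimal numerical sketch follows. Here `coupling` stands for the combination $T V_0(q)/L^d$ and `pi_irr` for the irreducible bubble at fixed $(q, i\omega_m)$; both are treated as plain illustrative numbers, and the sign and normalization conventions are assumptions.

```python
def connected_from_irreducible(pi_irr, coupling, n_terms=200):
    """Sum the series C = Pi - Pi g Pi + Pi g Pi g Pi - ... term by term."""
    total, term = 0.0, pi_irr
    for _ in range(n_terms):
        total += term
        term *= -coupling * pi_irr
    return total


pi_irr, coupling = 0.8, 0.6            # illustrative numbers with |g * Pi| < 1
c = connected_from_irreducible(pi_irr, coupling)

eps_inverse = 1.0 - coupling * c       # structure of Eq. (7.57)
eps = 1.0 + coupling * pi_irr          # structure of Eq. (7.58)
print(eps_inverse * eps)
```

The product of the two expressions equals unity, which is the statement that both formulae describe the same dielectric function.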


7.6.3 Electromagnetic response of a quantum dot 

In Problem 6.7 we considered a model of quantum dots coupled by tunnel contacts to external leads. 
There, we derived an effective action for this system but did not say much about its actual physical 
behavior. As a first step towards a better understanding of the electromagnetic properties of small 
quantum dots, we here derive a formula elucidating the classical response of the system. In
Problem 8.8.2, we then apply renormalization group methods to study how quantum fluctuations affect the
low-temperature physics of the system. (Before addressing this problem, refamiliarize yourself with the 
derivation of the dissipative tunneling action in Problem 6.7.) 

(a) Consider Eq. (6.67) for a quantum dot coupled to a lead. Assuming that a bias voltage $U(\tau)$
between lead and dot has been applied, show that the argument of the dissipative action
generalizes to $\phi(\tau) \to \phi(\tau) + \int^{\tau} d\tau'\, U(\tau')$. Focusing on the regime of high temperatures,
expand the action to second order in $\phi$ (argue why, for high temperatures, anharmonic
fluctuations of $\phi$ are small) and compute the 00-component of the linear response kernel,
$K_{00}$. Analytically continue back to real frequencies and show that the current flowing across
the tunnel barrier in response to a voltage $U(\omega)$ is given by

$$ I(\omega) = \frac{U(\omega)}{\dfrac{2\pi}{g_T} - \dfrac{2E_C}{i\omega}}. \qquad (7.59) $$


(b) To understand the meaning of this result, notice that, classically, the dot is but a capacitor 
connected via a classical resistor to a voltage source. Apply Kirchhoff’s laws to obtain the 
current-voltage characteristics of the classical system and compare with the result above. 
Answer:


(a) Consider the action of the system at an intermediate stage, immediately after the 
fermions have been integrated out (cf. Eq. (6.68)). In the presence of a bias voltage, the 
“tr In” contains a term i(V + U), where V is the Hubbard-Stratonovich field decoupling 
the interaction. Removing the sum of these two fields by the same gauge transformation 
as used in the derivation of the dissipative action, the tunneling matrix elements acquire 
the phase factor specified above. 

At second order in the expansion in $\phi$, the action assumes the form

s< %. VI = iWW + E 


where the Fourier identity $|\omega_m| \leftrightarrow -\pi T \sin^{-2}(\pi T\tau)$ has been used. Differentiating twice

with respect to U, we obtain 


(A'oo); 


z -i T 


su m su 


-Z[U\ = - 


2-k\ul 


'> m) 


27t|o. 


9t \Um 
2 ttT 


1 


1 +2E C 


The 00-element of the linear response tensor describes changes in the particle number32
in response to an applied potential, $\delta N_m = (K_{00})_m U_m$. Noting that $i\partial_\tau \delta N = I$ is
the current through the barrier, and substituting our result for $K_{00}$, we obtain
$I_m = \big[2\pi/g_T + 2E_C/|\omega_m|\big]^{-1} U_m$. Analytical continuation from positive imaginary
frequencies, $|\omega_m| \to -i\omega + 0$, to real retarded frequencies gives Eq. (7.59).

(b) According to Kirchhoff's laws, the sum of all voltage drops in the system must equal
zero, $0 = U(\omega) + C^{-1}\delta N(\omega) - RI(\omega)$, where $C$ and $R$ denote the capacitance and
resistance, respectively. Using the fact that $\delta N = I/i\omega$, and solving for $I$, we readily
obtain $I(\omega) = U(\omega)/\big(R - (i\omega C)^{-1}\big)$, i.e. the current-voltage characteristics of a classical
RC-circuit.

Identifications $R = 2\pi/g_T$ and $C = 1/2E_C$ bring us back to the formula derived
microscopically above.
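
The two limits of the RC characteristics are instructive: at low frequencies the capacitor dominates and $I \simeq -i\omega C\,U$, while at high frequencies the circuit is purely resistive, $I \simeq U/R$. The following sketch evaluates the Kirchhoff result under the identifications quoted above; the numerical values of the tunnel conductance `g_t` and the charging energy `e_c` are arbitrary illustrative choices.

```python
import numpy as np


def rc_current(omega, voltage, resistance, capacitance):
    """Current of a series RC circuit, I = U / (R - 1/(i omega C))."""
    return voltage / (resistance - 1.0 / (1j * omega * capacitance))


g_t, e_c = 4.0, 0.25                   # illustrative conductance and charging energy
resistance = 2.0 * np.pi / g_t         # R = 2 pi / g_T
capacitance = 1.0 / (2.0 * e_c)        # C = 1 / (2 E_C)

low = rc_current(1e-6, 1.0, resistance, capacitance)    # capacitive regime
high = rc_current(1e6, 1.0, resistance, capacitance)    # resistive regime
print(low, high)
```

The crossover between the two regimes occurs at $\omega \sim 1/RC = g_T E_C/\pi$.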


7.6.4 Hall conductivity 

In this problem we derive microscopic expressions for the current response of a (generally disordered) 
electron gas in the presence of a perpendicular magnetic field. Our results will be the starting point for 
the analysis of the quantum Hall effect in the next chapter. 

32 Here, we notice that the factor of “volume” discriminating between particle number and density is contained
in our definition of the source variables.

Consider a two-dimensional electron gas in the presence of a perpendicular magnetic field.
The system is described by the Hamiltonian $\hat H = \frac{1}{2m}(\hat p - A)^2 + V$, where $V$ is a one-particle
potential (disorder and/or boundary potential) and $A = \frac{B}{2}\, r \times e_3$ is the vector potential
of the external field. To compute the conductivity tensor, we start from the real-space33
expression (cf. Eq. (7.47) and (7.53)),

<?ij = J d 2 x d 2 x' x, )(Po(x)) + (j m ,i(x)j„ mj (xO)^ | 

where $\hat j_i$ is defined in Eq. (7.46).34 Using Wick's theorem to compute the expectation value
of the current and density operators, we obtain

$$ \sigma_{ij} = \frac{T}{\omega_m L^2}\sum_n \int d^2x\, d^2x' \left[ \frac{\delta_{ij}}{m}\,\delta(x - x')\,G_n(x, x) + (v_i G_{n+m})(x, x')\,(v_j G_n)(x', x) \right], $$

where $v = \frac{1}{m}(-i\nabla - A)$ is the velocity operator and we have set the electron charge to
unity. Anticipating the presence of cuts on the real energy axis, the frequency summation
is split into three parts, $\sigma = \sigma^{++} + \sigma^{+-} + \sigma^{--}$, where $\sigma^{\pm\pm}$ denotes the contribution for
which the frequency indices $n + m$ and $n$ are positive or negative, respectively.

(a) Beginning with the diagonal contribution to the conductivity tensor, $\sigma_{xx}$, show that
the paramagnetic contributions to $\sigma_{xx}^{++}$ and $\sigma_{xx}^{--}$ cancel against the diamagnetic term.
(Hint: Use the fact that for $n \cdot (n + m) > 0$ the product of two Green functions is short-range:
$\langle G_{n+m} G_n\rangle_V \simeq \langle G_{n+m}\rangle_V \langle G_n\rangle_V$, where, as usual, $\langle G\rangle_V$ decays on the scale of
the mean free path (i.e. unlike the combination $G_{n<0}\,G_{n+m>0}$, no long-range quantum
interference processes contribute to the product of two Green functions of identical
signature). Due to this short-rangeness, we may approximate the position operator
entering the definition of the current operator by a c-number valued vector whose value is
set by the spatial arguments $x \simeq x'$ of $G(x, x')$. Consequently, both $G$ and $j$ are diagonalized
by the basis of momentum states.)

(b) Owing to the vanishing of $\sigma^{++}$ and $\sigma^{--}$, the longitudinal contribution to the
conductance tensor $\sigma_{xx} = \sigma_{xx}^{+-}$ is obtained by summation over the sector of mixed frequency
signature. Show that

$$ \sigma_{xx} = \frac{1}{2\pi L^2}\int d^2x\, d^2x'\, (v_x G^+)(x, x')\,(v_x G^-)(x', x), \qquad (7.60) $$

where $G^\pm = (\mu \pm i/2\tau - \hat H)^{-1}$.

(c) Turning to the Hall coefficients, we first note that (think why) $\sigma_{xy} = -\sigma_{yx}$ or
$\sigma_{xy} = \frac{1}{2}\epsilon_{ij}\sigma_{ij}$. Using the simplifications listed in (a) and the identity $v = i[\hat H, x]$, show that
$\sigma_{xy}^{++} + \sigma_{xy}^{--} \simeq \frac{1}{L^2}\,\partial_B\langle\hat N\rangle$, where $\hat N$ is the particle number operator. Convince yourself
that the region of mixed frequency signature contributes as in the longitudinal case, i.e.
that the total linear response Hall conductivity is given by

$$ \sigma_{xy} = \frac{1}{2\pi L^2}\int d^2x\, d^2x'\, (v_x G^+)(x, x')\,(v_y G^-)(x', x) + \frac{1}{L^2}\,\partial_B\langle\hat N\rangle. \qquad (7.61) $$

33 In view of the coordinate dependence of the vector potential, it is preferable to work in a real-space basis.

34 Do not confuse the vector potential of the external magnetic field with the weak vector potential employed
to probe the electromagnetic response of the system; the current operators in the conductivity formula above
contain the former but not the latter.

(d) In Chapter 9, we apply the linear response formulae derived above to a field-theoretical 
analysis of the quantum Hall effect, the physics of a two-dimensional electron gas subject 
to a strong perpendicular field. Our starting point will be the replicated partition sum 
$Z = \int D(\bar\psi, \psi)\, \exp(-S[\bar\psi, \psi])$, where

S[ip,^\ = J d 2 x Vi(x) jhfo -3 - fi+ ^(-*V ~ A ) 2 + ^( x )j V>( x )> 


and $\psi = \{\psi^a_r\}$ is a $2R$-component field carrying a replica index $r = 1, \ldots, R$ and an
index $a = +, -$ distinguishing between “retarded” and “advanced” field components.
The Pauli matrix $\sigma_3$ acts in the space of $a$-indices, thus determining the imaginary
offset of the inverse propagator in the action. To generate the linear response formulae
derived above, we couple the action to a source: $A \to A + U^{-1}\nabla U$, where the matrix
$U = \exp\big(i(x\kappa_x\sigma_1 + y\kappa_y\sigma_2) \otimes P^{(1)}\big)$ and $P^{(1)} = \{\delta_{r1}\delta_{r'1}\}$ projects onto the first replica
channel. Show that


. = lim dl 
R-t 0 47rT 2 x x 


i = lim 

R-^o AmL-* ‘ 


(Hint: You will encounter terms $\sim \langle\bar\psi\psi\rangle \sim \mathrm{Re}\, G^+$, which may be neglected.)


Answer: 


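(a) As with the previous analysis of the non-magnetic case,

$$ \sigma_{xx}^{++} = \frac{T}{\omega_m L^2}\sum_{n>0}\int d^2x\, d^2x' \left[ \frac{1}{m}\,\delta(x - x')\,G_n(x, x) + (v_x G_{n+m})(x, x')\,(v_x G_n)(x', x) \right] $$
$$ \simeq \frac{T}{\omega_m L^2 (2\pi)^2}\int d^2x\, d^2p \sum_{n>0}\left( \frac{1}{m}\,G^+(p, x) + v_x^2(p, x)\,\big(G^+(p, x)\big)^2 \right), $$

where we made use of the simplifying approximations above, $v(p, x) = (p - A(x))/m$
(both $p$ and $x$ are vectors of c-numbers) and $G^+ = (i\omega_n - mv^2/2 + \mu + i/2\tau)^{-1}$. We
may now shift $p \to p + A$ to remove the spatial dependence of the integrand, use the
identity $(G^+)^2 = (m/p_x)\,\partial_{p_x} G^+$, and integrate by parts to conclude that $\sigma_{xx}^{++} = \sigma_{xx}^{--} = 0$.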
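
The cancellation in (a) reflects the fact that, after the shift $p \to p + A$, the integrand is a total derivative, $\frac{1}{m}G + (p_x/m)^2 G^2 = \partial_{p_x}\big[(p_x/m)\,G\big]$ with $G = (z - p^2/2m + \mu)^{-1}$. A minimal numerical sketch: the complex number `z` plays the role of $i\omega_n + i/2\tau$, and all parameter values are illustrative. On a finite momentum interval the integral must reproduce the boundary term, which vanishes as the cutoff is removed.

```python
import numpy as np


def green(px, py, z, mass, mu):
    """G = (z - p^2/(2m) + mu)^(-1) for complex z."""
    return 1.0 / (z - (px**2 + py**2) / (2.0 * mass) + mu)


def integrand(px, py, z, mass, mu):
    """Diamagnetic plus paramagnetic term: G/m + (p_x/m)^2 G^2."""
    g = green(px, py, z, mass, mu)
    return g / mass + (px / mass) ** 2 * g**2


mass, mu, py = 1.0, 1.0, 0.3           # illustrative values
z = 0.5j                               # stands for i*omega_n + i/(2 tau)
cutoff = 200.0

px = np.linspace(-cutoff, cutoff, 400_001)
f = integrand(px, py, z, mass, mu)
integral = complex(np.sum(0.5 * (f[1:] + f[:-1]) * np.diff(px)))

# Boundary term of the total derivative d/dp_x [(p_x/m) G].
boundary = ((px[-1] / mass) * green(px[-1], py, z, mass, mu)
            - (px[0] / mass) * green(px[0], py, z, mass, mu))
print(integral, boundary)
```

The boundary term decays as $-4/\text{cutoff}$, so the integral over all momenta is zero.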
(b) It is an instructive exercise to obtain the result by contour integration. Introduce a
contour that, avoiding the cuts on the real axis, runs around the frequency interval
$n(n + m) < 0$ and expand the product of Green functions to leading order in $i\omega_m$. Then
take the DC limit. A simplified, yet less rigorous, derivation goes as follows. For low
frequencies, $\omega_m \ll 1/\tau$, $G_{n+m>0} \simeq G^+$ and $G_{n<0} \simeq G^-$ are approximately independent
of the frequency index. Noting that $\omega_m/(2\pi T)$ terms contribute to the frequency sum,
we arrive at Eq. (7.60).
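
The counting argument can be confirmed by enumeration. With fermionic frequencies $\omega_n = (2n + 1)\pi T$ and a bosonic shift $\omega_m = 2\pi m T$, exactly $m = \omega_m/(2\pi T)$ values of $n$ have $\omega_n$ and $\omega_{n+m}$ of opposite sign. This is a small sketch; frequencies are expressed in units of $\pi T$ and the search range `n_max` is an arbitrary illustrative bound.

```python
def mixed_signature_count(m, n_max=10_000):
    """Count n with omega_n * omega_{n+m} < 0, frequencies in units of pi*T."""
    count = 0
    for n in range(-n_max, n_max):
        w_n = 2 * n + 1                # fermionic frequency index n
        w_nm = 2 * (n + m) + 1         # fermionic frequency index n + m
        if w_n * w_nm < 0:
            count += 1
    return count


counts = {m: mixed_signature_count(m) for m in range(0, 6)}
print(counts)
```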
(c) Representing the frequency sums by a contour integral,


- a., x = — 


d 2 x d 2 x' 


4 

x ((t>iG)(e + iu„ „x, x , )(DjG ! )(e + iO,x', x) 
— (DjG) x(e — *0,x, x.')(vjG)(e — iuj m ,x',x) 


We next (i) use the identity $v = -i[G^{-1}, x]$, (ii) expand the Green functions to leading
order in $i\omega_m$, and notice that the zeroth-order term in the expansion vanishes. This
leads to the expansion

1 


’ + cr 7l . 


An tL 2 


de n F (e)eij tr [x iG(e + iO)i)jG(e + *0) — c.c.J 
^2 J de n F (e)d e tr (d(e + iO)d B H - c.c.) 


2ttL 2 

— jji I de n F (e)d e tr [S(e - H)d B H) 


= J2 9b J de M e ) tr “ H)) = tf d B{N), 

where in the second line we have used the approximate commutativity of the operators 
under the trace and the identity 8 B H = | t^ViXj . 

(d) To simplify the notation, we will discuss the longitudinal and Hall conductances
separately. To generate the longitudinal conductance, we need to expand the generating
functional to second order in $\kappa_x$. Setting $\kappa_y = 0$, we obtain $U^{-1}\nabla U = iP^{(1)} \otimes \sigma_1\, e_x \kappa_x$ and

$$ S[\bar\psi, \psi] = S[\bar\psi, \psi]\big|_{\kappa=0} - i\kappa_x \int d^2x\; \bar\psi^1 \sigma_1 v_x \psi^1 - \frac{\kappa_x^2}{2m}\int d^2x\; \bar\psi^1\psi^1. $$


Twofold differentiation with respect to k x then generates the expectation value 


lim —8 2 K 
R->0 L 2 - 


Z = lim — 
„ R-s-0 L 2 


d^x^aiVxip 1 ) +2 / d^xif) 1 ^ 


J • 


J d 2 x d 2 x' (u x G + )(x, x.')(v x G )(x', x) = 47 tct xx , 


where in the second line we have noted that $\langle\bar\psi P^{(1)}\psi\rangle \sim G^+ + G^-$ generates the real
part of the Green function which we decided to omit. Turning to the Hall conductivity,
an expansion up to order $\kappa_x\kappa_y$ gives

$$ U^{-1}\nabla U = iP^{(1)} \otimes \big(\kappa_x\sigma_1 e_x + \kappa_y\sigma_2 e_y + \kappa_x\kappa_y\sigma_3\,(x e_y - y e_x)\big). $$

Substituting this expression into the action, we obtain

$$ S[\bar\psi, \psi] = S[\bar\psi, \psi]\big|_{\kappa=0} - i\int d^2x\; \bar\psi^1(\sigma_1\kappa_x v_x + \sigma_2\kappa_y v_y)\psi^1
 - i\,\frac{\kappa_x\kappa_y}{2}\int d^2x\; \bar\psi^1\sigma_3\big(\{v_y, x\} - \{v_x, y\}\big)\psi^1. $$
Then applying the derivative, we obtain

lim d: 2 Z = lim / d 2 x ^criVxip 1 f d 2 x' ^a^Vyip 1 

R ^o L 2 KxKv \ K=0 mL 2 J J 

I"* f d 2 x d 2 x' eij(viG + )(x,x')(vjG~)(x' ,x) 


1 

L2 


—4 J d 2 x CijViXj Im G + (x, x) = 47ricr a; j / . 
The method of the renormalization group (RG) provides theorists with powerful and efficient tools to
explore interacting theories, often in regimes where perturbation theory fails. Motivating our
discussion with two introductory examples drawn from a classical and a quantum theory, we first become
acquainted with the RG as a concept whereby nonlinear theories can be analyzed beyond the level of 
plain perturbation theory. With this background, we then proceed to discuss the idea and practice of 
RG methods in more rigorous and general terms, introducing the notion of scaling, dimensional anal- 
ysis, and the connection to the general theory of phase transitions and critical phenomena. Finally, to 
conclude this chapter, we visit a number of concrete implementations of the RG program introduced 
and exemplified on a number of canonical applications. 


In Chapter 5, $\phi^4$-theory was introduced as an archetypal model of interacting continuum
theories. Motivated by the existence of nonlinearities inherent in the model, a full perturbative
scheme was developed, namely Wick contractions and their diagrammatic implementation. 
However, from a critical perspective, one may say that such perturbative approaches present 
only a limited understanding. Firstly, the validity of the $\phi^4$-action as a useful model theory
was left unjustified, i.e. the $\phi^4$-continuum description was obtained as a gradient expansion
of, in that case, a d-dimensional Ising model. But what controls the validity of the
low-order expansion? Indeed, the same question could be applied to any one of the many 
continuum approximations we have performed throughout the first chapters of the text. 
Secondly, having identified a number of terms contributing to the perturbative expansion of 
the model (i.e. represented them in terms of momentum integrals over the non-interacting 
Green function), it rapidly became apparent that the expansion was seemingly uncontrolled: 
Those terms that were constructed explicitly (cf., e.g., Eq. (5.17) and the equation before) 
contained divergences at large momenta. We also saw that, in the transition region between 
the ferro- and the paramagnetic phase of the $\phi^4$-theory, these integrals were prone to the
build-up of infrared divergences. Worse still, we had to concede that we had no clue as
to how to overcome these problems! Needless to say, such difficulties exemplified by the
$\phi^4$-model are endemic in field theory.

To better understand the origin and remedy of the problems identified above, we must 
develop some new ideas. To this end, it is helpful to remember that the central driving force 
of all of our efforts is the development of a better understanding of long-range characteristics. 
On the other hand, models such as $\phi^4$ exhibit fluctuations on all length-scales, while it
was the short-scale fluctuations that were responsible for the majority of difficulties. In
principle, we already know how to deal with situations of this kind: our aim should be
an effective theory of long-range fluctuations realized by integrating over all short-range 
fluctuations. Indeed, this program has been exemplified already on numerous examples 
where some slow field was coupled to rapidly fluctuating microscopic fields, which were 
subsequently integrated out. 

The problem with our current application is that the $\phi^4$-theory is unaware of any clear-cut
separation into “fast” and “slow” degrees of freedom. Rather, all fluctuations, ranging from the
shortest scales (of the order of some microscopic cut-off, $a$, limiting the applicability of the
theory) to the longest scales (of the order of, say, the system size, $L$) are treated on the same
footing. However, to nonetheless implement the scheme of “integrating over fast modes to generate
an effective action of the slow degrees of freedom,” one must declare artificially a certain length
scale $a^{(1)} \equiv ba > a$, as the scale separating “short-wavelength fluctuations” on
scales $[a, a^{(1)}]$ from “long-wavelength fluctuations” on scales $[a^{(1)}, L]$. Having done
so, one may proceed to integrate out the short-range fluctuations, thereby changing
the action of the long-range degrees of freedom (see figure). Since the short-range
action is by no means simpler than the action of the long-range degrees of freedom,
this step will likely involve some approximation. Indeed, the integration procedure
may lead to a number of conceivable scenarios. For example, it may corrupt the algebraic
structure of the long-wavelength action, leaving us with a theory fundamentally different
from the one with which we began. Alternatively, it may turn out that the effective action
of the slow degrees of freedom is structurally similar to the original, in which case the
entire effect of integrating over the fast fluctuations amounts to a changed set of coupling
constants.

If the latter, the procedure is well motivated: we have arrived at a theory identical to the 
original but for (a) a different, or renormalized, set of coupling constants, and (b) an 
increased short-distance cut-off $a \to a^{(1)} = ba$. Evidently, one may then iterate this 
procedure; that is, declare a new cut-off $a^{(2)} = ba^{(1)} = b^2 a$, integrate out fluctuations 
on length scales $[a^{(1)}, a^{(2)}]$, etc. Along with the recursive integration of more “layers 
of fluctuations,” the coupling constants of the theory change, or “flow,” until the cut-off 
$a^{(n)} \sim L$ has become comparable to the length scales in which we are interested. Remarkably, 
we will see that the renormalized set of coupling constants in fact encodes much of 
the important information about the long-range behavior of the theory. 

The general line of reasoning above summarizes much of the thinking behind the renormalization 
group. Of course, the approach would be quite useless had we to perform each 
and every recursion step $a^{(n-1)} \to a^{(n)}$ explicitly. However, the utility of the program relies 
on the recursive reproduction of the model at each step: a single step already encodes all 
the information about the renormalization properties of the model. Cast in abstract terms, 
the conceptual idea may sound obscure. Therefore, before developing a general and formal 
framework for the renormalization group scheme, we shall acquire some intuition for the 
approach by focusing on two specific applications which encapsulate the basic concepts. 

Kenneth G. Wilson, 1936- 
Recipient of the 1982 Nobel Prize in Physics, awarded for “discoveries he made in understanding 
how bulk matter undergoes phase transitions, i.e., sudden and profound structural changes 
resulting from variations in environmental conditions.” Wilson’s background ranges from 
elementary particle theory and condensed matter physics (critical phenomena and the Kondo 
problem) to quantum chemistry and computer science. (Image © The Nobel Foundation.) 

INFO The formulation of renormalization group ideas has a long and varied history, 
reflecting, in part, the versatility and generality of the method. 1 Indeed, the advent of these 
ideas in the late 1960s and early 1970s marked the transition between two different “epochs.” 
While hitherto researchers had focused primarily on the development and application of ever 
more-sophisticated perturbative techniques, the seventies remained under the spell of the renor- 
malization program. 

In the second half of the 1960s, ideas to recursively generate flows of coupling constants arose - 
apparently in independent developments - both in condensed matter and in particle physics. 
However, it took the insight of Kenneth Wilson to realize the full potential of the approach 
and to develop it into a widely applicable tool. 2 It is probably fair to say that Wilson’s original 
formulation of the approach, and its later extension by others, led to revolutionary progress in 
condensed matter physics, particle physics, and general statistical mechanics. Perhaps no less 
important, the RG concept turned out to be one of the major driving forces behind the partial 
unification of these fields. 

In fact, the (highly unfortunate) terminology of “renormalization group” discloses much of 
the historical origin of the approach. A widespread doctrine of late 1960s particle physics had 
been that, on a fundamental level, our world could be understood in terms of symmetries and 
their implementation through groups - the eight-fold way. In an attempt to absorb the newly 
developed RG approach into this general framework, it became dubbed the renormalization 
group. Of course, a linkage between group structures and renormalization methods would not 
have been drawn had it been utterly unjustified. Indeed, one may argue that the sequence of RG 
transformations outlined above defines the structure of a semigroup. 3 However, the connection 
between RG transformation and group algebraic structures is not only highly formal but also 
counterproductive. (It suggests a conceptual bond that simply is not there.) Besides, the group 
interpretation of the RG transformation is completely useless with regard to practical aspects. 

To introduce the conceptual foundation of the renormalization group, we find it helpful 
to draw initially on classical theories, later turning our attention to addressing the effects 
of quantum fluctuations. Our first encounter with the renormalization group is, therefore, 
focused on the one-dimensional Ising model. 

1 A perspective on the development of the renormalization group can be found in the review article by M. E. 
Fisher, Renormalization group theory: its basis and formulation in statistical physics, Rev. Mod. Phys. 70 (1998), 
653-81, or in the text by J. Cardy, Scaling and Renormalization in Statistical Physics (Cambridge University 
Press, 1996). 

2 See K. Wilson, The renormalization group: critical phenomena and the Kondo problem, Rev. Mod. Phys. 47 
(1975), 773-840, still one of the best introductions to the approach! 

3 To this end, one should interpret an individual RG transformation as an abstract mapping between two actions: 
$S \xrightarrow{R} S'$, where $S$ is the original action, $S'$ the action with increased short-distance cut-off, and $R$ the mapping 
between them. One may then notice that these transformations can be iterated, or “multiplied,” $R \circ R'$. A 
moment’s thought shows that the composition law obeys the defining conditions of a semigroup: there is a unit 
transformation (nothing is integrated out) and the iteration is associative. (We are dealing with a semigroup 
because the RG transformation is irreversible, i.e. it does not have an inverse.) 


8.1 The one-dimensional Ising model 

To illustrate the implementation of the RG program on a simple example (indeed the 
simplest of examples), let us consider the one-dimensional Ising model defined through the 
classical microscopic Hamiltonian 

$$H = -J\sum_{i=1}^{N} S_i S_{i+1} - H_{\rm ext}\sum_{i=1}^{N} S_i,$$

where $S_i = \pm 1$ denotes the (uniaxial) magnetization or spin of site $i$ (periodic boundary 
conditions, $S_{N+1} = S_1$, imposed), and $H_{\rm ext}$ represents an external field. Throughout this 
section we will adhere to the discrete spin representation of the model, i.e. in contrast to 
Section 5.1, we will not map the model onto a continuum theory. However, this 
does not impede our discussion of the RG program: one may simply regard $-\beta H[S]$ as a 
“functional” of the discrete “field” $\{S_i\}$, and all steps outlined above can be carried out 
without substantial modification. 


8.1.1 Exact solution 

The feature of the one-dimensional Ising model which makes it of pedagogical interest in the 
present context is that it admits an exact solution, i.e. all of its macroscopically observable 
properties can be computed in closed form. Both the exact solution of the model and its 
RG formulation rely on its transfer matrix representation. One may notice that the 
Boltzmann weight of the system can be factorized according to the relation 

$$e^{-\beta H} = \exp\left[\sum_{i=1}^{N}\left(K S_i S_{i+1} + h S_i\right)\right] = \prod_{i=1}^{N} T(S_i, S_{i+1}),$$

where, for notational convenience, we have introduced the dimensionless parameters 
$K \equiv \beta J > 0$ and $h = \beta H_{\rm ext}$, and the weight is defined through the relation 
$T(S,S') = \exp[K S S' + \frac{h}{2}(S + S')]$. Defining a two-component matrix $T$ with elements $T_{11} = T(1,1)$, 
$T_{12} = T(1,-1)$, $T_{21} = T(-1,1)$, and $T_{22} = T(-1,-1)$, one may confirm that the partition 
function of the system can be written as 

$$Z = \sum_{\{S_i\}} e^{-\beta H} = \sum_{\{S_i\}} \prod_{i=1}^{N} T(S_i, S_{i+1}) = \sum_{\{n_i\}} \prod_{i=1}^{N} T_{n_i n_{i+1}} = \mathrm{tr}\, T^N .$$

Thus, we have managed to represent the partition function as a trace of the $N$th power 
of the two-dimensional “transfer” matrix $T$. 4 In this representation, the partition function 
may be presented in terms of the two eigenvalues, $\lambda_\pm = e^{K}\left[\cosh h \pm (\sinh^2 h + e^{-4K})^{1/2}\right]$, 
of the transfer matrix 

$$T = \begin{pmatrix} e^{K+h} & e^{-K} \\ e^{-K} & e^{K-h} \end{pmatrix}$$

as $Z = \mathrm{tr}\, T^N = \lambda_+^N + \lambda_-^N$. Noting that $\lambda_+ > \lambda_-$, one may see that, in the thermodynamic 
limit $N \to \infty$, the contribution of the latter may be neglected, so that $Z \simeq \lambda_+^N$. Restoring 
the original microscopic parameters, in the thermodynamic limit, one obtains the free energy 

$$F = -\frac{1}{\beta}\ln Z = -N\left(J + T\ln\left[\cosh(\beta H_{\rm ext}) + \sqrt{\sinh^2(\beta H_{\rm ext}) + e^{-4\beta J}}\right]\right). \tag{8.1}$$

With this result, one can compute the magnetization $M$ by differentiation with respect to 
the magnetic field. As a result one obtains the magnetization per spin, 

$$\frac{M}{N} = \frac{\sinh(\beta H_{\rm ext})}{\sqrt{\sinh^2(\beta H_{\rm ext}) + e^{-4\beta J}}}. \tag{8.2}$$

Two magnetization curves are shown in the figure at two non-zero temperatures. 
Notice that, while the magnetization curves grow ever more steeply with decreasing 
temperature, the system does not magnetize at any non-zero temperature in the 
absence of an external field, i.e., unlike its higher-dimensional descendants, the 
one-dimensional model does not display spontaneous symmetry breaking. 

4 The terminology “transfer matrix” originates in an interpretation of the Ising model as a fictitious dynamical 
process in which a state $S_i$ is “transferred” to a state $S_{i+1}$, where the transition amplitude is given by 
$T(S_i, S_{i+1})$. 

Figure 8.1 (a) On the absence of spontaneous symmetry breaking in the one-dimensional Ising 
model. No matter how low the temperature, the energy cost associated with the creation of a 
segment of flipped spins is outweighed by the entropy gain. (b) In higher dimensions, entropic 
factors no longer have the capacity to overpower the extensive growth of energy associated with 
the formation of mismatched regions. 
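As a numerical cross-check of the transfer matrix representation and of Eq. (8.2), one may compare a brute-force sum over all $2^N$ configurations of a short periodic chain with $\mathrm{tr}\,T^N$, and the magnetization formula with a finite-difference derivative of $\ln\lambda_+$. The following is a minimal sketch only; the function names and the parameter values $K = 0.7$, $h = 0.3$, $N = 10$ are illustrative assumptions, not part of the derivation above.

```python
import itertools
import numpy as np

def transfer_matrix(K, h):
    """2x2 matrix with elements T(S, S') = exp[K S S' + h (S + S') / 2], S = +1, -1."""
    spins = np.array([1.0, -1.0])
    S, Sp = np.meshgrid(spins, spins, indexing="ij")
    return np.exp(K * S * Sp + 0.5 * h * (S + Sp))

def z_brute_force(K, h, N):
    """Sum the Boltzmann weight over all 2^N configurations of a periodic chain."""
    Z = 0.0
    for config in itertools.product([1, -1], repeat=N):
        s = np.array(config)
        Z += np.exp(np.sum(K * s * np.roll(s, -1) + h * s))
    return Z

def z_transfer(K, h, N):
    """Z = tr T^N = lambda_+^N + lambda_-^N."""
    lam = np.linalg.eigvalsh(transfer_matrix(K, h))
    return np.sum(lam ** N)

def ln_lambda_plus(K, h):
    """Logarithm of the larger eigenvalue of T."""
    return K + np.log(np.cosh(h) + np.sqrt(np.sinh(h) ** 2 + np.exp(-4.0 * K)))

def magnetization_per_spin(K, h):
    """Eq. (8.2) in dimensionless variables K = beta J, h = beta H_ext."""
    return np.sinh(h) / np.sqrt(np.sinh(h) ** 2 + np.exp(-4.0 * K))

K, h, N = 0.7, 0.3, 10
print("Z (brute force):", z_brute_force(K, h, N))
print("Z (tr T^N):     ", z_transfer(K, h, N))

# In the thermodynamic limit M/N = d(ln lambda_+)/dh; check by central differences.
eps = 1e-6
m_numeric = (ln_lambda_plus(K, h + eps) - ln_lambda_plus(K, h - eps)) / (2 * eps)
print("m (numeric):", m_numeric, " m (Eq. 8.2):", magnetization_per_spin(K, h))
```

The brute-force sum scales as $2^N$ and is meant for small $N$ only, whereas the transfer matrix evaluation costs the same for any chain length.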

INFO There are many ways to understand the absence of spontaneous symmetry breaking 
in the one-dimensional Ising model (cf. our discussion of the connection with quantum mechanical 
tunneling in Section 3.3). Suppose that the system did exhibit a critical temperature $T_c$ below 
which the system acquired long-range order with $\langle S_i \rangle = 1$ or $\langle S_i \rangle = -1$ (Fig. 8.1(a)). Now, 
let us imagine that a segment of $M$ consecutive spins were to flip. In doing so, they would incur 
an energetic cost of $\mathcal{O}(2J)$, i.e. the energy associated with the unfavorable spin alignment at 
the two ends of the domain of reversed spins. However, there are some $N$ different choices for 
placing the sector of $M$ flipped spins, i.e. the energy loss is counteracted by an entropic factor 
of $\sim T \ln N$ (exercise). Thus, no matter how small $T$ is, for systems of large enough extent 
the free energy balance favors domain creation, implying that the system will be in a 
disordered phase at any non-zero temperature. 

EXERCISE By enumerating the number of spin configurations with the same energy, obtain 
a formal expression for the classical partition function of a one-dimensional Ising model of 
length N with periodic boundary conditions. (Hint: Consider the number of domain wall 
configurations.) Making use of Stirling’s approximation $\ln n! \simeq n(\ln n - 1)$, determine the 
temperature dependence of the correlation length (domain size) at low temperatures. Confirm 
that the system does not order at any finite temperature. 

To what extent do these arguments carry over to Ising systems of higher dimensionality? 
Consider, for example, a two-dimensional variant of the model. Here the formation of a large 
connected region of $M$ mismatched spins incurs an energy cost $U \sim M^{1/2} J$. To understand why, 
one may note that the energy cost is proportional to the length of the one-dimensional boundary 
that encloses a (circular) domain of reversed spins. However, the entropic gain still scales as 
$\sim T \ln N$. (Allowing for less symmetric domains, the energy/entropy balance becomes more 
subtle. Yet the qualitative conclusion remains robust.) Thus, for the two-dimensional system, 
$F \sim J M^{1/2} - T \ln N$. From this one can conclude that, over the range of temperatures for 
which this estimate is valid, no matter how small $T$, it is energetically unfavorable to flip a 
thermodynamic $M \sim N^{x>0}$ number of spins. One may, therefore, conclude that the $(d \geq 2)$-dimensional 
Ising model does exhibit a phase transition into an ordered low-temperature phase. 
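The two energy/entropy estimates can be contrasted numerically. The sketch below simply evaluates the rough balances quoted in the text, $F \sim 2J - T\ln N$ in one dimension and $F \sim JM^{1/2} - T\ln N$ in two dimensions; the function name, the choice $M = N^{1/2}$, and the values $J = 1$, $T = 0.1$ are illustrative assumptions, and all prefactors of order unity are ignored.

```python
import math

def domain_free_energy(d, M, N, J, T):
    """Rough free energy balance F ~ U - T ln N for flipping one compact domain of M spins.

    d = 1: two domain walls, U ~ 2J, independent of M.
    d = 2: boundary length ~ M^(1/2), hence U ~ J M^(1/2).
    """
    if d == 1:
        U = 2.0 * J
    elif d == 2:
        U = J * math.sqrt(M)
    else:
        raise ValueError("only d = 1 and d = 2 are covered by this sketch")
    return U - T * math.log(N)

J, T = 1.0, 0.1
for N in (10**4, 10**8, 10**12):
    M = math.sqrt(N)  # a "thermodynamic" domain, M ~ N^x with x = 1/2
    print(N, domain_free_energy(1, M, N, J, T), domain_free_energy(2, M, N, J, T))
```

In one dimension the balance turns negative once $\ln N$ exceeds $2J/T$, however small $T$ is, whereas in two dimensions the energy cost of a thermodynamically large domain outgrows the logarithmic entropy term.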


The phenomenon of spontaneous symmetry breaking occurs only in systems of 
sufficiently large dimensionality. The threshold dimension below which entropic 
mechanisms exclude spontaneous symmetry breaking is called the lower critical 
dimension $d_c$. 


Our argument above indicates that the lower critical dimension of the Ising model and, more 
generally, 


The lower critical dimension of systems with discrete symmetries is $d_c = 1$. 


Pursuing this theme, let us complete the discussion by addressing the question of the lower critical 
dimension for systems with a continuous symmetry (e.g. the U(1) symmetry of the superfluid 
phase, the O(3) symmetry of the Heisenberg ferromagnet, etc.). To address this question, one 
may proceed in a manner analogous to that used in the Ising system, i.e. let us assume that 
a critical temperature exists below which the system is ordered. Using, for concreteness, the 
language of magnetic phenomena, we might say that the system with two-component spin degrees 
of freedom acquires a state of uniform magnetization $\mathbf{S}(\mathbf{r}) = S\mathbf{e}_1$, where $\mathbf{e}_1$ denotes the unit 
vector in the, say, 1-direction. Expanding the Hamiltonian $\beta H[\mathbf{S}]$ in fluctuations around this 
configuration (and, for simplicity, taking a continuum limit), one obtains 

$$S[\mathbf{S}] = \frac{cS^2}{2T}\int d^dr\,(\partial\theta)^2,$$

where $\theta(\mathbf{r})$ denotes the polar angle measuring fluctuations around the ordered state and $c$ is a 
coupling constant. One may now explore the thermal (or, if appropriate, quantum thermal) 
expectation value $\langle S_1(\mathbf{r})\rangle$ of the 1-component of the spin variable anywhere in the system. Assuming 
that we are close to the ordered state, an expansion in $\theta$ gives $S_1(\mathbf{r}) = S\cos\theta = S(1 - \frac{1}{2}\theta^2 + \cdots)$. 
We now proceed to check whether the close-to-ordered assumption was actually legitimate. For 
this, we have to verify that, with $\langle S_1(\mathbf{r})\rangle = S - \frac{S}{2}\langle\theta^2(\mathbf{r})\rangle + \cdots$, the contribution associated with 
fluctuations is much smaller than the leading-order constant. Performing the Wick contraction 
with respect to the quadratic action, and switching to momentum space, one obtains 

$$\langle S_1(\mathbf{r})\rangle \approx S\left(1 - \frac{T}{2cS^2}\frac{1}{L^d}\sum_{\mathbf{q}}\frac{1}{q^2}\right) \to S\left(1 - \frac{T}{2cS^2}\int\frac{d^dq}{(2\pi)^d}\frac{1}{q^2}\right).$$

Crucially, in dimensions $d \leq 2$, the integral is divergent. In the marginal case $d = 2$, 
$\int_{L^{-1}}^{a^{-1}}\frac{d^2q}{(2\pi)^2}\frac{1}{q^2} = \frac{1}{2\pi}\ln(L/a)$, where we have used the fact that the momentum integral should be limited by a short- 
(long-) wavelength cutoff of the order of the inverse lattice spacing (the system size). In the 
thermodynamic limit, $L \to \infty$, the integral grows without bound, implying that the assumption 
of an ordered state was ill-founded no matter how small the temperature; the system is in a 
disordered state. Noting that nowhere did we rely on specifics of the spin system, we draw a 
conclusion known as the Mermin-Wagner theorem: 


The lower critical dimension of systems with broken continuous symmetries is $d_c = 2$. 


Obviously, the divergence of the fluctuation integral is due to the fact that, for large wavelengths, 
and in low dimensions, the integration volume (alias the entropy) of fluctuations $\sim q^d$ scales 
more slowly to zero than the energy cost $\sim q^2$. That is, as in the Ising model case, the Mermin-Wagner 
theorem 5 can be understood as the statement of a competition of energy and entropy. 6 
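The dependence of the fluctuation integral on dimensionality can be made explicit with a short calculation. The sketch below evaluates $\int d^dq\,(2\pi)^{-d}\,q^{-2}$ over the shell $L^{-1} < |q| < a^{-1}$ in closed form for $d = 1, 2, 3$. The sharp spherical cutoffs, the measure $d^dq/(2\pi)^d$, and the function name are assumptions made for illustration.

```python
import math

def fluctuation_integral(d, L, a=1.0):
    """Integral of 1/q^2 with measure d^dq/(2 pi)^d over 1/L < |q| < 1/a (closed form)."""
    q_min, q_max = 1.0 / L, 1.0 / a
    if d == 1:
        # Two half-lines, each contributing int dq q^-2 / (2 pi).
        return (2.0 / (2.0 * math.pi)) * (1.0 / q_min - 1.0 / q_max)
    if d == 2:
        # Angular integral gives 2 pi, leaving int dq / q.
        return math.log(q_max / q_min) / (2.0 * math.pi)
    if d == 3:
        # Solid angle 4 pi, and q^2 dq / q^2 = dq.
        return 4.0 * math.pi * (q_max - q_min) / (2.0 * math.pi) ** 3
    raise ValueError("only d = 1, 2, 3 are covered by this sketch")

for L in (1e2, 1e4, 1e6):
    print(L, [round(fluctuation_integral(d, L), 4) for d in (1, 2, 3)])
```

With increasing system size the result grows linearly for $d = 1$ and logarithmically for $d = 2$, while for $d = 3$ it saturates at $1/(2\pi^2 a)$, in line with a lower critical dimension of two.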


8.1.2 Elements of scaling theory 

Returning to the one-dimensional Ising model, we may note that Eq. (8.2) represents a full 
solution of the problem. We have explicitly obtained the magnetization as a function of the 
magnetic field and the microscopic coupling constant of the model. Other thermodynamic 
characteristics, such as the magnetic susceptibility $\chi = -\partial_H^2 F$, can be generated by further 
differentiation with respect to $H$ and/or $T$. (Here, and henceforth, we drop the subscript 
“ext” from the definition of the external field, $H_{\rm ext}$, noting that it can always be discriminated 
from the Hamiltonian $H$ by its context.) However, in the vast majority of physically 
interesting problems we will not be in possession of a closed analytical solution. This means 
that, before comparing the exact solution with the outcome of the RG program, we should 
reformulate the former in a universally applicable language, i.e. a code that can be used to 
characterize the behavior of a model irrespective of the particular method by which this 
behavior has been analyzed. 

5 N. D. Mermin and H. Wagner, Absence of ferromagnetism or antiferromagnetism in one- or two-dimensional 
isotropic Heisenberg models, Phys. Rev. Lett. 17 (1966), 1133-6. 

6 For completeness, we mention that the proof of the theorem (of which we gave a fairly abridged version here) in 
a subtle way relies on the fact that the symmetry in question is compact. For the (rare) class of systems with 
non-compact, e.g. hyperbolic, symmetries, the statement does not hold. 

This objective leads us back to the familiar notion of correlation functions. All models 
of physical interest display non-trivial fluctuation behavior on large length scales. Indeed, 
in the vicinity of critical points marking the position of continuous phase transitions, one 
expects the onset of critical fluctuations, i.e. the accumulation of fluctuations on all length 
scales. (For a precise formulation of the terminology, see the section on phase transitions and 
critical phenomena below.) One of the central working hypotheses of the theory of “critical 
phenomena” is that, in the vicinity of the critical points there is only one length scale of 
physical relevance, the correlation length $\xi$. Within the context of the Ising model (cf. 
our previous discussion on page 199), $\xi$ is defined as the decay length of the correlation 
function, 

$$C(\mathbf{r}_1 - \mathbf{r}_2) \equiv \langle S(\mathbf{r}_1)S(\mathbf{r}_2)\rangle - \langle S(\mathbf{r}_1)\rangle\langle S(\mathbf{r}_2)\rangle \sim \exp\left(-\frac{|\mathbf{r}_1 - \mathbf{r}_2|}{\xi}\right),$$

where we have switched to a continuum notation $S_i \to S(\mathbf{r})$, to emphasize the large-distance 
character of $\xi$. To relate this quantity to the thermodynamic characteristics, one may note 
that the magnetic susceptibility is given by 

$$\chi = -\partial_H^2 F\big|_{H=0} = T\,\partial_H^2 \ln Z\big|_{H=0} = \beta\int d^dr\,d^dr'\left(\langle S(\mathbf{r})S(\mathbf{r}')\rangle - \langle S(\mathbf{r})\rangle\langle S(\mathbf{r}')\rangle\right). \tag{8.3}$$

INFO In fact, this is yet another manifestation of the fluctuation-dissipation theorem 
discussed in the previous chapter. A dissipative quantity (in this case, the susceptibility) is 
determined by the fluctuation behavior of the system. 

Comparison with the definition of the correlation function above identifies $\chi \sim \xi$, i.e., in the 
one-dimensional system, the magnetic susceptibility is directly proportional to the length 
scale determining the decay of correlations in the system. So far, we have not yet made use 
of the specific results obtained for the Ising model above. However, to actually determine 
the correlation length for our present example, we need only differentiate Eq. (8.2) once 
again with respect to $H$, to obtain 

$$\xi \sim \chi \sim \partial_H\big|_{H=0}\, m \sim e^{2\beta J}, \tag{8.4}$$

where $m = M/N$ is the magnetization per spin, i.e. the correlation length exponentially 
increases in the limit $T \to 0$ on a scale set by 
the microscopic “stiffness constant”, $J$. This result should not be too surprising: unlike its 
higher-dimensional counterparts, the one-dimensional Ising model does not display a finite-temperature 
phase transition between a ferro- and a paramagnetic phase. It is only in the 
limit $T \to 0$ that long-range correlations develop. 
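The exponential growth of both quantities is easily checked numerically. In the sketch below, the susceptibility per spin is obtained by differentiating the magnetization per spin of Eq. (8.2) with respect to the dimensionless field $h$. The correlation length is taken from the ratio of transfer matrix eigenvalues at $h = 0$, $\xi = 1/\ln(\lambda_+/\lambda_-)$ in units of the lattice spacing. This eigenvalue formula is a standard transfer matrix result that is not derived in the text above, and the function names are illustrative.

```python
import numpy as np

def correlation_length(K):
    """xi = 1 / ln(lambda_+ / lambda_-) at h = 0, where lambda_+- = 2 cosh K, 2 sinh K."""
    lam_plus, lam_minus = 2.0 * np.cosh(K), 2.0 * np.sinh(K)
    return 1.0 / np.log(lam_plus / lam_minus)

def susceptibility_per_spin(K, eps=1e-6):
    """d m / d h at h = 0, with m(h) from Eq. (8.2), by central differences."""
    def m(h):
        return np.sinh(h) / np.sqrt(np.sinh(h) ** 2 + np.exp(-4.0 * K))
    return (m(eps) - m(-eps)) / (2.0 * eps)

for K in (1.0, 2.0, 3.0):
    scale = np.exp(-2.0 * K)
    print(K, correlation_length(K) * scale, susceptibility_per_spin(K) * scale)
```

Once multiplied by $e^{-2K}$, both columns approach constants at large $K = \beta J$ (1/2 and 1, respectively), which is the content of the proportionality $\xi \sim \chi \sim e^{2\beta J}$.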

In the vicinity of a critical point, the diverging correlation length specifies the singular 
thermodynamic properties of the system. More specifically, all observables $X$ of dimensionality 
$[\text{length}]^{D_X}$ should obey the scaling form 

$$X \sim \xi^{D_X} g_X,$$

where $g_X$ is a dimensionless function. Let us explore this concept on our present example. 
The “reduced free energy,” 

$$f(T) \equiv \frac{F}{TL}, \tag{8.5}$$

has dimension $L^{-1}$. Noting that $N \sim L$, a straightforward low-temperature expansion of 
Eq. (8.1) indeed gives 

$$f(T) - f(0) = \xi^{-1}\left(-1 - \frac{1}{2}\xi^2 h^2\right) \equiv \xi^{-1} g(\xi h), \tag{8.6}$$

where we have subtracted the infinite but inessential constant $f(0)$ and assumed that 
$1 \gg \xi^{-1} \gg h$. (The scaling form above actually suggests that the magnetic field has dimension 
$L^{-1}$, a prediction substantiated below.) 

Figure 8.2 RG or block spin transformation of the Ising model illustrated for clusters of size $b = 2$. 
An $r$-fold iteration of the procedure reduces the degrees of freedom by a factor $2^r$. 
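The quality of the low-temperature expansion can be probed numerically. The sketch below compares the exact expression following from Eq. (8.1), with the constant $f(0) = -K$ subtracted and lengths measured in units of the lattice spacing, against the scaling form (8.6). The identification $\xi = e^{2K}$ (proportionality constant set to unity), the function names, and the parameter values are illustrative assumptions.

```python
import numpy as np

def f_singular_exact(K, h):
    """f - f(0) per site from Eq. (8.1), with f(0) = -K subtracted."""
    return -np.log(np.cosh(h) + np.sqrt(np.sinh(h) ** 2 + np.exp(-4.0 * K)))

def f_singular_scaling(K, h):
    """Scaling form (8.6): -(1/xi) (1 + xi^2 h^2 / 2), assuming xi = exp(2K)."""
    xi = np.exp(2.0 * K)
    return -(1.0 + 0.5 * (xi * h) ** 2) / xi

# Parameters chosen such that 1 >> 1/xi >> h.
for K, h in [(3.0, 1e-4), (4.0, 1e-5), (5.0, 1e-6)]:
    print(K, h, f_singular_exact(K, h), f_singular_scaling(K, h))
```

The two expressions differ only by relative corrections that are small in $\xi^{-1}$ and $\xi h$, consistent with the assumed hierarchy of scales.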

We have thus found that the correlation length of the one-dimensional Ising system 
diverges according to Eq. (8.4) upon approaching zero temperature, and that the free energy 
obeys the scaling law (8.6). Of course, this is only a fraction of the full information stored 
in the exact solution. However, the reduced set of data has the striking advantage that it 
is of general relevance. Indeed, we saw above that the correlation length is directly related 
to measurable properties of the system such as the magnetic susceptibility. Similarly, the 
output of experiments on systems with long-range correlations is commonly encoded in the 
language of scaling relations. In other words, we have extracted that part of the information 
contained in the exact solution that carries universal relevance and can be compared with 
the output of other approaches. 


8.1.3 Kadanoff’s block spin RG 

With this background we are in a position to explore some of the conceptual foundations of 
the RG on a model application. According to the general scheme outlined at the beginning 
of the chapter, our aim is to devise an algorithm to recursively trace out parts of the 
short-scale fluctuations of the system and assess their influence on the remaining degrees of 
freedom. 
Following the program outlined in Kadanoff’s seminal work on the foundations of the RG, 
we adopt a strategy whereby this renormalization step is effected by subdividing the spin 
chain into regular clusters of $b$ neighboring spins (see Fig. 8.2). We may then proceed 
to sum over the $2^b$ sub-configurations of each cluster, thereby generating an effective 
functional describing the inter-cluster energy balance. While it is clear that this energy 
functional exists, a far less obvious question to ask is whether it will again have the form 
of an effective Ising spin system. Remarkably, the answer is affirmative: the Ising model is 
said to be “renormalizable.” The structural reproduction of the model implies that we can 
think of each cluster as some kind of meta-Ising spin, or block spin. More importantly, it 
guarantees that the renormalization step qualifies for iteration: in a second RG step, $b$ block 
spins are grouped to form a new cluster (now comprising $b^2$ of the microscopic spins) which 
are then traced out, etc. We next discuss how this algorithm is implemented in concrete terms. 

Within the transfer matrix approach, a cluster of $b$ spins is represented through $b$ transfer 
matrices $T$. Taking the partial trace over its degrees of freedom amounts to passing from 
these $b$ matrices to the product $T' = T^b$. (By construction of the approach, the internal 
index summation involved in taking the product amounts to tracing out the degrees of 
freedom of the cluster.) The transition from the original partition function $Z$ to the new 
partition function $Z'$ is defined through the relation 

$$Z_N(K, h) = \mathrm{tr}\,T^N = \mathrm{tr}\,(T^b)^{N/b} = \mathrm{tr}\,(T')^{N/b} = Z_{N/b}(K', h'), \tag{8.7}$$

where the notation makes the parametric dependence of the partition function on the size of 
the system, $N$, and on the coupling constants $K$, $h$ explicit. Notice that the equation above 
makes the highly non-trivial assumption that the reduced trace, $\mathrm{tr}\,(T')^{N/b}$, can again be 
expressed as an Ising partition function or, equivalently, that the effective transfer matrix 
$T'$ has the same algebraic structure as the elementary matrices $T$. 

Leo P. Kadanoff, 1934- 
Theoretical physicist and applied mathematician who has contributed widely to research in the 
properties of matter, the development of urban areas, statistical models of physical systems, and 
the development of chaos in simple mechanical and fluid systems. He was instrumental in the 
development of the concepts of scale invariance and universality as they are applied to phase 
transitions. 

To confirm the integrity of the transformation, one may explore the structure of the 
product matrices $T'$ for the simplest case of $b = 2$ block spins. Introducing the abbreviations 
$u \equiv e^{-K}$, $v \equiv e^{-h}$, we have 

$$T = \begin{pmatrix} e^{K+h} & e^{-K} \\ e^{-K} & e^{K-h} \end{pmatrix} = \begin{pmatrix} u^{-1}v^{-1} & u \\ u & u^{-1}v \end{pmatrix},$$

while the product takes the form 

$$T' \equiv T^2 = \begin{pmatrix} u^2 + u^{-2}v^{-2} & v + v^{-1} \\ v + v^{-1} & u^2 + u^{-2}v^{2} \end{pmatrix} \overset{!}{=} C\begin{pmatrix} u'^{-1}v'^{-1} & u' \\ u' & u'^{-1}v' \end{pmatrix}.$$

In the last equality we require that the new transfer matrix be of the same structure as 
the original. However, noting that this requirement will introduce three conditions (for the 
three independent entries of the symmetric matrices $T$ and $T'$), we are willing to tolerate the 
appearance of an overall multiplicative constant $C$. 7 Having introduced this new parameter, 
we have enough freedom to solve the three equations, from which one finds (exercise) 

$$u' = \frac{\sqrt{v + v^{-1}}}{(u^4 + u^{-4} + v^2 + v^{-2})^{1/4}}, \qquad v' = \frac{\sqrt{u^4 + v^2}}{\sqrt{u^4 + v^{-2}}}, \tag{8.8}$$

and $C = \sqrt{v + v^{-1}}\,(u^4 + u^{-4} + v^2 + v^{-2})^{1/4}$. As a corollary, we remark that the possibility of 
representing the new transfer matrix in the same algebraic structure as the old one implies 
that the transformed model again describes an Ising spin system (namely the spin system 
whose transfer matrix would be given by $T'$). However, the Hamiltonian $\beta H$ of the new 
block spin system: 

▷ is defined at a different temperature, magnetic field and exchange constant (as described 
by the new values of the coupling constants $(u', v')$) and 
▷ describes fluctuations on length scales that are twice as large as in the original system. 
In particular, the short-distance cutoff has been doubled. 

Figure 8.3 (a) The flow of coupling constants of the one-dimensional Ising model generated 
by iteration of the RG transformation. The three lines shown are for starting values $(u, v) = 
(0.01, 0.9999)$, $(0.01, 0.999)$, and $(0.01, 0.99)$. (b) Magnification of the zero-temperature 
fixed point region. 
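The block spin step for $b = 2$ can be verified numerically: squaring the transfer matrix should reproduce $C$ times a transfer matrix of the same form with the renormalized couplings of Eq. (8.8). The following minimal sketch performs this check; the function names and the test values $u = 0.6$, $v = 0.8$ are illustrative assumptions.

```python
import numpy as np

def transfer_matrix_uv(u, v):
    """Transfer matrix in the variables u = exp(-K), v = exp(-h)."""
    return np.array([[1.0 / (u * v), u],
                     [u, v / u]])

def rg_step(u, v):
    """One b = 2 block spin step, Eq. (8.8). Returns (u', v', C)."""
    w = u**4 + u**-4 + v**2 + v**-2
    u_new = np.sqrt(v + 1.0 / v) / w**0.25
    v_new = np.sqrt(u**4 + v**2) / np.sqrt(u**4 + v**-2)
    C = np.sqrt(v + 1.0 / v) * w**0.25
    return u_new, v_new, C

u, v = 0.6, 0.8
T = transfer_matrix_uv(u, v)
u_new, v_new, C = rg_step(u, v)

# T^2 should equal C times the transfer matrix with renormalized couplings.
print(T @ T)
print(C * transfer_matrix_uv(u_new, v_new))
```

The same function also makes the fixed points visible: any point with $u = 1$ is mapped onto itself, as is the limit $u \to 0$ at $v = 1$.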

To make further progress, one may focus on the two relevant parameters $u'$ and $v'$ and 
observe that the result of the block spin transformation can be represented as a discrete 
map 

$$\begin{pmatrix} u' \\ v' \end{pmatrix} = \begin{pmatrix} f_1(u,v) \\ f_2(u,v) \end{pmatrix},$$

where the functions $f_{1,2}$ are defined through Eq. (8.8). In Fig. 8.3 sequences of points 
generated by an iterative application of the map $f$ are shown for different values of “initial 
conditions” $(u_0, v_0)$. It is evident from the RG trajectories that the map $f$ possesses two 
disjoint sets of fixed points, i.e. points $(u^*, v^*)$ that remain invariant under the application 
of the map $f$: 

$$\begin{pmatrix} u^* \\ v^* \end{pmatrix} = \begin{pmatrix} f_1(u^*,v^*) \\ f_2(u^*,v^*) \end{pmatrix}.$$

Inspection of Eq. (8.8) shows that this is the case for (a) the point $(u^*,v^*) = (0,1)$, and (b) 
the line $(u^*,v^*) = (1,v)$. 

7 Taking the product of the new transfer matrices, we see that this constant appears in the partition function as 
$Z' \sim C^{N/b}$, i.e. the free energy acquires an overall additive constant $F' \sim \ln C$ which will be of no further 
significance. 

The set of fixed points represents the most important structural characteristic of an RG 
analysis. They organize the space of “flowing” coupling constants into sectors of qualitatively 
different behavior. In particular, one may note that, at a fixed point, all characteristics 
of the model, including its correlation length $\xi$, remain invariant. On the other hand, we 
noticed above that an RG step is tantamount to doubling the fundamental length scale of 
the system. Consistency requires that either $\xi = 0$ or $\xi = \infty$. In the present case, the line 
of fixed points is identified with $u = \exp[-\beta J] = 1$, i.e. $\beta = 0$. This is the limit of infinitely 
large temperatures, at which we expect the model to be in a state of maximal thermal 
disorder, that is $\xi = 0$. Besides the high-temperature fixed line, there is a zero-temperature 
fixed point $(u, v) = (\exp[-\beta J], \exp[-\beta H]) = (0,1)$, implying $T \to 0$ and $h = 0$. Upon 
approaching zero temperature, the system is expected to order and to build up long-range 
correlations, $\xi \to \infty$. 

Notice, however, an important difference between the high- and the low-temperature set 
of fixed points: while the former is an attractive fixed point in the sense that the RG 
trajectories approach it asymptotically, the latter is a repulsive fixed point. No matter 
how low the temperature at which we start, the RG flow will drive us into a regime of 
effectively higher temperature or lower ordering. (Of course, the physical temperature does 
not change under renormalization. All we are saying is that the block spin model behaves 
as an Ising model at a higher temperature than the original system.) 
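The repulsive character of the zero-temperature fixed point and the attraction towards the high-temperature fixed line can be observed by simply iterating the map. The sketch below uses the starting values quoted in the caption of Fig. 8.3; the function names and the number of iterations (40) are arbitrary illustrative choices.

```python
import math

def rg_map(u, v):
    """One b = 2 block spin step (u, v) -> (u', v'), cf. Eq. (8.8)."""
    w = u**4 + u**-4 + v**2 + v**-2
    u_new = math.sqrt(v + 1.0 / v) / w**0.25
    v_new = math.sqrt((u**4 + v**2) / (u**4 + v**-2))
    return u_new, v_new

def rg_flow(u0, v0, steps):
    """Trajectory of coupling constants under repeated application of rg_map."""
    trajectory = [(u0, v0)]
    for _ in range(steps):
        trajectory.append(rg_map(*trajectory[-1]))
    return trajectory

for start in [(0.01, 0.9999), (0.01, 0.999), (0.01, 0.99)]:
    trajectory = rg_flow(*start, steps=40)
    u_end, v_end = trajectory[-1]
    print(f"start {start}: u -> {u_end:.6f}, v -> {v_end:.6f}")
```

All three trajectories leave the vicinity of $(u, v) = (0, 1)$ and come to rest on the line $u = 1$, at values of $v$ that depend on the starting point.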

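To explore the low-temperature phase of the system quantitatively, we may linearize 
the RG map in the vicinity of the $T = 0$ fixed point. That is, condensing the fixed point 
coordinates into a two-component vector, $\mathbf{x}^* \equiv (u^*, v^*)^T = (0, 1)^T$, and assuming that $\Delta\mathbf{x}$ 
parameterizes a small deviation from the fixed point, one may write 
$\mathbf{x}^* + \Delta\mathbf{x}' = \mathbf{f}(\mathbf{x}^* + \Delta\mathbf{x}) = \mathbf{f}(\mathbf{x}^*) + \partial_{\mathbf{x}}\mathbf{f}\cdot\Delta\mathbf{x} + \mathcal{O}(\Delta\mathbf{x}^2)$. 
Now, drawing on the invariance of the fixed point, $\mathbf{f}(\mathbf{x}^*) = \mathbf{x}^*$, 
we obtain the linearized map $\Delta\mathbf{x}' = \partial_{\mathbf{x}}\mathbf{f}\cdot\Delta\mathbf{x} + \mathcal{O}(\Delta\mathbf{x}^2)$. To explore the linearized mapping 
in more detail, it is convenient to introduce yet another pair of variables, namely $r \equiv u^4$, 
$s \equiv v^2$, whereupon the RG transformation becomes rational, 
$r' = \frac{2 + s + s^{-1}}{r + r^{-1} + s + s^{-1}}$, $s' = \frac{r + s}{r + s^{-1}}$. 
Differentiating this map at the fixed point $(r, s) = (0, 1)$, it is straightforward to show that 

$$\begin{pmatrix} \Delta r' \\ \Delta s' \end{pmatrix} = \begin{pmatrix} 4 & 0 \\ 0 & 2 \end{pmatrix}\begin{pmatrix} \Delta r \\ \Delta s \end{pmatrix}.$$

Noting that a transformation with $b = 4$, say, is equivalent to a two-fold application of a 
$b = 2$ transformation, one may recast the relation above in the more general form 

$$\begin{pmatrix} \Delta r' \\ \Delta s' \end{pmatrix} = \begin{pmatrix} b^2 & 0 \\ 0 & b \end{pmatrix}\begin{pmatrix} \Delta r \\ \Delta s \end{pmatrix}, \tag{8.9}$$

applicable to arbitrary $b$. 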
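The linearization can be confirmed by numerical differentiation of the rational map at the zero-temperature fixed point $(r, s) = (0, 1)$. In the sketch below, the expression for $r'$ has been multiplied through by $r$ so that it stays regular at $r = 0$; this rewriting, the function names, and the finite-difference step are choices made here for illustration.

```python
import numpy as np

def rg_map_rs(r, s):
    """b = 2 RG map in the variables r = u^4, s = v^2, written to be regular at r = 0."""
    r_new = r * (2.0 + s + 1.0 / s) / (1.0 + r * r + r * (s + 1.0 / s))
    s_new = (r + s) / (r + 1.0 / s)
    return np.array([r_new, s_new])

def jacobian(func, point, eps=1e-6):
    """Central-difference Jacobian of a map R^2 -> R^2 at the given point."""
    point = np.asarray(point, dtype=float)
    jac = np.empty((2, 2))
    for j in range(2):
        step = np.zeros(2)
        step[j] = eps
        jac[:, j] = (func(*(point + step)) - func(*(point - step))) / (2.0 * eps)
    return jac

def rg_map_twice(r, s):
    """Two successive b = 2 steps, i.e. one b = 4 step."""
    return rg_map_rs(*rg_map_rs(r, s))

fixed_point = (0.0, 1.0)
print(jacobian(rg_map_rs, fixed_point))     # expected close to diag(4, 2)
print(jacobian(rg_map_twice, fixed_point))  # expected close to diag(16, 4)
```

The second Jacobian illustrates the step from $b = 2$ to $b = 4$: composing the map with itself squares the eigenvalues, in line with the form $\mathrm{diag}(b^2, b)$.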
To make use of Eq. (8.9), let us once again consider the reduced free energy Eq. (8.5): 
$f(\Delta r, \Delta s) \equiv -N^{-1}\ln Z_N(K(\Delta r), h(\Delta s)) \equiv -N^{-1}\ln Z_N(\Delta r, \Delta s)$ and reformulate Eq. (8.7) 
according to 

$$f(\Delta r, \Delta s) = -\frac{1}{N}\ln Z_N(\Delta r, \Delta s) = -\frac{1}{N'b}\ln Z_{N'}(\Delta r', \Delta s') = \frac{1}{b} f(b^2\Delta r, b\Delta s),$$

where $N' = N/b$. 

This equation describes the “scaling” of the free energy density under block spin trans- 
formations (all in the linearizable low-temperature regime) or, equivalently, changes of the 
fundamental length scale at which we consider our model. The right-hand side of the equa- 
tion describes how the model would look from a “blurred” perspective where all degrees of 
freedom on scales < b have been comprised in a single structural unit. 
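
As a minimal numerical sketch of the linearization, assuming the rational form $r' = (2 + s + s^{-1})/(r + r^{-1} + s + s^{-1})$, $s' = (r + s)/(r + s^{-1})$ of the $b = 2$ map, one may check by finite differences that small deviations from $(r, s) = (0, 1)$ grow by factors close to 4 and 2 in a single step, and close to 16 and 4 after two steps (i.e. $b = 4$). The function names and step sizes are illustrative choices, not part of the original analysis.

```python
def rg_map(r, s):
    """One b = 2 block spin step in the variables r = u**4, s = v**2 (assumed form)."""
    r_new = (2.0 + s + 1.0 / s) / (r + 1.0 / r + s + 1.0 / s)
    s_new = (r + s) / (r + 1.0 / s)
    return r_new, s_new


def growth_factors(n_steps=1, eps=1e-6):
    """Estimate (dr'/dr, ds'/ds) after n_steps by finite differences.

    The map contains 1/r, so the fixed point value r = 0 is approached but
    never evaluated exactly: r is kept at a tiny positive value throughout.
    """
    # Displacement along r, with s held at its fixed point value 1.
    r, s = eps, 1.0
    for _ in range(n_steps):
        r, s = rg_map(r, s)
    r_factor = r / eps

    # Displacement along s, with r negligibly small.
    r, s = eps ** 3, 1.0 + eps
    for _ in range(n_steps):
        r, s = rg_map(r, s)
    s_factor = (s - 1.0) / eps
    return r_factor, s_factor


if __name__ == "__main__":
    print(growth_factors(1))  # close to (4, 2)
    print(growth_factors(2))  # close to (16, 4), i.e. (b**2, b) with b = 4
```

The two-step result illustrates why the linearized map can be written with entries $b^2$ and $b$ for general $b$.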

Importantly, $b$ is a free parameter without intrinsic significance; it can be set to any desired
value. For example, we may find it convenient to look at our model at scales where $b^2 \Delta r = 1$.
With this choice one obtains $f(\Delta r, \Delta s) = \Delta r^{1/2} f(1, \Delta s/\Delta r^{1/2}) \equiv \Delta r^{1/2} g(\Delta s/\Delta r^{1/2})$,
where the dimensionless one-parameter function $g$ is defined through the second equality.
Finally, one can relate back to the physical parameters of the system:

$$ \Delta r = r - 0 = r = u^4 = e^{-4K}, \qquad \Delta s = s - 1 = v^2 - 1 = e^{-2h} - 1 \simeq -2h, $$

which brings us to the scaling relation (the numerical factor $-2$ being absorbed into the definition of $g$)

$$ f = e^{-2K} g(e^{2K} h). \qquad (8.10) $$

This is the scaling form predicted by the RG analysis. Notice that a non-trivial statement is 
being made. The a priori dependence of the free energy on two independent parameters K, h 
has been reduced to a one-parameter function, multiplied by an overall prefactor. Indeed, 
we expect on general grounds (see the discussion of the previous section) that the reduced
free energy should scale with the inverse of the correlation length $\xi$, which in turn diverges
upon approaching the zero-temperature fixed point. Comparing with Eq. (8.10), and noting
that there are no reasons for the rescaled free energy $g(x) = f(1, x)$ to diverge by itself, we
conclude that the divergence of $\xi$ is driven by the prefactor, i.e. $\xi \sim e^{2K}$, in agreement with
the result of the exact analysis. With this identification we obtain

$$ f = \xi^{-1} g(\xi h), $$

i.e. the magnetic field appears in conjunction with the correlation length and we have
reproduced the exact asymptotic Eq. (8.6).
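
The one-parameter scaling form can be tested against the exact transfer-matrix solution. The sketch below assumes the standard largest eigenvalue $\lambda_+ = e^{K}[\cosh h + (\sinh^2 h + e^{-4K})^{1/2}]$ of the one-dimensional Ising chain and, as an assumption of this illustration, subtracts the regular ground-state contribution $-K$ from $f = -\ln \lambda_+$ before rescaling. If the scaling relation holds, $e^{2K}(f + K)$ evaluated at $h = x e^{-2K}$ should depend on $x$ alone.

```python
import math


def singular_free_energy(K, h):
    """f + K per site for the infinite chain, with f = -ln(lambda_plus)."""
    return -math.log(math.cosh(h) + math.sqrt(math.sinh(h) ** 2 + math.exp(-4.0 * K)))


def rescaled_free_energy(K, x):
    """exp(2K) * (f + K), evaluated at the field h = x * exp(-2K)."""
    h = x * math.exp(-2.0 * K)
    return math.exp(2.0 * K) * singular_free_energy(K, h)


if __name__ == "__main__":
    for x in (0.0, 0.5, 1.0, 2.0):
        # The two values nearly coincide although K differs: data collapse.
        print(x, rescaled_free_energy(3.0, x), rescaled_free_energy(5.0, x))
```

For large $K$ the rescaled values approach $-(1 + x^2)^{1/2}$, with corrections of order $e^{-2K}$, so the collapse improves as the zero-temperature fixed point is approached.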

Notice that there is no reason to be too irritated about the excessive appearance of vague
proportionalities “$\sim$”. As fixed points are approached, physical systems tend to build up
all sorts of singular scales. The most fundamental of these is the correlation length, but
the divergence of $\xi$ usually entails singular behavior of other physical quantities. These
singularities characterize much of the observable behavior of a system both theoretically
and, in fact, experimentally. In the immediate vicinity of a fixed or transition point, all but
the strongest driving forces of singular scaling are of secondary importance. Indeed, this
is why we could, for example, conclude from Eq. (8.10) that $\xi \sim \exp(2K) = \exp(2\beta J)$.
We know on general grounds that $f \sim \xi^{-1}$. On the other hand, (8.10) implies that the
leading (i.e. exponential) driving force behind the divergence of that scaling factor must
reside in $\exp(2\beta J)$. This leads to the identification $\xi \sim \exp(2\beta J)$, where all other factors of
proportionality are of secondary importance.

EXERCISE Apply the block spin RG procedure to the one-dimensional $q$-state Potts spin model
$\beta H = -K \sum_i \delta_{s_i, s_{i+1}}$, where $s_i = (1, 2, \ldots, q)$. Identify all fixed points and note their stability.

In Section 8.3.3 below we discuss these scaling arguments from a more rigorous and general 
perspective. However, before doing so, let us exemplify an RG analysis of a second case 
study. 


8.2 Dissipative quantum tunneling 

Previously, we have explored the impact of the external environment on the quantum 
mechanical tunneling of a particle from a well. In particular, we have seen that, under a 
wide range of conditions, the influence of the environment can be captured by an ohmic 
contribution to the action, of the form $S_{\rm diss}[\theta] = \frac{1}{\pi T g} \sum_{\omega_n} |\omega_n| |\theta_n|^2$, where the parameter
$g^{-1}$ characterizes the bare viscosity. Indeed, we have seen that a similar dissipative structure
appears in a number of different settings, including the problem of impurity scattering
in a quantum wire (see Problem 4.5). There, the dissipative term appeared as a result of
integrating over the collective fluctuations of the bosonized field while pinning the field $\theta$
at the impurity.

In the following, we exploit an RG program to uncover the effect of dissipation on the
facility to tunnel. To be concrete, let us assume that the quantum mechanical particle
inhabits a periodic potential, $U(\theta) = c \cos\theta$. In this case, the quantum transition probability
can be encapsulated by the quantum partition function $Z = \int D\theta\, \exp(-S[\theta])$, where$^{8}$

$$ S[\theta] = \frac{1}{4\pi T g} \sum_{\omega_n} |\omega_n| |\theta_n|^2 + c \int d\tau\, \cos(\theta(\tau)). \qquad (8.11) $$


To explore the physical behavior of the system described by the effective action (8.11), we
might decide to focus on a physical observable that would then be calculated by taking
appropriate functional averages with respect to the effective action $S[\theta]$. However, we shall
instead proceed in a somewhat more indirect manner: following the general philosophy
outlined at the beginning of the chapter, we begin by arbitrarily subdividing the set of all
fields $\{\theta\}$ into short- and long-wavelength degrees of freedom. For example, assuming that
the maximum frequency up to which the effective bosonic theory of the problem applies is
given by $\Lambda$, we might say that fluctuations on scales $\Lambda/b < |\omega_n| < \Lambda$ are “fast” ($b > 1$)
while those with $|\omega_n| < \Lambda/b$ are “slow.”


INFO Notice that this mechanism of subdivision of the degrees of freedom differs from that
employed earlier. In the previous example, we carried out an RG transformation by integrating
out field fluctuations on short scales in real space, i.e. we performed a real space renormalization.
Here, we chose to integrate over a high-lying sector in frequency space. Alluding to the
status of frequency as the $(d+1)$th component of a generalized momentum, this implementation
of the RG transformation is called momentum shell renormalization. While the real space
renormalization may seem more intuitive, it is usually less convenient to carry out analytically.
(As regards numerical renormalization procedures, the situation is different.) It goes without
saying that the net outcome of any RG strategy must not depend on the specific choice of the
integration procedure.

8 Here, taking the viscosity $1/g$ to be large, we have neglected the influence of the kinetic contribution $\sim \int d\tau\, \dot\theta^2$,
which serves only to regularize the UV behavior.

Our first objective is to identify the effective action of the slow degrees of freedom after
the fast fluctuations have been integrated out. To prepare this integration, it is useful
to decompose a general field amplitude $\theta(\tau)$ into a slow contribution $\theta_s(\tau)$ and its fast
complementary part $\theta_f(\tau)$, that is $\theta(\tau) \equiv \theta_s(\tau) + \theta_f(\tau)$, where

$$ \theta_s(\tau) \equiv \sum_s e^{-i\omega_n \tau} \theta_n \equiv \int_s \frac{d\omega}{2\pi}\, e^{-i\omega\tau} \theta(\omega), \qquad \theta_f(\tau) \equiv \sum_f e^{-i\omega_n \tau} \theta_n \equiv \int_f \frac{d\omega}{2\pi}\, e^{-i\omega\tau} \theta(\omega). \qquad (8.12) $$

Here we have introduced the abbreviations $\sum_s \equiv \sum_{|\omega_n| < \Lambda/b}$ and $\sum_f \equiv \sum_{\Lambda/b \le |\omega_n| < \Lambda}$.
Anticipating that we shall be working at very low temperatures $T \ll \Lambda$, we note that one
may represent the discrete frequency summations by integrals, $\int_s d\omega \equiv \int_{|\omega| < \Lambda/b} d\omega$ and
$\int_f d\omega \equiv \int_{\Lambda/b \le |\omega| < \Lambda} d\omega$. (It is a good exercise to convince oneself of the legitimacy of this
simplification at every step of the construction below. Also notice that, with the continuum
fields, $\theta(\omega) = \theta_n/T$.)

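On substituting into the continuum representation of the action (8.11),

$$ S[\theta] = \frac{1}{4\pi g} \int \frac{d\omega}{2\pi}\, |\theta(\omega)|^2 |\omega| + c \int d\tau\, \cos(\theta(\tau)), \qquad (8.13) $$

one obtains $S[\theta_s, \theta_f] = S_s[\theta_s] + S_f[\theta_f] + S_U[\theta_s, \theta_f]$, where

$$ S_{s,f}[\theta_{s,f}] = \frac{1}{4\pi g} \int_{s,f} \frac{d\omega}{2\pi}\, |\theta(\omega)|^2 |\omega|, \qquad S_U[\theta_s, \theta_f] = c \int d\tau\, \cos(\theta_s(\tau) + \theta_f(\tau)). $$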

To proceed with this expression, we will resort to an approximation that is difficult to justify
in advance. Defining $e^{-S_{\rm eff}[\theta_s]} \equiv e^{-S_s[\theta_s]} \langle e^{-S_U[\theta_s, \theta_f]} \rangle_f$, where $\langle \ldots \rangle_f \equiv \int D\theta_f\, e^{-S_f[\theta_f]} (\ldots)$, and
assuming that the coupling constant $c$ is small, we approximate

$$ e^{-S_{\rm eff}[\theta_s]} \simeq e^{-S_s[\theta_s]}\, e^{-\langle S_U[\theta_s, \theta_f] \rangle_f}. \qquad (8.14) $$

That is, by assuming that the coupling constant $c$ is in some sense small, we expand in
$c$ (which appears to be similar to our previous perturbative approaches), only to
re-exponentiate in the next step, a manipulation very different from plain perturbation
theory. Evidently, the validity of this step is bound to small values of the coupling constant.
However, before attempting a more qualified justification, let us tentatively accept the
approximation above and explore its consequences.

Computing the average

$$ \begin{aligned}
\langle S_U \rangle_f &= c \int D\theta_f\, e^{-S_f[\theta_f]} \int d\tau\, \cos(\theta_f(\tau) + \theta_s(\tau)) \\
&= \frac{c}{2} \int d\tau\, e^{i\theta_s(\tau)} \int D\theta_f\, e^{-\int_f \frac{d\omega}{2\pi} \left( \frac{1}{4\pi g} |\theta(\omega)|^2 |\omega| - i e^{-i\omega\tau} \theta(\omega) \right)} + {\rm c.c.} \\
&= \frac{c}{2} \int d\tau\, e^{i\theta_s(\tau)}\, e^{-\pi g \int_f \frac{d\omega}{2\pi} |\omega|^{-1}} + {\rm c.c.} = c \int d\tau\, \cos(\theta_s)\, e^{-2\pi g \int_{\Lambda/b}^{\Lambda} \frac{d\omega}{2\pi} \omega^{-1}} \\
&= c \int d\tau\, \cos(\theta_s)\, e^{-g \ln b} = c\, b^{-g} \int d\tau\, \cos(\theta_s),
\end{aligned} $$

we arrive at the remarkable conclusion that the effective action for the slow field,

$$ S_{\rm eff}[\theta_s] = \frac{1}{4\pi g} \int_s \frac{d\omega}{2\pi}\, |\theta_s(\omega)|^2 |\omega| + c\, b^{-g} \int d\tau\, \cos(\theta_s), $$

is structurally identical to the action we started out from; in other words,
we have seen that the action is renormalizable. Nonetheless, $S_{\rm eff}$ differs
from the initial action $S$ above in two important respects: firstly, the
integration over the fast fields induces a change of the coupling constant
of the perturbation; secondly, the new action takes values on field configurations
that fluctuate only on scales $|\omega_n| < \Lambda/b$.
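
The reduction factor $b^{-g}$ can be checked directly from the discrete Matsubara sum. The sketch below assumes the Gaussian fast-mode action $(4\pi T g)^{-1} \sum_f |\omega_n| |\theta_n|^2$ with $\omega_n = 2\pi T n$, for which each pair of modes $\pm n$ contributes $2g/n$ to $\langle \theta_f(\tau)^2 \rangle$, and uses $\langle \cos(\theta_s + \theta_f) \rangle_f = \cos(\theta_s)\, e^{-\langle \theta_f^2 \rangle / 2}$. The function names and the default values of the cutoff and temperature are illustrative.

```python
import math


def fast_mode_variance(g, b, cutoff, temperature):
    """<theta_f(tau)^2> summed over the shell cutoff/b <= |omega_n| < cutoff."""
    two_pi_t = 2.0 * math.pi * temperature
    n_min = math.ceil(cutoff / (b * two_pi_t))  # first positive mode inside the shell
    n_max = math.ceil(cutoff / two_pi_t)        # first positive mode above the shell
    # Each pair (+n, -n) contributes 2 * (2 pi T g) / omega_n = 2 g / n.
    return sum(2.0 * g / n for n in range(n_min, n_max))


def cosine_reduction(g, b, cutoff=1.0, temperature=1e-6):
    """Factor multiplying cos(theta_s) after the fast modes are averaged out."""
    return math.exp(-0.5 * fast_mode_variance(g, b, cutoff, temperature))


if __name__ == "__main__":
    for g, b in ((0.5, 2.0), (1.5, 3.0)):
        print(g, b, cosine_reduction(g, b), b ** (-g))
```

At low temperature, $T \ll \Lambda$, the sum over the shell approaches $2g \ln b$, so that the two printed columns agree up to corrections of order $T/\Lambda$.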

The next step of the RG program requires the comparison of the model
before and after the integration over the fast fields. However, two model
actions taking values on different sets of field configurations cannot be
sensibly compared. Rather, one must effect a rescaling of the fundamental
unit of frequency/time, $(\tau, \omega) \to (\tau', \omega')$, such that the transformed field $\theta_s(\omega) \to \theta'(\omega')$
also fluctuates on all scales $|\omega'| < \Lambda$. We thus introduce a rescaled frequency variable
according to

$$ \omega' = b\omega. $$

This change of variables engages a number of secondary transformations. In order to keep
the dimensionless combination $\omega\tau$ relating time to frequency invariant, one must set $\tau' = b^{-1}\tau$.
As to the transformation of the field variable $\theta$, there is some freedom (because
$\theta$ is an integration variable that can be transformed arbitrarily). However, the Fourier
representation Eq. (8.12) implies that, after the transformation of $\theta(\tau)$ has been fixed, the
transformation of $\theta(\omega)$ follows from our previous rescaling of frequency/time. In order to
keep the algebraic structure of the cosine operator invariant, we choose to define $\theta'(\tau') \equiv \theta_s(\tau)$.
Equation (8.12) then enforces the condition

$$ \theta'(\omega') = b^{-1} \theta(\omega). $$

Substitution of the new variables $\omega'$, $\tau'$, $\theta'(\tau')$, and $\theta'(\omega')$ into the effective action then gives

$$ S_{\rm eff}[\theta_s] = S'[\theta'] \equiv \frac{1}{4\pi g} \int_{|\omega'| < \Lambda} \frac{d\omega'}{2\pi}\, |\theta'(\omega')|^2 |\omega'| + c\, b^{1-g} \int d\tau'\, \cos(\theta'(\tau')). $$

Notice that the first contribution to the action remained invariant, save for the fact that
the integration now extends over the full interval $|\omega'| < \Lambda$.

INFO In fact, one may note that, without any calculation, the transformation of the action
under the rescaling of variables could have been anticipated from dimensional analysis: the
definition $\omega' = b\omega$ implies that all contributions to the action of dimension [frequency]$^d$ change by
a factor $b^{-d}$. Since $\theta(\tau)$ is a dimensionless phase, we have $[\theta(\tau)] = 1$ while $[\theta(\omega)] = [\text{frequency}]^{-1}$.
Thus, the first term of the action has dimension 1 and remains invariant. The second operator
carries the dimension $[d\tau] = [\text{frequency}]^{-1}$ and, therefore, changes by a factor $b$.
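
The dimensional argument can also be verified term by term. The following short worked version assumes the frequency measure $d\omega/2\pi$ in the quadratic term and the rescaling $\omega' = b\omega$, $\tau' = \tau/b$, $\theta'(\tau') = \theta_s(\tau)$, $\theta'(\omega') = \theta(\omega)/b$; it is meant as a consistency check rather than as an independent derivation.

```latex
\begin{align*}
\frac{1}{4\pi g} \int_{|\omega| < \Lambda/b} \frac{d\omega}{2\pi}\, |\theta(\omega)|^2 |\omega|
  &= \frac{1}{4\pi g} \int_{|\omega'| < \Lambda} \frac{d\omega'}{2\pi b}\; b^2 |\theta'(\omega')|^2\, \frac{|\omega'|}{b}
   = \frac{1}{4\pi g} \int_{|\omega'| < \Lambda} \frac{d\omega'}{2\pi}\, |\theta'(\omega')|^2 |\omega'|, \\
c\, b^{-g} \int d\tau\, \cos(\theta_s(\tau))
  &= c\, b^{-g} \int b\, d\tau'\, \cos(\theta'(\tau'))
   = c\, b^{1-g} \int d\tau'\, \cos(\theta'(\tau')).
\end{align*}
```

The factors of $b$ cancel in the first line (dimension 1), while the cosine term picks up a single factor $b$ from the time measure.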

We are now in a position to compare the effective actions $S[\theta]$ and $S'[\theta']$ before and after
the integration over the fast modes. Obviously, the principal effect of the integration over
fast modes is that the coupling constant of the periodic potential has changed according to
the relation

$$ c \to c(b) = c\, b^{1-g}. \qquad (8.15) $$

(However, do not forget a point that is implicit in the notation: the new action describes
fluctuations on slower frequency scales or larger temporal scales. That this difference is not
manifest in the notation is due to the fact that we chose to measure the “new” frequency
continuum in rescaled variables $\omega' = b\omega$.)

This finding contains a preliminary answer to the question formulated above. To recapitulate,
we asked how the presence of the dissipative term manifests itself in the facility
to tunnel. We have found that an integration over short-scale fluctuations of the collective
field $\theta$ alters the effective strength of the potential. (In the context of the quantum impurity
problem in the wire, this change is a manifestation of the mutual influence of the Friedel 
scattering pattern and interactions in the electron gas. Indeed, it is apparent that in the 
non-interacting case, i.e. for g = 1 (cf. Eq. (4.52)), the coupling constant does not change. 
For repulsive (attractive) interactions, g < 1 (g > 1) the coupling increases (decreases) in 
accord with the qualitative picture formulated above.) 

However, at this stage, our result merely indicates how the coupling constant changes
after one RG step. Notably, Eq. (8.15) depends in a non-universal manner on how we choose
to dissect the frequency spectrum (the parameter $b$). Ideally, however, we would like to know
the value of the coupling constant after all degrees of freedom down to a certain infrared
cutoff scale $\omega_{\min}$ have been integrated out. (For example, the role of $\omega_{\min}$ might be taken
by temperature, the oscillation frequency of an external perturbation, etc.) The general
route towards obtaining this information is to iterate the RG step, i.e., setting $c \equiv c^{(0)}$
and $c' \equiv c^{(1)}$, explore the sequence of coupling constants $c^{(0)} \to c^{(1)} \to c^{(2)} \to \cdots$. In all
that follows, it is convenient to think of this sequence as a kind of dynamical system. In
each discrete “time step,” the variable $c$ changes according to $c \to c' = c\, b^{1-g} = c\, e^{(1-g) \ln b}$.
Assuming that $b = 1 + \epsilon$ is very close to unity (which means that, in each step, only
an asymptotically thin layer in frequency space is “shaven off”), we can encapsulate this
information in the differential equation $\frac{c' - c}{\ln b} \simeq \frac{dc}{d \ln b} = c(1 - g)$. The evolution equation
describing the flow of the coupling constant under an infinitesimal change of the control
parameter,

$$ \frac{dc}{d \ln b} = \beta(c) = c(1 - g), \qquad (8.16) $$

is known as a Gell-Mann-Low equation. For historical reasons, the right-hand side of
the equation is called the $\beta$-function.

Equation (8.16) makes the interpretation of the RG-generated change of the coupling
constant as a dynamical system manifest. Thinking of the parameter $t \equiv \ln b$ as a “time
coordinate,” we can integrate the evolution equation to obtain$^{9}$ $c(t) = c(0)\, e^{(1-g)t}$, where
$c(t = 0) = c(b = 1)$ has the status of the bare coupling constant of the theory. (Remember
that, for $b = 1$, $\Lambda_{\rm eff} \equiv \Lambda/b = \Lambda$, which is the level of the unrenormalized theory.)
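
The picture of the RG as a discrete dynamical system can be illustrated with a few lines of code. The sketch below assumes the single-step rule $c \to c\, b^{1-g}$ and compares the result of many thin-shell steps with the closed-form solution $c(t) = c(0)\, e^{(1-g)t}$; the function names and parameter values are illustrative.

```python
import math


def rg_step(c, g, b):
    """One shell integration followed by rescaling: c -> c * b**(1 - g)."""
    return c * b ** (1.0 - g)


def flow_by_iteration(c0, g, b_total, n_steps):
    """Reach the total scale factor b_total through n_steps thin shells."""
    b = b_total ** (1.0 / n_steps)  # scale factor of a single thin shell
    c = c0
    for _ in range(n_steps):
        c = rg_step(c, g, b)
    return c


def flow_by_integration(c0, g, b_total):
    """Closed-form solution of dc/dln(b) = (1 - g) * c at t = ln(b_total)."""
    return c0 * math.exp((1.0 - g) * math.log(b_total))


if __name__ == "__main__":
    # g < 1: the coupling grows; g > 1: it shrinks.
    print(flow_by_iteration(0.01, 0.5, 100.0, 1000), flow_by_integration(0.01, 0.5, 100.0))
    print(flow_by_iteration(0.01, 1.5, 100.0, 1000), flow_by_integration(0.01, 1.5, 100.0))
```

In this example the iterated and the integrated flows coincide because the $\beta$-function is linear in $c$; for a nonlinear $\beta$-function only the limit of thin shells would reproduce the integrated flow.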

Having inferred the RG flow, one may wonder at what “time” $b_{\max}$ we should stop the
renormalization. What is the “real” value of the coupling constant? The answers to these
questions are somewhat application-specific. For example, we might have coupled the system
to an external perturbation of characteristic frequency $\omega_m$. In that case, we might want
to integrate out all degrees of freedom with frequency $\omega_n > \omega_m$ to then explore the effective
low-energy theory at scales $\sim \omega_m$, i.e. we would set $\Lambda_{\rm eff} = \omega_m$, or $t = \ln(\Lambda/\omega_m)$. The
effective theory would then look structurally identical to the microscopic model but with
a renormalized coupling constant $c = c(0) (\Lambda/\omega_m)^{1-g} \propto \omega_m^{g-1}$. (Notice that both the bare
constant $c(0)$ and the cutoff $\Lambda$ depend in a non-universal manner on microscopic elements
of the model. The use of the proportionality sign indicates that we are not, in general,
interested in these details but rather focus on the dependence of the coupling constant on the
low-energy scale $\omega_m$.) Alternatively, we might want to integrate out all degrees of freedom
down to the lowest frequency allowed by Matsubara frequency quantization, $\Lambda_{\rm eff} = 2\pi T$.
In this case, $c \propto T^{g-1}$. For a system of finite extent $L$, we might argue that the spectrum
of the modes $\theta(k)$ is quantized with $\omega_{\min} = \pi v/L$. In this case, the effective value of the
coupling at the lowest frequencies would be $c \propto L^{1-g}$. Summarizing, the RG flow should be
terminated at a low-energy scale determined by the specific problem under consideration.
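
To make the role of the proportionality sign concrete, the following sketch assumes the integrated flow $c = c(0)(\Lambda/\omega_{\min})^{1-g}$ and extracts the logarithmic slope of the effective coupling with respect to the infrared scale. The bare coupling and the cutoff (hypothetical default values below) change the overall magnitude of $c$ but drop out of the slope, which equals $g - 1$.

```python
import math


def effective_coupling(c_bare, g, cutoff, ir_scale):
    """Coupling after the flow has been stopped at Lambda_eff = ir_scale."""
    return c_bare * (cutoff / ir_scale) ** (1.0 - g)


def log_slope(g, scale_1, scale_2, c_bare=0.01, cutoff=1.0):
    """d ln(c) / d ln(ir_scale), estimated from two infrared scales."""
    c1 = effective_coupling(c_bare, g, cutoff, scale_1)
    c2 = effective_coupling(c_bare, g, cutoff, scale_2)
    return (math.log(c2) - math.log(c1)) / (math.log(scale_2) - math.log(scale_1))


if __name__ == "__main__":
    # Same exponent g - 1 for two different sets of microscopic parameters.
    print(log_slope(0.5, 1e-3, 1e-2))
    print(log_slope(0.5, 1e-3, 1e-2, c_bare=3.0, cutoff=50.0))
```

With the infrared scale identified as $2\pi T$, the same slope describes the temperature dependence $c \propto T^{g-1}$.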

Our results so far on the behavior of the coupling constant are summarized in the bottom
part of the figure below. We have found that, for interaction parameters $g > 1$ ($g < 1$),
the coupling constant $c$ decreases (increases). The “non-interacting” case $g = 1$ defines a
fixed line where the coupling strength does not change. However, at this stage, we must 
remember that the analysis was based on a spurious expansion of the action to first order 
in the impurity operator. This implies that, in the repulsive case, even if the initial value of 
the coupling constant was small, it will soon flow into a region where the perturbative RG 
analysis loses its meaning (indicated by a wavy line in the figure). 


9 Critical readers will notice that this result coincides with the change of the coupling constant obtained after
the first RG step. This coincidence, however, is a consequence of the simple structure of the $\beta$-function in the
present example. In more complicated cases, the change of the coupling constant after an infinitesimal RG step
will structurally differ from the result obtained after following the RG flow all the way down to the IR cutoff.
The concept of a $\beta$-function and its integration are therefore indispensable elements of the theory.





How do we know what happens after the coupling 
constant has disappeared into the forbidden zone? 
Remarkably, it turns out that, for the particular 
model in question, not only the regime of a weak 
potential but also the complementary case of an 
asymptotically large potential (the upper part of the 
figure) is amenable to analytical investigation. As a 
result of a strong coupling perturbative expansion 
(see Problem 8.8.1) one may show that the coupling 
constant flows to (away from) infinite coupling for repulsive (attractive) interactions. It is 
thus tempting to conjecture a globally increasing/decreasing flow continuously interpolating 
between the two asymptotic regimes. (Application of the machinery of conformal field 
theory to the problem has indeed shown this conjecture to be true.) 

Therefore, when put together, we arrive at a remarkable conclusion. For g > 1, the 
strength of the potential flows to zero, implying that the asymptotic dependence at zero 
temperature is free: the particle can tunnel freely between minima of the potential. How- 
ever, for g < 1, the coupling constant flows to infinity, implying that the particle becomes 
localized, confined to a minimum of the potential. 



INFO Notwithstanding the limitation to small values of the coupling, we still have to discuss the
consistency of the fast field integration, i.e. to show that the re-exponentiation of the potential
operator after the averaging over fast fluctuations (see Eq. (8.14)) underlying the previous
analysis is legitimate. According to Eq. (8.14), the functional average of the exponentiated action
can effectively be replaced by the exponential of the averaged action, $\langle \exp(-S_U) \rangle \approx \exp(-\langle S_U \rangle)$.
To test this assumption, let us examine the relevance of a typical correction.
For example, an expansion of the exponent to second order in the potential operator would lead
to expressions of the type

$$ c^2 \int d\tau\, d\tau'\, \left\langle \cos(\theta_s(\tau) + \theta_f(\tau)) \cos(\theta_s(\tau') + \theta_f(\tau')) \right\rangle_c, $$

where, here, we used the connected average $\langle AB \rangle_c = \langle AB \rangle - \langle A \rangle \langle B \rangle$ since the square of the
averaged action $\sim (\langle c \int d\tau \cos(\theta) \rangle)^2$ is already included in our previous scheme. At first sight,
this expression looks worrisome. By averaging over the fast field $\theta_f$ we are bound to generate
a composite operator that depends in a non-local manner on two time arguments $\tau$ and $\tau'$.
Re-exponentiation of these objects would lead to an action much more complicated than the
original.
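
For orientation, the structure of the expansion under discussion can be written out explicitly. The following is the standard cumulant expansion of a Gaussian-weighted average, stated here as a reminder rather than as part of the original derivation; the subscript $c$ denotes the connected average.

```latex
\begin{align*}
\left\langle e^{-S_U} \right\rangle_f
  &= \exp\!\Big( -\langle S_U \rangle_f + \tfrac{1}{2} \langle S_U^2 \rangle_{f,c} + O(c^3) \Big), \\
\langle S_U^2 \rangle_{f,c}
  &= \langle S_U^2 \rangle_f - \langle S_U \rangle_f^2 .
\end{align*}
```

Keeping only the first term in the exponent reproduces the re-exponentiated first-order result; the second term is the connected double-cosine correction whose relevance is at issue.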

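However, before losing hope, let us inspect the fast field average of the
expression above more closely. Representing the cos-functions as sums of exponentials, we are led to consider
expressions of the type

$$ \begin{aligned}
\left\langle e^{\pm i(\theta_s(\tau) + \theta_f(\tau))}\, e^{\pm i(\theta_s(\tau') + \theta_f(\tau'))} \right\rangle_c
&= e^{\pm i\theta_s(\tau) \pm i\theta_s(\tau')} \left( \left\langle e^{\pm i\theta_f(\tau) \pm i\theta_f(\tau')} \right\rangle_f - \left\langle e^{\pm i\theta_f(\tau)} \right\rangle_f \left\langle e^{\pm i\theta_f(\tau')} \right\rangle_f \right) \\
&= e^{\pm i\theta_s(\tau) \pm i\theta_s(\tau')} \left( e^{-2\pi g \int_f \frac{d\omega}{2\pi} \frac{1 \pm \cos(\omega(\tau - \tau'))}{|\omega|}} - e^{-2\pi g \int_f \frac{d\omega}{2\pi} \frac{1}{|\omega|}} \right) \approx 0.
\end{aligned} $$

Here, the $\pm$ signs in the first line indicate the four different possible combinations of signs; in the
second equality we have used our previous results on the integrals over the high-lying frequency
shell; and in the crucial third equality, we have noticed that, typically, $|\tau - \tau'| \gg b/\Lambda, 1/\Lambda$ such
that the oscillatory term integrates to something close to zero and can be neglected in comparison
with the constant.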

The estimate above is limited to time arguments $|\tau - \tau'| \Lambda/b \gg 1$ outside a narrow strip
$|\tau - \tau'| \sim b/\Lambda$. To show that the contribution of composite operators from these domains is also
negligible, we employ an argument that is symptomatic for RG analyses: consider the connected
average restricted to this strip,

$$ c^2 \int_{|\tau - \tau'| \lesssim b/\Lambda} d\tau\, d\tau'\, \left\langle \cos(\theta_s(\tau) + \theta_f(\tau)) \cos(\theta_s(\tau') + \theta_f(\tau')) \right\rangle_c \propto b \int d\tau\, \left\langle \cos^2(\theta_s(\tau) + \theta_f(\tau)) \right\rangle_c \propto b^{-4g+1} \int d\tau\, \cos^2(\theta_s(\tau)) + {\rm const.}, $$

where the notation highlights the fact that the integration area is proportional to $b$, and we
have used the fact that, for those narrow time windows, the field integration will be oblivious to
the difference between $\theta(\tau)$ and $\theta(\tau')$. The second proportionality is obtained by averaging the
integrand along the lines of our previous calculations.

After the rescaling of time and frequency to restore the old cutoff, the operator gets multiplied
by another factor of $b$, i.e. the overall scaling factor is given by $b^{2(1-2g)}$. This factor tells us that
the composite operator generated by the higher-order cumulant expansion of the action is of lesser
relevance than the contributions we are keeping. For values of the interaction $g \approx 1$
not too distant from unity, the relative value of the coupling constants of the anomalous operator
($\propto b^{2(1-2g)}$) and of the standard $\cos\theta$ contribution ($\propto b^{1-g}$) will scale to 0 as $b$ is increased.
A classification of operators according to their relevance under the RG transformation indeed 
forms the general basis for the limitation of low-energy actions to few contributions, namely 
those contributions that promise to be of strongest scaling relevance as larger and larger field 
fluctuations are probed. We will discuss this point more systematically in the next section. 
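
The comparison of the two scaling factors quoted above, $b^{2(1-2g)}$ for the composite operator and $b^{1-g}$ for the cosine term, can be made explicit in one line. The threshold $g > 1/3$ follows from this ratio alone and is stated here as a simple consequence, not as a result taken from the original analysis.

```latex
\[
\frac{b^{2(1-2g)}}{b^{1-g}} = b^{1-3g}
\;\xrightarrow{\; b \to \infty \;}\; 0
\qquad \text{for } g > \tfrac{1}{3}.
\]
```

In particular, for $g$ close to unity the composite operator loses weight relative to the cosine term roughly as $b^{-2}$.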

Before leaving this section, let us make a few general observations about the renormalization
procedure. We first notice that it would have been futile to attack the problem by the
“plain” perturbation theory developed in Chapter 5. The reason is that the propagator
of the $(0 + 1)$-dimensional effective theory, $|\omega|^{-1}$, leads to logarithmic divergences when
integrated over unbounded frequency intervals, i.e. the present theory is again plagued by
the UV/IR divergences observed above in different contexts. In Section 5.1 we had argued
that, in principle, a way to overcome these problems was to introduce a UV and, if needed,
an IR cutoff into the theory. However, we soon dismissed this option because it seemed
evident that it would lead to spurious non-universal cutoff dependences of physical results.

How do these observations relate to what we are doing presently? Obviously, the present$^{10}$
version of the RG procedure also relies on the introduction of a cutoff regularizing the
logarithmic UV divergences mentioned above; apparently, the RG procedure shares a lot
of structures with the perturbative approach. But, somehow, we managed to extract the
information in which we were interested (the dependence of the potential strength on
long-range system parameters) in a manner independent of the cutoff.$^{11}$ The key to obtaining
this information was to introduce not one, but an entire hierarchy of cutoffs and to integrate
over each of these domains recursively.

Now, a subtle and important point is that this procedure does not imply that the cutoff 
or, more generally, short-scale fluctuations of the model have silently made their way out 
of the theory. After all, the UV divergences mentioned before are manifestations of a large 
“phase volume” of field fluctuations that are likely to somehow affect the behavior of the 
system. To understand the “implicit” way through which these fluctuations enter our results, 
let us return to a remark made on page 425. There, we had noted that, upon scaling
frequency/time, each operator changes according to its physical dimension. An operator
carrying the dimension [time]$^d$ would acquire a scaling factor $b^d$. The scaling dimension of
an operator predicted by its “physical” dimension is called the naive scaling dimension,
the canonical scaling dimension, or, for obscure reasons, the engineering dimension.

The designations indicate, however, that these dimensions are not the last word on the
actual scaling behavior of an operator. Indeed, the net result of the RG analysis was that
our operator of interest, $\int d\tau \cos\theta$, an object of engineering dimension 1, changes according
to $b^{1-g}$. The correction to the naive scaling dimension (presently, $-g$) is called the anomalous
dimension of an operator. Its origin lies in the (cutoff-dependent) phase volume of
fluctuations co-determining the change of an operator during each RG step. Put differently,
we can say that the cutoff $\Lambda$, by itself a quantity of dimension [time]$^{-1}$, acts as a “gray
eminence” implicitly affecting the scaling behavior of an operator. The anomalous scaling
dimensions of the theory effectively determine its long-range observable behavior and,
therefore, represent quantities of prime interest.


8.3 Renormalization group: general theory 

Having discussed two extended examples, we are in a position to attempt a reasonably
general outline of the RG strategy. Suppose we are given a field theory defined through the
action

$$ S[\phi] \equiv \sum_{a=1}^{N} g_a O_a[\phi], $$

where $\phi$ is some (generally multi-component) field, $g_a$ are coupling constants and $O_a[\phi]$
a certain set of operators. For concreteness, one may think of these operators as $O_a =
\int d^d x\, (\nabla\phi)^n \phi^m$, i.e. as space-time local operators involving powers of the field and its
derivatives, although more general structures are conceivable.$^{12}$ By “renormalization of
the theory,” we refer to a scheme to derive a set of Gell-Mann-Low equations describing
the change of the coupling constants $\{g_a\}$ as fast fluctuations of the theory are successively
integrated out.

10 Below we become acquainted with UV regularization procedures that are not based on introduction of a cutoff.
11 One may object that the solutions of the $\beta$-functions given above actually do contain the bare cutoff, through an
initial condition; they also depend on the bare coupling strength and, possibly, other “non-universal” parameters.
However, that need not worry us: in most applications (both experimental and theoretical) one is interested
not so much in the “absolute value” of physical observables (as these usually depend on unknown material
parameters anyway) but rather in the way these observables change as a relevant control parameter is varied.
The important feature found above is that the rate at which the effective potential strength varies with
temperature, say, is largely universal and cutoff-independent.


8.3.1 Gell-Mann-Low equations

There are a number of methodologically different procedures whereby the set of flow equa- 
tions can be obtained from the microscopic theory. Here, we formulate this step in a language 
adjusted to applications in statistical field theory (as opposed to, say, particle physics). 
While there is considerable freedom in the actual implementation of the RG procedure, all 
methods share the feature that they proceed in a sequence of three more or less canonical 
steps. 


I: Subdivision of the field manifold

In the first step, one may decompose the integration manifold $\{\phi\}$ into a sector to be
integrated out, $\{\phi_f\}$, and a complementary set, $\{\phi_s\}$. For example:

▷ We may proceed according to a generalized block spin scheme and integrate over all
degrees of freedom located within a certain structural unit in the base manifold $\{x\}$.
(This scheme is adjusted to lattice problems where $\{x\} = \{x_i\}$ is a discrete set of points.
However, as pointed out above, even then it is difficult to implement analytically.)

▷ We could decide to integrate over a certain sector in momentum space. When this sector is
defined to be a shell $\Lambda/b < |p| < \Lambda$, one speaks of a momentum shell integration. Naturally,
within this scheme, the theory will be explicitly cutoff-dependent at intermediate
stages.

▷ Alternatively, we may decide to integrate over all high-lying degrees of freedom $\Lambda^{-1} < |p|$.
In this case, we will of course encounter divergent integrals. An elegant way to handle these
divergences is to apply dimensional regularization. Within this approach one formally
generalizes from integer dimensions $d$ to fractional values $d \pm \epsilon$. One motivation for doing
so (for another, see below) is that, miraculously, the formal extension of the characteristic
integrals appearing during the RG step to non-integer dimensions is finite. As long as
one stays clear of the dangerous values $d$ = integer one can then safely monitor the
dependence of the integrals on the IR cutoff $\Lambda^{-1}$. For a good introduction to dimensional
regularization we refer to the textbook by Ryder.$^{13}$

▷ For a discussion of alternative schemes, such as the introduction of short-distance real
space cutoffs underlying the so-called operator product expansion, we refer to the
literature (see, e.g., the excellent text by Cardy$^{1}$).

12 In our previous example of the Luttinger liquid, there appeared an operator $\int (d\omega/2\pi)\, \theta(\omega) |\omega| \theta(-\omega)$. When
represented in space-time, this operator is highly non-local.

II: RG step 

The second, and central, part of the program is to actually integrate over short-range 
fluctuations. As exemplified above, this step usually involves approximations. In most cases, 
one will proceed by a so-called loop expansion, i.e. one organizes the integration over 
the fast field $\phi_f$ according to the number of independent momentum integrals (loops$^{14}$)
that occur after the appropriate contractions. Of course, this strategy makes sense only if
we can guarantee that the contribution of loops of higher orders is in some sense small, 
a precondition that is, alas, often difficult to meet. At any rate, to engage loop numbers 
as an expansion parameter, we first need to understand the key role played by space- 
dimensionality in the present context. We return to this point in Section 8.4. 

Following the procedure, an expansion over the fast degrees of freedom gives an action

$$ S[\phi_s] \equiv \sum_a g'_a O'_a[\phi_s], $$

in which coupling constants of the remaining slow fields are altered. Notice that the integration
over fast field fluctuations may (and usually does) lead to the generation of “new”
operators, i.e. operators that have not been present in the bare action. In such cases one 
has to investigate whether the newly generated operators are “relevant” (see below) in their 
scaling behavior. If so, the appropriate way to proceed is to include these operators in the 
action from the very beginning (with an a priori undetermined coupling constant). One 
then verifies whether the augmented action represents a complete system, i.e. one that does 
not lead to the generation of operators beyond those that are already present. If necessary, 
one has to repeat this step until a closed system is obtained. 

III: Rescaling

One next rescales frequency/momentum so that the rescaled field amplitude $\phi'$ fluctuates
on the same scales as the original field $\phi$, i.e. one sets

$$q \to bq, \qquad \omega \to b^z \omega.$$

Here, the frequency renormalization exponent or dynamical exponent z may be
unity, two, or sometimes a non-integer value, depending on the effective dispersion relating
frequency and momentum. We finally note that the field $\phi$, as an integration variable, may
be rescaled arbitrarily. Using this freedom, we select a term in the action which we believe
governs the behavior of the “free” theory - in a theory with elastic coupling this might,
for example, be the leading-order gradient operator $\sim \int d^d r\,(\nabla\phi)^2$ - and require that it be
strictly invariant under the RG step. To this end we designate a dimension $L^{d_\phi}$ for the
field, chosen so as to compensate for the factor $b^x$ arising after the renormalization of the
operator. The rescaling

$$\phi \to b^{d_\phi} \phi$$

is known as field renormalization. It renders the “leading” operator in the action scale
invariant.

13 L. H. Ryder, Quantum Field Theory (Cambridge University Press, 1996).
14 For the definition of loops, see Section 5.1.

As a result of all these manipulations, we obtain a renormalized action

$$S'[\phi] = \sum_a g_a'\, \mathcal{O}_a[\phi],$$

which is entirely described by the set of changed coupling constants, i.e. the effect of the
RG step is fully encapsulated in the mapping

$$g' = \tilde{R}(g),$$

relating the old value of the vector of coupling constants, $g = \{g_a\}$, to the renormalized one,
$g' = \{g_a'\}$. By letting the control parameter, $\ell \equiv \ln b$, of the RG step assume infinitesimal
values, one can make the difference between bare and renormalized coupling constants
arbitrarily small. It is then natural to express the difference $g' - g = \tilde{R}(g) - g$ in the form
of a generalized β-function or Gell-Mann-Low equation

$$\frac{dg}{d\ell} = R(g), \qquad (8.17)$$

where the right-hand side is defined through the relation $R(g) = \lim_{\ell \to 0} \ell^{-1} (\tilde{R}(g) - g)$.
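
The passage from a finite RG map to the Gell-Mann-Low equation can be made concrete with a small numerical sketch. The flow used here, $R(g) = \epsilon g - g^2$ with a single coupling, is purely a toy assumption chosen because its finite map is known in closed form; the names `EPS`, `beta`, `finite_rg_map`, and `beta_from_map` are hypothetical and do not come from the text.

```python
import math

EPS = 0.5  # hypothetical parameter of the toy flow (not taken from the text)

def beta(g):
    """Toy beta function R(g) = EPS*g - g**2 (an illustrative assumption)."""
    return EPS * g - g * g

def finite_rg_map(g, ell):
    """Finite RG map: exact solution of dg/dl = beta(g) after a flow 'time' ell = ln b."""
    growth = math.exp(EPS * ell)
    return EPS * g * growth / (EPS + g * (growth - 1.0))

def beta_from_map(g, ell):
    """Difference quotient (map(g) - g) / ell, which tends to R(g) as ell -> 0."""
    return (finite_rg_map(g, ell) - g) / ell

g0 = 0.2
# Deviation of the difference quotient from R(g0) for successively smaller RG steps.
errors = [abs(beta_from_map(g0, ell) - beta(g0)) for ell in (1e-1, 1e-2, 1e-3)]
```

In this toy setting the entries of `errors` shrink roughly in proportion to the step size, which is the numerical counterpart of the limit defining the right-hand side of Eq. (8.17). The points g = 0 and g = `EPS` are left unchanged by the finite map, in line with the fixed-point condition R(g*) = 0.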

INFO As mentioned at the beginning of the section, the formulation of the RG step above is 
actually not the only one possible. For instance, in high-energy physics, other renormalization 
schemes appear to be more natural. In this area of physics, there is actually no reason to believe 
in the existence of a well-defined “bare” action with finite coupling constants. (Contrary to the 
situation in condensed matter physics, the bare action of quantum electrodynamics, say, is in 
principle inaccessible.) However, one may legitimately require that, after an integration over 
UV-divergent fluctuations, the “renormalized” coupling constants of the theory (which, in turn, 
determine observables such as the physical electron mass) are finite. One may then postulate 
that the bare coupling constants of the theory are actually infinite. The value of these infinities is 
fine-tuned so as to combine with the fluctuation-induced “infinities” to realize finite renormalized 
coupling constants. Alternatively, one may deliberately add extra operators, counter-terms, to 
the action which are designed so as to cancel divergences due to fluctuations. However, the net 
result of all these RG schemes (which are by and large equivalent) is a mapping describing the 
flow of the coupling constants upon variation of a control parameter. 
Figure 8.4 The fractal Julia set is self-similar in the sense that any sub-region of it contains the
full information of the original set. 


8.3.2 Analysis of the Gell-Mann-Low equation 

The Gell-Mann-Low equation (8.17) represents the principal result of an RG analysis. 
Thinking of the control parameter $\ell$ as a kind of “flow parameter,” one may identify this
equation as a generalized dynamical system, namely the system describing the evolution of
the effective coupling constants of a model upon changing length or time scales. As with any
dynamical system, the prime structural characteristic of the set of equations (8.17) is the set
of fixed points, i.e. the submanifold $\{g^*\}$ of points in coupling constant space which are
stationary under the flow: $R(g^*) = 0$. Once the coupling constants are fine-tuned to a fixed
point, the system no longer changes under subsequent RG transformations. In particular it 
remains invariant under the change of space/time scale associated with the transformation. 
Alluding to the fact that they look the same no matter how large a magnifying glass is 
used, systems with this property are referred to as self-similar. (For example, fractals 
such as the Julia set shown in Fig. 8.4 are paradigmatic examples of self-similar systems; 
the magnification of any sub-region of the fractal looks identical to the full system.) 

Figure 8.5 The RG flow in the vicinity of a fixed point with two irrelevant ($\Phi_1$, $\Phi_3$) and
one relevant ($\Phi_2$) scaling fields. The manifold S defined through the vanishing of the relevant field,
$\Phi_2 = 0$, is called a critical surface. On this submanifold, the RG flow is directed towards the fixed
point f. Deviations off criticality make the system approach one of the stable fixed points $f_1$ and $f_2$.

Now, to each system, one can attribute at least one intrinsic length scale, namely the
length $\xi$ determining the exponential decay of field correlations. However, the existence of a
finite, and pre-determined, intrinsic length scale clearly does not go together with invariance
under scale transformations. We thus conclude that, at a fixed point, either $\xi = 0$ (not so
interesting), or $\xi = \infty$. However, a diverging correlation length $\xi \to \infty$ is a hallmark of
a second-order phase transition. We thus tentatively identify fixed points of the RG flow
as candidates for “transition points” of the physical system. (For a more comprehensive 
review of phase transitions and the critical phenomena accompanying them, see the Info 
block starting on page 436 below.) This being so, it is natural to pay special attention to 
the behavior of the flow in the immediate vicinity of the fixed-point manifolds. If the set 
of coupling constants, g, is only close enough to a fixed point, g*, it will be sufficient to 
consider the linearized mapping 

$$R(g) = R((g - g^*) + g^*) \simeq W (g - g^*), \qquad W_{ab} = \left.\frac{\partial R_a}{\partial g_b}\right|_{g = g^*}.$$

To explore the properties of the flow, let us assume that we had managed to diagonalize the
matrix W. Denoting the eigenvalues by $\lambda_\alpha$, $\alpha = 1, \ldots, N$, and the left-eigenvectors¹⁵ by $\Phi_\alpha$,
we have

$$\Phi_\alpha^T W = \Phi_\alpha^T \lambda_\alpha.$$

The advantage of proceeding via the unconventional set of left-eigenvectors is that it allows
us to conveniently express the flow of the physical coupling constants under renormalization.
To this end, let $v_\alpha$ be the αth component of the vector $g - g^*$ when represented in the basis
$\{\Phi_\alpha\}$:

$$v_\alpha = \Phi_\alpha^T (g - g^*).$$


15 Since there is no reason for W to be symmetric, the left- and right-eigenvectors may be different.
These components display a particularly simple behavior under renormalization:

$$\frac{dv_\alpha}{d\ell} = \Phi_\alpha^T \frac{d}{d\ell}(g - g^*) = \Phi_\alpha^T W (g - g^*) = \lambda_\alpha v_\alpha.$$

Under renormalization, the coefficients $v_\alpha$ change by a mere scaling factor $\lambda_\alpha$, wherefore
they are called scaling fields - a somewhat unfortunate nomenclature. (The coefficients
$v_\alpha$ are actually not fields but simply a set of $\ell$-dependent coefficients, the vector of coupling
constants when expressed in the basis of eigenvectors $\Phi_\alpha$.) These equations are trivially
integrated to obtain

$$v_\alpha(\ell) \sim \exp(\ell \lambda_\alpha).$$
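
The role of the left-eigenvectors can be checked numerically. The sketch below assumes a hypothetical, deliberately non-symmetric 2x2 stability matrix `W` (it is not derived from any model in the text), obtains the left-eigenvectors as eigenvectors of the transpose, and compares the evolved scaling fields with the exponential law above. All names are illustrative.

```python
import numpy as np

# Hypothetical, deliberately non-symmetric stability matrix W (not from the text).
W = np.array([[1.0, 2.0],
              [0.0, -1.0]])

# Left-eigenvectors of W are right-eigenvectors of W^T: Phi^T W = lambda Phi^T.
lam, Phi = np.linalg.eig(W.T)   # column Phi[:, a] belongs to eigenvalue lam[a]

def flow(dg0, ell):
    """Solve d(dg)/dl = W dg exactly via the right-eigenvector decomposition of W."""
    mu, V = np.linalg.eig(W)
    return (V @ np.diag(np.exp(mu * ell)) @ np.linalg.inv(V) @ dg0).real

def scaling_fields(dg):
    """Components v_a = Phi_a^T (g - g*) of a deviation dg = g - g*."""
    return Phi.T @ dg

dg0 = np.array([0.3, -0.1])     # initial deviation g - g* (arbitrary choice)
ell = 0.7
v_start = scaling_fields(dg0)
v_end = scaling_fields(flow(dg0, ell))

# Classify each scaling field by the sign of its eigenvalue.
labels = ["relevant" if x > 0 else "irrelevant" if x < 0 else "marginal" for x in lam]
```

Under these assumptions `v_end` agrees with `np.exp(lam * ell) * v_start` component by component, even though the coupling constants themselves mix under the flow because W is not symmetric.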

This result suggests a discrimination between at least three different types of scaling fields: 

▷ For $\lambda_\alpha > 0$ the flow is directed away from the critical point. The associated scaling field
is said to be relevant (in the sense that it forcefully drives the system away from the
critical region). In Fig. 8.5, $v_2$ is a relevant scaling field.
▷ In the complementary case, $\lambda_\alpha < 0$, the flow is attracted by the fixed point. Scaling fields
with this property ($v_1$, $v_3$) are said to be irrelevant.¹⁶
▷ Finally, scaling fields which are invariant under the flow, $\lambda_\alpha = 0$, are termed marginal.¹⁷

The distinction of relevant/irrelevant/marginal scaling fields in turn implies a classification
of different types of fixed points: 

▷ Firstly, there are stable fixed points, i.e. fixed points whose scaling fields are all irrelevant
or, at worst, marginal. These points define what we might call “stable phases of
matter” : when you release a system somewhere in the parameter space surrounding any 
of these attractors, it will scale towards the fixed point and eventually sit there. Or, 
expressed in more physical terms, looking at the problem at larger and larger scales will 
make it more and more resemble the infinitely correlated self-similar fixed-point configu- 
ration. (Recall the example of the high-temperature fixed line of the one-dimensional Ising 
model encountered earlier.) By construction, the fixed point is impervious to moderate 
variations in the microscopic morphology of the system, i.e. it genuinely represents what 
one might call a “state of matter.” 

▷ Complementary to stable fixed points, there are unstable fixed points. Here, all scaling
fields are relevant (cf. the T = 0 fixed point of the 1-D Ising model). These fixed points
represent the concept of a Platonic ideal: you can never get there and, even if you managed
to approach it closely, the harsh conditions of reality will make you flow away from it.
Although unstable fixed points do not correspond to realizable forms of matter, they are
of importance inasmuch as they “orient” the global RG flow of the system.

16 The terminology “irrelevant” indicates that a scaling field of negative dimension usually does not play much of
a physical role. There are, however, exceptions to this rule. For instance, it may happen that the free energy
of the system depends in a singular manner on an irrelevant scaling variable - in which case the variable is
called dangerously irrelevant. Dangerously irrelevant scaling variables not only strongly affect the outcome
of the theory, but also invalidate the applicability of a number of established concepts of RG theory (such as
the scaling laws to be discussed below).

17 A marginal scaling field corresponds to a direction in coupling constant space with vanishing partial derivative,
$\partial_{\Phi_\alpha} R|_{g = g^*} = 0$. In this case, to obtain a refined picture, one sometimes considers the second-order derivative,
$\partial^2_{\Phi_\alpha} R|_{g = g^*} \equiv 2x$. In the vicinity of the fixed point, the scaling field then behaves as $d_\ell v_\alpha = x v_\alpha^2$. For x > 0
(x < 0) the field has the status of a marginally relevant (irrelevant) scaling field. It is relevant (irrelevant)
on account of the non-vanishing direction of the flow. However, it is also “marginal” because the speed of the
flow decreases upon approaching the critical regime.

▷ Finally, there is the generic class of fixed points with both relevant and irrelevant
scaling fields. These points are of particular interest inasmuch as they can be associated
with phase transitions. To understand this point, we first note that the r eigenvectors
$\Phi_\alpha$ associated with irrelevant scaling fields span the tangent space of an r-dimensional
manifold known as the critical surface. (A schematic illustration for the case r = 2
is shown in Fig. 8.5.) This critical manifold forms the basin of attraction of the fixed
point, i.e. whenever a set of physical coupling constants g is fine-tuned so that $g \in S$, the
expansion in terms of scaling fields contains only irrelevant contributions and the system
will feel attracted to the fixed point as if it were a stable one.

However, the smallest deviation from the critical surface introduces a relevant compo- 
nent driving the system exponentially away from the fixed point. A sketch of the resulting 
flow is shown in Fig. 8.5 for the case of just one relevant scaling field. For example, in the 
case of the ferromagnetic phase transition - discussed in more detail in the next section - 
deviations from the critical temperature T c are relevant. If we consider a system only 
slightly above or below T c , it may initially (on intermediate length scales) appear to be 
critical. However, upon further increasing the scale, the relevant deviation will grow and 
drive the system away from criticality, either towards the stable high-temperature fixed 
point of the paramagnetic phase (T > T c ) or towards the ferromagnetic low-temperature 
phase (T < T c ). 

This picture actually suggests that systems with generic fixed points typically possess 
complementary stable fixed points, i.e. fixed points towards which the flow is directed 
after it has left the critical region. We also notice that a scaling direction that is relevant
at one fixed point (e.g. $\Phi_2$ at the critical fixed point) may be irrelevant at others ($\Phi_2$ at
the high- and low-temperature fixed points).
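
The interplay of a generic fixed point, its critical surface, and the complementary stable fixed points can be illustrated with a minimal numerical sketch. The two-coupling flow below, $dr/d\ell = r - r^3$, $du/d\ell = -u$, is a toy assumption and not the flow of any specific model: (0, 0) plays the role of the critical fixed point with critical surface r = 0, while (+1, 0) and (-1, 0) stand in for the two stable fixed points. The function names are hypothetical.

```python
def toy_beta(r, u):
    """Hypothetical flow: r is relevant at (0, 0), u is irrelevant everywhere."""
    return r - r**3, -u

def integrate(r, u, ell_max=20.0, steps=2000):
    """Classical fourth-order Runge-Kutta integration of the toy flow up to ell_max."""
    h = ell_max / steps
    for _ in range(steps):
        k1 = toy_beta(r, u)
        k2 = toy_beta(r + 0.5 * h * k1[0], u + 0.5 * h * k1[1])
        k3 = toy_beta(r + 0.5 * h * k2[0], u + 0.5 * h * k2[1])
        k4 = toy_beta(r + h * k3[0], u + h * k3[1])
        r += h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0
        u += h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0
    return r, u

on_surface = integrate(0.0, 0.5)    # fine-tuned onto the critical surface r = 0
above = integrate(+1e-3, 0.5)       # tiny relevant deviation of one sign
below = integrate(-1e-3, 0.5)       # tiny relevant deviation of the other sign
```

In this sketch the fine-tuned trajectory is drawn into the critical fixed point, whereas the two slightly detuned trajectories first linger near it and then run off to the two different stable fixed points, mirroring the flow to the high- and low-temperature phases described above.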

INFO The discussion above suggests that the concept of renormalization is intimately linked to
the theory of phase transitions and critical phenomena, the traditional platform for the 
development of the subject in the literature. In view of the existing wealth of literature (and 
acknowledging the fact that we are approaching the field from a more operational perspective), 
we shall not endeavor to present another “introduction to the theory of critical phenomena.” 
Rather, we will summarize in a concise, but hopefully self-contained, manner, those few tenets 
and principles that are necessary to place the concept of renormalization into a larger physical 
context. 

The most fundamental¹⁸ signature of a phase transition is its order parameter, M, i.e., a
quantity whose value unambiguously identifies the phase of the system. Examples from classical 
statistical mechanics include the magnetization for the ferromagnetic-paramagnetic transition, 
the density for the liquid-vapor transition, the order parameter amplitude for the BCS transi- 
tion, etc. (However, to keep the terminology concrete, we shall mostly use the language of the 
ferromagnetic transition in the following.) 


18 Notice that there are transitions whose order parameter is actually unknown. A famous example is the quantum
Hall transition discussed in more detail in Section 9.3.4.

Figure 8.6 Phase diagram of the ferromagnetic transition. Tuning a magnetic field at fixed temperature
$T < T_c$ through zero causes the magnetization to jump discontinuously: $[0, T_c]$ is a line of
first-order transitions. This line terminates in the unique second-order transition point of the system,
$(T = T_c, H = 0)$. Lowering the temperature at H = 0 causes the non-analytic, but continuous,
development of a finite magnetization at $T < T_c$.


Transitions between different phases of matter fall into two large categories.¹⁹ In first-order
phase transitions the order parameter exhibits a discontinuous jump across the transition line 
while, in the complementary class of second-order transitions, the order parameter changes in 
a non-analytic but continuous manner. (The two cases are exemplified in Fig. 8.6 by the classical 
ferromagnet.) 

The phenomenology of second-order transitions is generally richer than that of first-order 
transitions. As a thermodynamic state variable, the order parameter is coupled to a conjugate 
field, H: $M = -\partial_H F$, where F is the free energy. At a second-order transition, M changes
non-analytically, which means that the second-order derivative, a thermodynamic susceptibility,
$\chi = -\partial_H^2 F$, develops a singularity. Now, you may recall from the discussion of the
fluctuation-dissipation theorem that the susceptibility is intimately linked to the field fluctuation behavior
of the system. More precisely, $\chi$ is proportional to the integral over the correlation function C
determining the fluctuation behavior of the fields (cf. Eq. (8.3)). A divergence of the susceptibility 
implies the accumulation of infinitely long-range field fluctuations. 

The divergence of the susceptibility goes hand in hand with non-analytic and/or singular 
behavior of all sorts of other physical quantities. In fact, an even stronger statement can be made. 
We have seen that, right at the transition/fixed point, the system is self-similar. This implies 
that the behavior of its various characteristics must be described by power laws. Referring for 
a more substantial discussion to Section 8.3.3 below, we here merely support this statement by 
a heuristic argument. Consider a function f(t), where f is representative of an observable of
interest and t is a control parameter (a scaling field) determining the distance to the transition
point. In the immediate vicinity of the transition point, f is expected to “scale,” i.e. under a
change of the length scale $x \to x/b$, $t \to t b^{-D_t}$, the function f must, at most, change by a factor
reflecting its own scaling dimension, $f(t) = b^{D_f} f(t b^{-D_t})$. (A more serious, structural change of
the function would be in conflict with asymptotic self-similarity.) Mathematically speaking, this
equation amounts to homogeneity of the function f: choosing $b = t^{1/D_t}$ gives $f(t) = t^{D_f/D_t} f(1)$,
i.e. $f \sim t^{D_f/D_t}$.


19 Readers absolutely unfamiliar with the thermodynamics of phase transitions may wish to consult the corresponding
section of a textbook on statistical mechanics.

The set of different exponents characterizing the relevant power laws occurring in the vicinity
of the transition are known as critical exponents. For at least four different reasons, the set 
of critical exponents represents the most important structural fingerprint of a transition: 

1. They carry universal significance, i.e. we do not have to invent a set of critical exponents for
each transition anew. (For example, the divergence of the correlation length, $\xi \sim |t|^{-\nu}$, is
characterized by a critical exponent commonly, and irrespective of the particular transition
under consideration, denoted by $\nu$.)

2. The set of critical exponents carries the same information as the set of exponents of the 
scaling fields, i.e. knowledge of the critical exponents is equivalent to the knowledge of the 
linear dynamical system characterizing the flow in the transition region. (In fact, the set of 
critical exponents overdetermines the scaling field exponents, i.e. it contains redundancy. For 
example, of the six critical exponents characterizing the magnetic transition, only two are
independent. The others are interrelated by scaling laws²⁰ or exponent identities to be
discussed below.)

3. Critical exponents are fully universal; they are numbers depending, at most, on dimensionless 
characteristics such as the space-time dimensionality or number of components of the order 
parameter. 

4. Perhaps most importantly, the critical exponents represent quantities that can be measured. In 
fact, their universality and structural importance make them quantities of prime experimental 
interest. 

In the following, let us briefly enumerate the list of the most relevant exponents, $\alpha$, $\beta$, $\gamma$, $\delta$, $\eta$,
$\nu$, and z.²¹ Although we shall again make use of the language of the magnetic transition, it is
clear that (and, indeed, how) the definitions of most exponents generalize to other systems. 


$\alpha$: In the vicinity of the critical temperature, the specific heat $C = -T \partial_T^2 F$ scales as $C \sim |t|^{-\alpha}$,
where $t = (T - T_c)/T_c$ measures the distance to the critical point. Note that, by virtue of
this definition, a non-trivial statement has been made: although the phases above and below
the transition are essentially different, the scaling exponents controlling the behavior of C are
identical. The same applies to most other exponents listed below.

$\beta$: Approaching the transition temperature from below, the magnetization vanishes as
$M = -\partial_h F|_{h \searrow 0} \sim (-t)^{\beta}$.

$\gamma$: The magnetic susceptibility behaves as $\chi = \partial_h M|_{h \searrow 0} \sim |t|^{-\gamma}$.

$\delta$: At the critical temperature, t = 0, the field dependence of the magnetization is given by
$M \sim |h|^{1/\delta}$.

$\nu$: Upon approaching the transition point, the correlation length diverges as $\xi \sim |t|^{-\nu}$.

$\eta$: This implies that the correlation function,

$$C(r) \sim \begin{cases} |r|^{-(d-2+\eta)}, & |r| \ll \xi, \\ \exp[-|r|/\xi], & |r| \gg \xi, \end{cases}$$

crosses over from exponential to power-law scaling behavior at the length scale $\xi$. To
motivate the power, one may notice that $C \sim \langle \phi\phi \rangle$ carries twice the dimension of the field
$\phi$. The engineering dimension of the latter follows from the requirement that the gradient
operator $\sim \int d^d r\,(\nabla\phi)^2$ be dimensionless: $[\phi] = L^{(2-d)/2}$, according to which C(r) has canonical
dimension $L^{2-d}$. The exponent $\eta$, commonly referred to as the anomalous dimension
of the correlation function, measures the mismatch between the observed and the canonical
dimension.

z: A quantum theory can, to a large extent, be viewed as a kind of classical theory in d + 1
dimensions. The theory is “quantum critical” if the effective classical theory contains a critical
point. In the vicinity of that point large fluctuations are observed in both the d spatial directions
and the temporal “direction.” However, the different physical origin of these dimensions
manifests itself in the scaling being anisotropic. Denoting the correlation length in the temporal
direction by $\tau$, we define $\tau \sim \xi^z$, where deviations $z \neq 1$ of the dynamical exponent
measure the degree of anisotropy.

20 Unfortunately, the language used in the field of critical phenomena makes excessive use of the prefix “scaling.”
21 Historically, the exponents are drawn from the first six letters of the Greek alphabet. The exceptional designation
of the last exponent, z, betrays the fact that quantum dynamical fluctuations were considered only later.

Now, a moment’s thought shows that, of the six classical exponents, only a few can be truly 
independent. Previously we have noted that, modulo irrelevant perturbations, the flow in the 
vicinity of a transition point is controlled by the relevant scaling fields. Referring for a more 
quantitative discussion to Section 8.4 below, we anticipate that, for the magnetic transition, the 
magnetic field will certainly represent a relevant perturbation (a fact readily expressed by the 
positivity of the exponent $\delta$). Moreover, deviations from the critical temperature, $t \neq 0$, are also
relevant.²² However, for the magnetic transition, that exhausts the list; in the asymptotic vicinity
of the transition, the flow is controlled by a two-dimensional dynamical system. This suggests 
that four constraining equations should reduce the set of six classical exponents to only two 
independent ones. Historically, these scaling laws were discovered one by one (at a time when 
the underlying connections to the system of “scaling fields” had not been fully appreciated). 
For the sake of reference, these constraint equations (along with the names of the people who 
discovered them) are listed below. In Section 8.3.3 below, we exemplify how the scaling laws can 
be transparently derived from the intrinsic structure of the theory. 


Fisher:      $\nu(2 - \eta) = \gamma$
Rushbrooke:  $\alpha + 2\beta + \gamma = 2$
Widom:       $\beta(\delta - 1) = \gamma$
Josephson:   $2 - \alpha = \nu d$

For practical purposes, we need only compute/measure two exponents - no matter which - to 
fully specify the scaling structure of the theory. 
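
As a quick consistency check, the four scaling laws can be evaluated on two standard sets of exponents. The values below are assumptions imported from the general literature rather than from this section: the exact exponents of the two-dimensional Ising model and the mean-field exponents, the latter evaluated at d = 4 where the Josephson law holds for them. The helper name `scaling_law_residuals` is hypothetical.

```python
from fractions import Fraction as Fr

def scaling_law_residuals(alpha, beta, gamma, delta, nu, eta, d):
    """Return left-hand side minus right-hand side for each of the four scaling laws."""
    return {
        "Fisher": nu * (2 - eta) - gamma,
        "Rushbrooke": alpha + 2 * beta + gamma - 2,
        "Widom": beta * (delta - 1) - gamma,
        "Josephson": (2 - alpha) - nu * d,
    }

# Standard literature values (assumed here, not derived in the text).
ising_2d = dict(alpha=Fr(0), beta=Fr(1, 8), gamma=Fr(7, 4), delta=Fr(15),
                nu=Fr(1), eta=Fr(1, 4), d=2)
mean_field_d4 = dict(alpha=Fr(0), beta=Fr(1, 2), gamma=Fr(1), delta=Fr(3),
                     nu=Fr(1, 2), eta=Fr(0), d=4)

residuals = {name: scaling_law_residuals(**exponents)
             for name, exponents in [("2D Ising", ising_2d),
                                     ("mean field, d = 4", mean_field_d4)]}
```

With exact rational arithmetic all residuals vanish for both sets, so in each case two exponents indeed suffice to fix the remaining four.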

In the next section we discover that the dynamical system of scaling fields encapsulates 
practically all information about “critical” fluctuation phenomena accompanying a phase 
transition. However, for the moment, we shall restrict ourselves to the discussion of one 
more aspect of conceptual importance, namely universality. In fact, the majority of critical 
systems can be classified into a relatively small number of universality classes. Crudely 
speaking, leaving apart more esoteric classes of phase transitions there are $\mathcal{O}(10^1)$ fundamentally
different types of flow recurrently appearing in practical applications. This has to
be compared with the near infinity of different physical systems that display critical phenomena.
Why, then, is it that the plethora of all these transitions can be grouped into a
very limited set of different universality classes? Remarkably, the origin of this universality
can readily be understood from the concept of critical surfaces.

22 If you find it difficult to think of temperature as a “coupling constant,” remember that, in our derivation of
the $\phi^4$-model as the relevant theory of the magnetic transition, the coupling constant of the “mass operator”
$r \int d^d r\, \phi^2$ turned out to be proportional to the reduced temperature $t = |T - T_c|/T_c$.


Imagine, then, an experimentalist exploring a system that is known to exhibit a phase transition.
Motivated by the critical phenomena that accompany phase transitions, the available control
parameters $X_i$ (temperature, pressure, magnetic field, etc.) will be varied until the system
begins to exhibit large fluctuations. On a theoretical level, the variation of the control
parameters determines the initial values of the coupling constants of the model (as they
functionally depend on the $X_i$s through their connection to the microscopic Hamiltonian).
In Fig. 8.5 the curve in coupling constant space defined in this way is indicated by $\gamma$.
For microscopic parameters corresponding to a point above or below the
critical manifold, the system asymptotically (i.e. when looked at at sufficiently large scales)
falls into either the “high-” or the “low-temperature” regime (as indicated by the curves
branching out from $\gamma$ in Fig. 8.5). However, eventually the trajectory through parameter
space will intersect the critical surface. For this particular set of coupling constants, the
system is critical. As we look at it on larger and larger length scales, it will be attracted by
the fixed point at S, i.e. it will display the universal behavior characteristic of this particular
point. This is the origin of universality: variation of the system parameters in a different
manner (or for that matter considering a second system with different material constants)
will generate a different trajectory $g_a(\{X_i\}) \equiv \gamma'$. However, as long as this trajectory
intersects with S, it is guaranteed that the critical behavior will exhibit the same universal
characteristics (controlled by the unique fixed point).

In fact a more far-reaching statement can be made. Given that there is an infinity of 
systems exhibiting transition behavior (symbolically indicated by the row of boxes in the 
upper part of the figure above) while there is only a very limited set of universality classes 
(the set of boxes on the left), many systems of very different microscopic morphology must 
have the same universal behavior. More formally, different microscopic systems must map 
onto the same critical low-energy theory. Examples of these coincidences include (to mention 
but a few entries of an endless list) the equivalence of the disordered Luttinger liquid to 
a Josephson junction (cf. Problem 6.7), the equivalence of models of planar magnets (see 
Section 8.6 below) to two-dimensional classical Coulomb plasmas, and the equivalence of the 
liquid-gas transition to the ferromagnetic transition. (In all cases, “equivalence” means that 
the systems exhibit identical scaling behavior and, therefore, fall into the same universality 
class.) Further coincidences of this type will be encountered below. 



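[Figure: microscopic systems (upper row of boxes) map onto a small set of universality classes (boxes on the left), which yield universal predictions.]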
8.3.3 Scaling theory

Previously, we have seen that the dynamical system of scaling fields encodes a wealth of 
information on the large-scale structure and on the phases of a physical system. However, we 
have not yet established a connection between the concept of renormalization and concrete 
(i.e. experimentally accessible) data. This is the subject of the present section. Imagine, 
then, that we had represented some observable of experimental interest, X, in the language 
of the functional integral. According to the discussion of the previous chapter this means 
that we have managed to express 


$$X = \sum_{p_i} C(p_i, g_a),$$

as the sum over an n-point correlation function $C(p_i, g_a) \equiv \langle (\cdots)\phi\phi\cdots\phi \rangle_\phi$, where the
(symbolic) notation indicates that C may depend both on the momentum scale at which it
is evaluated (e.g. through the explicit momentum dependence of current operators, etc.) and
on the coupling constants. The ellipses $(\cdots)$ stand for optional algebraic elements entering
the definition of the correlation function.

We next build on our assumption of renormalizability of the theory, i.e. we make use of 
the fact that we can evaluate C before or after an RG step; the result must be the same. On 
the other hand, the RG transformation will, of course, not leave the individual constituents 
entering the definition of C invariant; it will change coupling constants, $g_a$, the momenta
$p_i$, and the field amplitudes $\phi$ according to the prescriptions formulated in the previous
section. Expressed in a single formula,

$$C(p_i, g_a) = b^{n d_\phi}\, C(p_i b, g_a b^{\lambda_a}), \qquad (8.18)$$


where we have simplified the notation by assuming that the coupling constants themselves 
scale (for, otherwise, the matrix elements of a linear transformation mediating between the 
coupling constants and the scaling fields would appear). For notational convenience, let us 
also assume that the fixed point values of the coupling constants are specified in such a way 
that $g^* = 0$. The factor $b^{n d_\phi}$ accounts for the explicit rescaling of the n fields entering the
definition of C. 

Notice that Eq. (8.18) presents a remarkable statement. Although the three different 
elements ($\phi$, $p_i$, $g_a$) contributing to the correlation function change under the transformation
in seemingly unrelated manners, the net result of the concerted rescaling is nil. Indeed, 
Eq. (8.18) serves as a starting point for the derivation of various relations of immediate 
practical relevance. 


Scaling functions 

Let us return to a principle already employed in connection with the one-dimensional Ising 
model. For concreteness, imagine that we are working under conditions where there is just a 
single relevant scaling field $g_1$, while all $g_{a>1}$ are irrelevant (or, for that matter, marginal).
We can then write

$$C(p_i, g_1, g_a) = b^{n d_\phi}\, C(p_i b, g_1 b^{\lambda_1}, g_a b^{\lambda_a}) = g_1^{-n d_\phi/\lambda_1}\, C(p_i g_1^{-1/\lambda_1}, 1, g_a g_1^{-\lambda_a/\lambda_1})$$
$$\overset{g_1 \ll 1}{\approx} g_1^{-n d_\phi/\lambda_1}\, C(p_i g_1^{-1/\lambda_1}, 1, 0) \equiv g_1^{-n d_\phi/\lambda_1}\, F(p_i g_1^{-1/\lambda_1}).$$

Here, we have used the freedom of arbitrarily choosing the parameter b to set $g_1 b^{\lambda_1} = 1$
while, in the third equality, we have assumed that we are sufficiently close to the transition
that the dependence of C on irrelevant scaling fields is inessential. The function F defined
through the relation

$$C(p_i, g_1) = g_1^{-n d_\phi/\lambda_1}\, F(p_i g_1^{-1/\lambda_1}) \qquad (8.19)$$

is an example of a scaling function. Alternatively (for example, if C represents a thermodynamic
observable or a global transport coefficient) we might be interested in the correlation
function $C(g_1, g_a) \equiv C(p_i = 0, g_1, g_a)$ at zero external momentum $p_i = 0$. In this case, a
typical question to ask would be the dependence of C on the most relevant and the second
most relevant control parameter $g_2$ (where we leave unspecified whether $g_2$ is relevant,
marginal, or irrelevant). Following the same logic as above, one obtains

$$C(g_1, g_2) = g_1^{-n d_\phi/\lambda_1}\, F(g_2 g_1^{-\lambda_2/\lambda_1}),$$

with some different scaling function F.

INFO As an example particularly relevant to the comparison between analytical theory and 
numerics, we note the concept of finite-size scaling. While analytical theories are most conveniently 
formulated in the thermodynamic limit, numerical simulations are carried out for systems 
of still very limited size. Comparing theory with numerical simulations therefore requires that we 
keep track of the system size explicitly under renormalization. Indeed, the system size 
$L$ has dimension [length] and, therefore, gets rescaled as $L \to L/b$. Setting $L/b = 1$, we obtain a 
scaling function 

$$C(g_1, L) = L^{n d_\phi}\, F(g_1 L^{y_1}),$$

with explicit system size dependence. 

While the construction of any particular scaling function may be context-dependent, the 
principle behind the derivation is general: once the scaling behavior of a correlation function 
is known, the arbitrariness of the scaling parameter b can be used to reduce the number of 
independent variables by one. The reduced correlation function is called a scaling function. 
As with the response functions discussed in the previous chapter, scaling functions also 
represent a prime interface between theory and experiment. Experimentally, the measurement 
of an observable $X$ in its dependence on a number of relevant system parameters, 
$t$ and $h$ say, results in a multi-parameter function $X(t,h)$. In fact, a better way to think 
about this object is as a set of one-dimensional functions $X_h(t)$ depending on a parameter 
$h$. (This is because, in experiment, one typically varies only a single control parameter, e.g. 
temperature at fixed magnetic field.) Scaling implies that all these functions collapse onto 
a generic one-dimensional 23 profile, if only the data are plotted as a function of the relevant 
scaling parameter $t h^x$. 

This mechanism can be exploited in several different ways. For example, if there is not 
yet a theory of the transition phenomenon in question, an experimentalist may empirically 
identify the relevant scaling parameters and pose the explanation of the observed scaling 
exponent x - by construction a fully universal number - as a challenge to theorists. 24 
Conversely, theorists may suggest a scaling exponent that can be put to the test by checking 
whether the experimental data collapse onto this exponent. Summarizing, one of the great 
virtues of the concept of scaling is that it condenses the information exchange between 
experiment and theory (and analytical theory and numerics for that matter) into a small 
set of universal numbers. 
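
The collapse mechanism can be illustrated with synthetic data. The sketch below is only an illustration under stated assumptions: the observable is taken to have the form $X(t,h) = h^a F(t h^x)$, and the scaling function `F`, the exponents `A_TRUE` and `X_TRUE`, and all helper names are hypothetical choices that do not come from the text. Curves recorded at different $h$ are rescaled with trial exponents, and the residual spread between them measures the quality of the collapse.

```python
def F(u):
    """Hypothetical scaling function, chosen only for illustration."""
    return 1.0 / (1.0 + u * u)

A_TRUE, X_TRUE = 0.5, -0.6  # illustrative exponents, not taken from the text

def measure(t, h):
    """Synthetic 'measurement' X(t, h) = h**a * F(t * h**x)."""
    return h**A_TRUE * F(t * h**X_TRUE)

def rescale(curve, h, a, x):
    """Map raw points (t, X) taken at fixed h to (t * h**x, X / h**a)."""
    return sorted((t * h**x, X / h**a) for t, X in curve)

def interpolate(points, u):
    """Linear interpolation in a sorted list of (u, y) points."""
    for (u0, y0), (u1, y1) in zip(points, points[1:]):
        if u0 <= u <= u1:
            w = (u - u0) / (u1 - u0)
            return (1.0 - w) * y0 + w * y1
    raise ValueError("u lies outside the sampled range")

def collapse_spread(curves, a, x, n_grid=50):
    """Largest vertical distance between rescaled curves on their common u range."""
    rescaled = [rescale(curve, h, a, x) for h, curve in curves.items()]
    lo = max(r[0][0] for r in rescaled)
    hi = min(r[-1][0] for r in rescaled)
    # Interior grid points only, so that rounding cannot push u out of range.
    grid = [lo + (hi - lo) * (i + 0.5) / n_grid for i in range(n_grid)]
    spreads = []
    for u in grid:
        ys = [interpolate(r, u) for r in rescaled]
        spreads.append(max(ys) - min(ys))
    return max(spreads)

ts = [0.01 * i for i in range(1, 301)]  # t from 0.01 to 3.0
curves = {h: [(t, measure(t, h)) for t in ts] for h in (0.5, 1.0, 2.0)}

good = collapse_spread(curves, A_TRUE, X_TRUE)  # correct exponents
bad = collapse_spread(curves, A_TRUE, 0.0)      # wrong exponent x
print(f"spread with correct exponents: {good:.2e}")
print(f"spread with wrong exponent:    {bad:.2e}")
```

With the correct exponents the spread is limited only by the interpolation error, whereas a wrong trial exponent leaves the curves visibly separated. In practice one would minimize such a spread over the trial exponents, keeping in mind the caveats about noisy data raised in footnote 24.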


INFO For the sake of completeness we mention that, especially in the field theoretical community, 
the information encapsulated in the scale-dependent correlation functions is often represented in 
a different manner. Starting out from the relation 

$$C(p_i, g_a) = e^{n d_\phi \ell}\, C(p_i e^{\ell}, g_a(\ell)),$$

where we have set $b = e^{\ell}$, we can use the $\ell$-independence of the left-hand side to write 
$0 = \frac{d}{d\ell}\left[ e^{n d_\phi \ell}\, C(p_i e^{\ell}, g_a(\ell)) \right]$. (Notice that, here, we do not need to be in the asymptotic scaling regime, 
i.e., for the sake of the present construction, the $\ell$-dependence of the coupling constants need 
not be explicitly exponential.) We next carry out the $\ell$-differentiation to obtain 

$$\left[ n\left( d_{e,\phi} + \frac{\eta}{2} \right) + \partial_\ell + \beta_a(g_a)\, \partial_{g_a} \right] C(p_i e^{\ell}, g_a(\ell)) \Big|_{\ell = 0} = 0. \qquad (8.20)$$

Here, $d_{e,\phi}$ is the engineering dimension of the field $\phi$ and $\eta/2 = d_\phi - d_{e,\phi}$ its anomalous dimension 
(see the definition of $\eta$ in the Info block starting on page 436). Further, the partial derivative $\partial_\ell$ 
acts on the explicit scale dependence of the momentum arguments (or any other explicitly 
scale-dependent argument for that matter). Finally, $\beta_a(g_a)$ is the $\beta$-function defined above. Equation 
(8.20) is known as a renormalization group equation. Both the RG equation and the scaling 
form that we used to derive it equivalently express the scaling behavior of the correlation function. 


Scaling functions and critical exponents 

Another important aspect of scaling theory is that it can be used to disclose relations 
between the seemingly independent 25 critical exponents of the theory. For the sake of concreteness, 
let us consider the case of the ferromagnetic transition, i.e. a transition we have 
previously characterized in terms of six critical exponents $\alpha, \ldots, \eta$ (see page 438). However, 
the flow in the vicinity of the magnetic fixed point is controlled by only two relevant scaling 
fields, the (reduced) temperature t and the reduced magnetic field h = H/T. Neglecting 


23 For an $n$-dimensional data set, the collapse is to an $(n-1)$-dimensional functional set. 

24 Parenthetically, one may note that the empirical collapse of experimental data onto scaling functions requires 
a lot of skill. For example, if the data set consists of a number of functional “patches” of only limited overlap, 
it is quite “easy” to construct a scaling function of, in fact, almost any desired power law dependence. Data of 
this type tend to contain a lot of statistical uncertainty, which can easily lead to erroneous conclusions. 

25 After all, the critical exponents describe the behavior of quite different physical observables in the transition 
region. 
irrelevant perturbations, we thus conclude that, under a renormalization group transformation, 
the reduced free energy $f \equiv F/TL^d$ will behave as 26 $f(t,h) = b^{-d} f(t b^{y_t}, h b^{y_h})$. We 
next fix $t b^{y_t} = 1$ to reduce the number of independent variables to one: 

$$f(t,h) = t^{d/y_t}\, \tilde f\!\left( h/t^{y_h/y_t} \right), \qquad \tilde f(x) \equiv f(1, x). \qquad (8.21)$$


Containing the complete thermodynamic information, Eq. (8.21) is all that we need to 
compute the critical exponents. Indeed, comparing with the definitions summarized on 
page 438, it is straightforward to show that 


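$$\alpha = 2 - \frac{d}{y_t}, \qquad \beta = \frac{d - y_h}{y_t}, \qquad \gamma = \frac{2 y_h - d}{y_t},$$

$$\delta = \frac{y_h}{d - y_h}, \qquad \nu = \frac{1}{y_t}, \qquad \eta = 2 + d - 2 y_h, \qquad (8.22)$$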


from where follow the cross-relations summarized in the table on page 439 by direct com- 
parison. These relations illustrate our previous assertion that, conceptually, the dimensions 
of the relevant scaling fields have a more fundamental status than the critical exponents. 
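
As a hedged illustration of this point, the following sketch computes all six exponents from the two scaling dimensions $y_t$ and $y_h$ using the relations of Eq. (8.22), and checks the standard cross-relations in exact rational arithmetic. The function names are hypothetical, and the input values $y_t = 2$, $y_h = d/2 + 1$ with $d = 3$ are simply those of the Gaussian model discussed in Section 8.4.3.

```python
from fractions import Fraction

def critical_exponents(y_t, y_h, d):
    """Exponents expressed through the scaling dimensions, as in Eq. (8.22)."""
    y_t, y_h, d = Fraction(y_t), Fraction(y_h), Fraction(d)
    return {
        "alpha": 2 - d / y_t,
        "beta": (d - y_h) / y_t,
        "gamma": (2 * y_h - d) / y_t,
        "delta": y_h / (d - y_h),
        "nu": 1 / y_t,
        "eta": 2 + d - 2 * y_h,
    }

def scaling_relations(e, d):
    """Truth values of the four cross-relations among the six exponents."""
    d = Fraction(d)
    return {
        "Rushbrooke: alpha + 2 beta + gamma = 2":
            e["alpha"] + 2 * e["beta"] + e["gamma"] == 2,
        "Widom: gamma = beta (delta - 1)":
            e["gamma"] == e["beta"] * (e["delta"] - 1),
        "hyperscaling: 2 - alpha = d nu":
            2 - e["alpha"] == d * e["nu"],
        "Fisher: gamma = nu (2 - eta)":
            e["gamma"] == e["nu"] * (2 - e["eta"]),
    }

# Gaussian-model scaling dimensions in d = 3: y_t = 2, y_h = d/2 + 1 = 5/2.
exponents = critical_exponents(2, Fraction(5, 2), 3)
relations = scaling_relations(exponents, 3)

for name, value in exponents.items():
    print(f"{name:>5} = {value}")
for name, holds in relations.items():
    print(f"{name}: {holds}")
```

Because only two numbers enter, the four relations hold identically for any admissible choice of $y_t$ and $y_h$, which is the precise sense in which the scaling dimensions are more fundamental than the exponents.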


EXERCISE Verify these statements. To obtain the fifth relation, the hyperscaling relation, 
notice that, under a change of scale, $\xi \to \xi/b$. On the other hand, we know that $\xi \sim t^{-\nu}$. The 
sixth relation is obtained from Eq. (8.3) by a substitution of the definition of the spatial profile of 
the correlation function in terms of the critical exponent $\eta$ into the integral to obtain a relation 
between the critical exponents $\gamma$ and $\eta$ (Fisher’s scaling law). 


8.4 RG analysis of the ferromagnetic transition 

In the previous section, we became acquainted with some fundamental elements of the 
structure of RG analyses, and their connection to the theory of critical phenomena. Being 
kept at a general and conceptual level, the discussion may have seemed somewhat abstract. 
Therefore, to elucidate the concepts introduced above, and to introduce some more elements 
of the RG, we turn now to a concrete application of the approach to the classical theory 
of the (uniaxial) ferromagnetic (or liquid-gas) transition. In Section 5.1, the $\phi^4$-theory was 
identified as an effective low-energy model of the ferromagnetic system. However, beyond 
the mean-field, we have not yet applied the model to explore the universal characteristics 
of the transition. In the following, we shall see that RG methods, and only RG methods, 
can be applied to successfully understand much of the intriguing behavior displayed by the 
(d > 2)-dimensional Ising model in the vicinity of its phase transition. 


26 Here we have made use of the fact that the reduced free energy does not carry an anomalous dimension. By 
definition, the free energy $F = -T \ln Z$ does not change under renormalization (which after all, merely amounts 
to representing the number $Z$ through functional integrals of different space-time resolution). Thus, the scaling 
of the reduced free energy is entirely carried by the prefactor $L^{-d}$. 
8.4.1 Preliminary dimensional analysis 

The first question that we wish to address has a somewhat technical status: with what 
justification was the Ising model represented in terms of the model action 27 

$$S[\phi] = \int d^d r \left[ \frac{r}{2}\phi^2 + \frac{1}{2}(\nabla\phi)^2 + \frac{\lambda}{4!}\phi^4 - h\phi \right], \qquad (8.23)$$

i.e. why was it possible to neglect both higher powers and gradients of the field $\phi$ that are 
surely present in the exact reformulation of the Ising problem in terms of $\phi$-variables? To 
rationalize the neglect of these terms, we proceed by dimensional analysis. Anticipating that 
the “real” dimensions carried by the operators in the action will be not too far from their 
engineering dimensions (see below), we begin by exploring the latter. We proceed along 
the lines of the general scheme outlined in the previous chapter and attribute a dimension 
of unity to the leading gradient term $\int d^d r\, (\nabla\phi)^2$ in the action. This entails the choice 
$[\phi] = L^{(2-d)/2}$, from where it is straightforward to attribute engineering dimensions to all other 
operators: 

$$\left[ \int d^d r\, \phi^2 \right] = L^{2}, \quad \left[ \int d^d r\, \phi^4 \right] = L^{4-d}, \quad \left[ \int d^d r\, \phi^n \right] = L^{d + n(1 - d/2)}, \quad \left[ \int d^d r\, (\nabla^m \phi)^2 \right] = L^{2(1-m)}.$$

These relations convey much about the potential significance of all structurally allowed 
operators: 

▷ The engineering dimension of the non-gradient operator $\sim \phi^2$ is positive in all dimensions, 
indicating general relevance. 

▷ The $\phi^4$ operator is relevant (irrelevant) in dimensions $d < 4$ ($d > 4$). This suggests that for 
$d > 4$ a harmonic approximation ($\lambda = 0$) of the model should be reasonable. It also gives 
us a preliminary clue as to how we might want to approach the $\phi^4$-model on a technical 
level: while for dimensions “much” smaller than $d = 4$ the interaction operator $\sim \phi^4$ is 
strongly relevant, the dimension $d = 4$ itself is borderline. This suggests that we analyze 
the model at $d = 4$, or maybe “close” 28 to $d = 4$ where the $\phi^4$ operator is not yet that 
virulent, and then try to extrapolate to infer what happens at the “physical dimensions” 
of $d = 2$ and 3. 

▷ Operators $\phi^{n>4}$ become relevant only in dimensions $d < (-1/n + 1/2)^{-1} < 4$. However, 
even below these threshold dimensions, operators of high powers in the field variable are 
much less relevant than the dominant non-harmonic operator $\int \phi^4$. This is the a posteriori 
justification for the neglect of $\phi^{n>4}$ operators in the derivation of the model. 

▷ Similarly, operators with more than two gradients are generally irrelevant and can be 
neglected in all dimensions. 

▷ In contrast, the operator $\int \phi$ coupling to the magnetic field carries dimension $1 + d/2$ and 
is therefore always strongly relevant. 
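
The power counting behind these statements can be sketched in a few lines. The code below assumes only the assignment $[\phi] = L^{(2-d)/2}$ made above and the convention that an operator whose dimension $L^x$ has $x > 0$ is relevant; the function names are hypothetical.

```python
from fractions import Fraction

def dim_phi_power(n, d):
    """Exponent x in [∫ d^d r φ^n] = L^x, given [φ] = L^((2-d)/2)."""
    d = Fraction(d)
    return d + n * (2 - d) / 2

def dim_gradient_term(m, d):
    """Exponent x in [∫ d^d r (∇^m φ)^2] = L^x; it evaluates to 2(1 - m)."""
    d = Fraction(d)
    return d - 2 * m + (2 - d)

def threshold_dimension(n):
    """Dimension below which φ^n is relevant: (1/2 - 1/n)^(-1) = 2n/(n - 2)."""
    return Fraction(2 * n, n - 2)

def classify(x):
    """Translate the sign of the exponent into the RG vocabulary."""
    if x > 0:
        return "relevant"
    if x < 0:
        return "irrelevant"
    return "marginal"

for d in (2, 3, 4, 5):
    row = ", ".join(
        f"phi^{n}: {classify(dim_phi_power(n, d))}" for n in (1, 2, 4, 6)
    )
    print(f"d = {d}: {row}")
```

For instance, the $\phi^6$ operator has threshold dimension 3 and is therefore only marginal in three dimensions, while a term with four gradients ($m = 2$) carries dimension $L^{-2}$ irrespective of $d$.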


27 Generalizing our discussion from Section 5.1, we have incorporated a coupling to an external field. (Exercise: 
Recapitulate the construction of Section 5.1 to convince yourself that, to lowest order in an expansion in terms 
of $\phi$, coupling the system to a magnetic field leads to the fourth term of Eq. (8.23). In case you are too impatient 
to do this: justify the structure of the term on physical grounds.) 

28 As we see shortly, the analysis of the problem is readily generalized to non-integer dimensions. 
Dimensional analysis provides us with some valuable hints as to the importance of various 
operators appearing in the theory. It also indicates that, in the present context, dimension 
d = 4 might play a special role. Guided by this information, we now proceed to analyze the 
model in a sequence of steps of increasing sophistication. 


8.4.2 Landau mean-field theory 

Given an action of the form (8.23), the first thing one might try is a mean-field analysis. 
That is, assuming that our coupling constants $r$ and $\lambda$ are sufficiently large we might assume 
that the functional integral over $\phi$ is centered around solutions of the equation $\delta S[\phi]/\delta\phi = 0$, 
or 

$$r\phi + \frac{\lambda}{6}\phi^3 - h = 0, \qquad (8.24)$$

where we have used the fact that the low-energy mean-field configuration will be spatially 
constant. Just by inspecting the potential part of the field-free Lagrangian, $\frac{r}{2}\phi^2 + \frac{\lambda}{4!}\phi^4$, it is 
clear that, depending on the sign of $r$, the mean-field equation possesses two fundamentally 
different types of solution. For $r > 0$, the action has a global minimum at $\phi = 0$, implying 
that $\phi = 0$ is the unique mean-field (see Fig. 8.7). Noticing that the amplitude of $\phi$ represents 
a measure of the magnetization of the system (which is clear from the way the $\phi^4$-action 
was derived from the Ising model on page 196), we identify $r > 0$ as a phase of zero net 
magnetism, the paramagnetic phase. 

In contrast, for $r < 0$, the action has two degenerate minima at non-zero values, $\phi = 
\pm\phi_0 = \pm(6|r|/\lambda)^{1/2}$ (see Fig. 8.7). The system then has to make a choice as to whether 
it wants to sit in the ground state configuration $\phi = \phi_0$ or $\phi = -\phi_0$. This is the state of 
spontaneous symmetry breaking indicative of the low-temperature ferromagnetic phase. 
(Notice that, upon the switching on of a small magnetic field, the degeneracy between the 
two ground states is lifted and the system will populate a state of predetermined magnetization, 
$\phi = \pm\phi_0$, depending on the sign of $h$.) 

The preliminary analysis above indicates that r has the status of a fundamental parameter 
tuning the system through the ferromagnetic transition. Indeed, the microscopic analysis 
in Section 5.1 had indicated that r ~ T — T c was a function of temperature that changed 
sign at some critical temperature T c , the mean-field critical temperature of the transition. 
However, even if we did not know the microscopics, it would be clear that $r(T)$ is (i) some 



(b) the critical point. 
Table 8.1 Critical exponents of the ferromagnetic transition obtained through different methods. 
Experimental exponents represent cumulative data from various three-dimensional 
ferromagnetic materials. 

Exponent   Experiment   Mean-field   Gaussian   ε¹      ε⁵
α          0-0.14       0            1/2        1/6     0.109
β          0.32-0.39    1/2          1/4        1/3     0.327
γ          1.3-1.4      1            1          7/6     1.238
δ          4-5          3            5          4       4.786
ν          0.6-0.7      -            1/2        7/12    0.631
η          0.05         -            0          0       0.037

Source: Data taken from K. Huang, Statistical Mechanics (Wiley, 1987). 


function of temperature which (ii) must have a zero at some temperature T = T c (otherwise 
there would be no transition to begin with). Therefore, in the vicinity of T = T c , we can 
set r ~ T — T c as our prime measure of the distance to the critical point. (This observation 
is, in fact, in perfect agreement with our earlier observation that the operator $\int \phi^2$ coupled 
to r is relevant - see the discussion in Section 8.3.2.) 

What can mean-field theory say about the prime descriptors of the transition, the critical 
exponents? Identifying the field amplitude $\phi$ (alias the magnetization) with the order 
parameter of the transition, and referring back to our list of exponents on page 438, the 
low-temperature profile is given by $|\phi| = (6|r|/\lambda)^{1/2} \sim |t|^{1/2}$, implying that $\beta = 1/2$. The 
exponent $\gamma$ is obtained by differentiating Eq. (8.24) with respect to $h$. With $\chi \sim \partial_h \phi|_{h=0}$ it 
is then straightforward to verify that, on approaching the critical point from either side 
of the transition, $\chi \sim |t|^{-1}$, implying an exponent $\gamma = 1$. The action evaluated on the 
mean-field configuration takes the form 

$$\frac{S[\phi]}{L^d} = \begin{cases} -\dfrac{3 r^2}{2\lambda} \sim t^2, & t < 0, \\ 0, & t > 0. \end{cases} \qquad (8.25)$$

With the mean-field free energy $F = T S[\phi]$ we find that the specific heat $C = -T \partial_T^2 F \sim 
\partial_t^2 S$ behaves as a step function at the transition point, implying $\alpha = 0$. Right at the critical 
temperature, $r = 0$, the mean-field magnetization depends on $h$ as $\phi \sim h^{1/3}$, implying that 
$\delta = 3$. Finally, the correlation length exponents $\nu$, $\eta$ cannot directly be computed from plain 
mean-field theory as they are tied to the spatial profile of fluctuating field configurations. 
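
The mean-field exponents can also be read off numerically. The following sketch assumes the mean-field equation in the form $r\phi + (\lambda/6)\phi^3 - h = 0$, which follows from a quartic term $(\lambda/4!)\phi^4$ in the action, sets $\lambda = 1$ for convenience, and identifies $r$ with the reduced temperature. The helper names, the bisection solver and the sampling points are illustrative choices only.

```python
import math

def magnetization(r, h, lam=1.0):
    """Largest root of r*phi + (lam/6)*phi**3 - h = 0 for h >= 0, by bisection."""
    def g(phi):
        return r * phi + lam * phi**3 / 6.0 - h
    # For r < 0 the function g has its minimum at sqrt(2|r|/lam); start there.
    lo = math.sqrt(2.0 * abs(r) / lam) if r < 0 else 0.0
    hi = 1.0
    while g(hi) <= 0.0:
        hi *= 2.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if g(mid) <= 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def susceptibility(r, dh=1e-8):
    """Finite-difference estimate of d(phi)/dh at h = 0."""
    return (magnetization(r, dh) - magnetization(r, 0.0)) / dh

def loglog_slope(f, x1, x2):
    """Slope of log f versus log x between two sample points."""
    return (math.log(f(x2)) - math.log(f(x1))) / (math.log(x2) - math.log(x1))

# beta: phi ~ |r|**beta below the transition, at h = 0.
beta = loglog_slope(lambda t: magnetization(-t, 0.0), 1e-4, 1e-3)
# gamma: chi ~ r**(-gamma) above the transition.
gamma = -loglog_slope(susceptibility, 1e-2, 1e-1)
# delta: phi ~ h**(1/delta) right at r = 0.
delta = 1.0 / loglog_slope(lambda h: magnetization(0.0, h), 1e-6, 1e-5)

print(f"beta = {beta:.4f}, gamma = {gamma:.4f}, delta = {delta:.4f}")
```

The same solver also reproduces the position of the degenerate minima, $\phi_0 = (6|r|/\lambda)^{1/2}$, when called with $r < 0$ and $h = 0$.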

For the sake of later comparison, the mean-field critical exponents are summarized in 
Table 8.1. At first sight the differences between the experimentally observed exponents 
(second column) and the mean-field exponents (third column) do not look too dramatic - 
apparently the primitive mean-field approach pursued here fares reasonably favorably - 
which, in view of the accumulation of pronounced fluctuations at the critical point, should 
come as something of a surprise. On the other hand we must keep in mind that the exponents 
describe singular power laws in the transition region. In view of that, the difference between 
1.3 and 1 does look quite significant. At any rate, we should try to refine our theoretical 
understanding of the transition and search for the source of the discrepancy with experiment. 


8.4.3 Gaussian model 

As a first improvement on the mean-field approximation, let us explore the effect of quadratic 
fluctuations around the constant field configuration $\phi$. Approaching the transition point 
from above, we set $\phi = 0$ and approximate the action through its quadratic expansion 29 

$$S[\phi] \approx \int d^d r \left[ \frac{r}{2}\phi^2 + \frac{1}{2}(\nabla\phi)^2 - h\phi \right]. \qquad (8.26)$$

In this form, one may effect the Gaussian integral over field fluctuations and evaluate the 
dependence of the free energy on the external parameters h and r. However, in anticipation 
of our analysis of the full problem below, we here pursue a slightly different, 
renormalization-group-oriented approach, i.e., pretending that we did not know how to do the Gaussian 
integral, we subject the quadratic action to a momentum shell RG analysis. 

Proceeding along the lines of the canonical scheme, we split our field into fast and slow 
degrees of freedom $\phi = \phi_s + \phi_f$ resulting in the, now familiar, fragmentation of the action 
$S[\phi_s, \phi_f] = S_s[\phi_s] + S_f[\phi_f] + S_c[\phi_s, \phi_f]$. However, the crucial simplification, characteristic 
of a Gaussian theory, is that the action S c coupling fast and slow components vanishes 
(exercise), implying that the integration over the fast field merely leads to an inessential 
constant. The effect of the RG step on the action is then entirely contained in the rescaling 
of the slow action. According to our previous discussion, the scaling factors thus appearing 
are determined by the engineering dimensions of the operators appearing in the action, i.e. 
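$r \to b^2 r$ and $h \to b^{d/2+1} h$. Using the fact that $r \sim t$ we can then readily write down the 
two relevant scaling dimensions of the problem, $y_t = 2$ and $y_h = d/2 + 1$. Comparison with 
Eq. (8.22) finally leads to the list of exponents, 

$$\alpha = 2 - \frac{d}{2}, \qquad \beta = \frac{d}{4} - \frac{1}{2}, \qquad \gamma = 1, \qquad \delta = \frac{d+2}{d-2}, \qquad \nu = \frac{1}{2}, \qquad \eta = 0.$$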


Notice that the exponents now explicitly depend on the dimensionality of the system, a natural 
consequence of the fact that they describe the effect of spatial fluctuations. Table 8.1 
contains the values of the exponents for a three-dimensional system. We cannot really say 
that the results are any better than those obtained by the mean-field analysis. Some exponents 
(e.g. $\delta$) agree better with the experimental data, while others (e.g. $\alpha$) are decidedly 
worse. 

As a corollary to this section, we note that the Gaussian model possesses only one fixed 
point, namely $r = h = 0$, which in the context of $\phi^4$-theory is called the Gaussian fixed 
point. 


29 The appearance of a linear term indicates that we are expanding not around the “true” mean-field, i.e. the 
exact solution of (8.24), but rather around the solution $\phi = 0$ of the field-free system. However, in view of the 
fact that $h$ has the status of an external perturbation, this choice of the reference configuration is quite natural. 
8.4.4 Renormalization group analysis 

In the present analysis of the model, we have not really touched upon its principal source of 
complexity, namely the effect of the “interaction operator” $\phi^4$ on the fluctuation behavior 
of the field. It seems likely that the neglect of this term is responsible for the comparatively 
poor predictive power both of the straightforward mean-field analysis, and of the Gaussian 
model. Indeed, the dimensional analysis of Section 8.4.1 indicated that the $\phi^4$ addition to 
the action becomes relevant below four dimensions. A more physical argument, to the same 
effect, is given in the Info block below. 

Although the solution of the general problem posed by the action (8.23) still appears to 
be hopelessly difficult, there is one aspect we can turn to our advantage. While physical 
systems exist in integer dimensions d = 1, ... ,4, ... , there is actually no reason why we 
should not be allowed to evaluate our theory, i.e. the functional integral with action (8.23), 
in fractional dimensions. In the present context, this seemingly academic freedom turns out 
to be of concrete practical relevance. The point is that the nonlinear $\phi^4$ operator was found 
to be marginal at $d = 4$ and relevant below. One may thus expect that, in dimensions 
$d = 4 - \epsilon$, $\epsilon \ll 1$, the operator is relevant but not that relevant, i.e. one may expect that, 
for sufficiently small deviations off the threshold dimension, four, the theory knows of an 
expansion parameter, somehow related to $\epsilon$, which will enable us to control the interaction 
operator. Of course, at the end of the day, we will have to “analytically continue” to dimensions 
of interest, $\epsilon = 1$ or even $\epsilon = 2$, but, for the present, we will see what we can learn 
from a $d = 4 - \epsilon$ representation of the theory. 


INFO Our previous analysis relied on the assumption that the field integration is tightly bound 
to the vicinity of the extrema of the action. But let us now ask under what conditions this 
assumption is actually justified. We should develop some intuition as to the relative importance 
of the mean-field content of the theory and of the fluctuations around the mean-field. While 
there are several ways to proceed with this program, we will focus on the analysis of the magnetic 
susceptibility. (At this point, we should warn the reader that the arguments formulated below, 
while technically straightforward, are conceptually involved. The critical contemplation of the 
logical steps of the construction is time well invested.) Firstly, let us recall the definition of the 
susceptibility, 


$$\chi = -\partial_h^2 F \sim \int d^d r\, \langle \phi(\mathbf{r}) \phi(0) \rangle_c \sim G(\mathbf{k} = 0),$$


where we have used the fact that $\langle \phi(\mathbf{r}) \phi(\mathbf{r}') \rangle = G(\mathbf{r} - \mathbf{r}')$ is the Green function of the model. Given 
this identification, we note that a formal criterion of the transition - divergent susceptibility! - 
is synonymous with a singularity of the zero-momentum Green function. 

On the level of the Gaussian theory (see Eq. (8.26)) $G(\mathbf{k}) \sim (r + k^2)^{-1}$, i.e. $\chi \sim r^{-1}$. Anticipating 
troubling observations to come, we reiterate that the mean-field transition temperature is 
identified by the condition $r \sim t = 0$. Now, let us move on to explore corrections to the mean-field 
susceptibility on the level of a perturbative one-loop calculation. To this end, we recall that (if 
necessary, recapitulate the discussion of Section 5.1), due to the presence of the $\phi^4$ operator, the 
Green function acquires a self-energy which, at the one-loop level, is given by 
$\Sigma = -\frac{\lambda}{2} \left( \frac{L}{2\pi} \right)^d \int^\Lambda \frac{d^d k'}{r + k'^2}$. 
As a consequence, one can identify the susceptibility as 

$$\chi^{-1} \sim (G(\mathbf{k} = 0))^{-1} = r - \Sigma = r + \frac{\lambda}{2} \left( \frac{L}{2\pi} \right)^d \int^\Lambda \frac{d^d k'}{r + k'^2}.$$

A first observation to be made is that non-Gaussian fluctuations (physically: interactions between 
harmonic fluctuations around the mean-field amplitude) lower the transition temperature, i.e., 
setting $r \sim T - T_c$, it now takes a smaller temperature $T$ to reach the critical point; in accord 
with the intuitive expectation that fluctuations tend to “disorder” the system. Frustratingly, one 
may also observe that the cutoff $\Lambda$ is needed to prevent the “correction”, 

$$\int^\Lambda \frac{d^d k'}{r + k'^2},$$

from diverging in dimensions d > 2. To deal with this singularity, we have to realize that the effect 
of fluctuations is actually two-fold: the transition temperature gets shifted and the temperature 
dependence of the inverse susceptibility is apparently no longer simply linear (by virtue of the 
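$r$-dependence of the integrand). The two effects can be disentangled by writing 

$$\chi^{-1} = \tilde r + \frac{\lambda}{2} \left( \frac{L}{2\pi} \right)^d \int^\Lambda d^d k' \left( \frac{1}{r + k'^2} - \frac{1}{k'^2} \right) \simeq \tilde r - \frac{\lambda \tilde r}{2} \left( \frac{L}{2\pi} \right)^d \int^\Lambda \frac{d^d k'}{(\tilde r + k'^2)\, k'^2}, \qquad (8.27)$$

where 

$$\tilde r = r + \frac{\lambda}{2} \left( \frac{L}{2\pi} \right)^d \int^\Lambda \frac{d^d k'}{k'^2}$$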

represents the shifted transition temperature while the integral describing the deviation from the 
linear temperature dependence of the susceptibility is now UV-convergent in dimensions $d < 4$. 
Notice that in the second equality of Eq. (8.27) we have replaced the parameter $r$ in the integrand 
by the modified parameter $\tilde r$. To the accuracy of a one-loop calculation, this manipulation is 
permissible. 
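
The difference between the two integrals is easy to see numerically. The sketch below is an illustration under simplifying assumptions: it works in $d = 3$, keeps only the radial part of each integral (angular factors and the prefactor multiplying the integral are dropped), and uses a plain midpoint rule. All function names are hypothetical.

```python
import math

def radial_integral(f, k_max, n=100_000):
    """Midpoint-rule estimate of the integral of f(k) over 0 < k < k_max."""
    dk = k_max / n
    return sum(f((i + 0.5) * dk) for i in range(n)) * dk

def raw_correction(r, cutoff, d=3):
    """Radial part of the integral of d^dk / (r + k^2) up to the cutoff."""
    return radial_integral(lambda k: k ** (d - 1) / (r + k * k), cutoff)

def subtracted_correction(r, cutoff, d=3):
    """Radial part of the integral of d^dk / ((r + k^2) k^2) up to the cutoff."""
    return radial_integral(lambda k: k ** (d - 3) / (r + k * k), cutoff)

r = 0.01
for cutoff in (10.0, 100.0, 1000.0):
    raw = raw_correction(r, cutoff)
    sub = subtracted_correction(r, cutoff)
    print(f"cutoff = {cutoff:7.1f}   raw = {raw:10.3f}   subtracted = {sub:8.4f}")

# In d = 3 the subtracted integral tends to pi / (2 sqrt(r)), i.e. ~ r**((d - 4)/2).
limit = math.pi / (2.0 * math.sqrt(r))
print(f"expected large-cutoff limit of the subtracted integral: {limit:.4f}")
```

The raw integral grows linearly with the cutoff, which is the divergence absorbed into the shifted parameter, while the subtracted integral saturates at a cutoff-independent value that scales as $r^{-1/2}$ in three dimensions.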

Naively, it looks as if this sequence of manipulations has led to a catastrophe: the fluctuation- 
renormalized transition temperature appears to diverge as one sends the cutoff to infinity, clearly 
a nonsensical prediction! However, one may note that there was actually no justification for 
identifying the physical transition temperature through the parameter r in the first place. This 
identification was based on mean-field theory alone, i.e. an approach to the problem which 
neglected altogether the key effect of fluctuations. However, the bare parameter r appearing in 
the action carries as little “universal” meaning as the cutoff A, or any other microscopic system 
parameter for that matter! 

Once we have acknowledged this interpretation, we should then identify the transition tem- 
perature through the singularity of the macroscopically observable properties (e.g. divergence of 
the susceptibility leads to the vanishing of the modified parameter $\tilde r$ at the one-loop level) while 
the microscopic parameters carry no significance by themselves. 


EXERCISE This interpretation closely parallels the philosophy of renormalization in high- 
energy physics. There, the bare parameters of the action are fundamentally undetermined, 
while the inverse of the Green function at zero external momentum represents a physical 
observable, e.g. the mass of the electron. Since the loop corrections to this physical quantity 
appear to be infinite (and the theory does not enjoy the luxury of the presence of a physically 
motivated cutoff), one postulates that the bare parameters of the action have been infinite by 
themselves. These singularities are deliberately adjusted so as to cancel the divergence of the 
fluctuation corrections and to produce finite “physical” quantities. It is instructive to consult 
a textbook on renormalization in high-energy physics (such as Ryder 30 ) to become acquainted 
with the functioning of this strategy, and with the enormous success it has had in the context 
of QED and other sub-branches of particle physics. 

We now turn to the second effect of the fluctuation correction, namely the deviation from the 
linear temperature dependence, as described by the integral contribution to Eq. (8.27). On 
dimensional grounds, the integral depends on the parameter $\tilde r$ as $\sim \lambda L^d \tilde r^{(d-4)/2}$. The (mean-field 
+ quadratic fluctuations) approach to the problem breaks down when this contribution becomes 
more important than the leading-order contribution to the susceptibility, i.e. for dimensions 
$d < 4$. This observation is the essence of the so-called Ginzburg criterion. The criterion states 
that mean-field theory becomes inapplicable below the so-called upper critical dimension 
$d_c = 4$. While we have derived this statement for the particular case of the $\phi^4$-model, it is clear 
that similar estimates can be performed for every nonlinear field theory, i.e. as with the lower 
critical dimension, the upper critical dimension also represents an important threshold separating 
the mean-field dominated d > d c from the fluctuation dominated d < d c behavior. Also notice 
that the analysis above conforms with our previous observation that the nonlinear $\phi^4$ operator is 
relevant in dimensions d < 4. (Convince yourself that the two lines of argument reflect the same 
principle, namely the dependence of fluctuations on the accessible phase volume, as determined 
by the dimensionality of the system.) 

Before proceeding to the details of the RG program, let us try to predict a number of general 
elements of the $\phi^4$ phase diagram on dimensional grounds. We saw that in dimensions $d > 4$ 
the $\phi^4$ operator is irrelevant and that the Gaussian model essentially dictates the behavior 
of the system. Specifically, for $d > 4$, the Gaussian fixed point $r = \lambda = h = 0$ is the only 
fixed point of the system. Below four dimensions, the $\phi^4$ operator becomes relevant and 
the emergence of a richer fixed point structure may be expected. However, for $\epsilon = 4 - d$ 
sufficiently small, we also expect that, whatever new fixed points appear, they should be 
close to the Gaussian point. This means that we can conduct our search for new fixed points 
within a double expansion in $\epsilon$, and the small deviation of the coupling constants $r$, $\lambda$, $h$ 
around the Gaussian fixed point. (In fact, we will momentarily identify a third expansion 
parameter, namely the number of momentum loops appearing in fast-field integration.) 

Step I 

We next proceed to formulate the steps of the RG in detail. To keep things simple, the RG 
transformation will be carried out to lowest order in a triple expansion in $\epsilon$, the coupling 
constants, and the number of momentum loops. The rationale behind the loop expansion can 
be best understood if we assume that the entire action 31 is multiplied by a large parameter 
(which, in the case of a quantum theory, might be $\hbar^{-1}$). The expansion in the number of 
loops is then equivalent to an expansion in the inverse of that parameter (for a quantum 
theory, an expansion away from the classical limit). 


30 L. H. Ryder, Quantum Field Theory (Cambridge University Press, 1996). 

31 Before we rescaled the fields so as to make the leading-order coefficient equal to 1/2. 
EXERCISE Verify this statement; to this end, notice that a diagram of $n$th order in perturbation 
theory in the $\phi^4$ vertex contains a prefactor $a^n$ involving the large parameter. On the other hand, 
each of the $I$ internal lines, or propagators, contained by the diagrams contributes a factor $a^{-1}$, 
so that the overall power is $a^{n-I}$. Next relate the number of internal lines to the number $L$ of 
loops. Notice that each line corresponds to a momentum summation. However, the number of 
independent summations is constrained by the $n$ $\delta$-functions carried by the vertices. Use this 
information to show that the overall power of the graph is $a^{-L+1}$, i.e. an expansion in $L$ is 
equivalent to an expansion in the inverse of $a$. 

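Let us now decompose the action in the standard manner, setting $S[\phi_s, \phi_f] = S_f[\phi_f] + 
S_s[\phi_s] + S_c[\phi_s, \phi_f]$, where 

$$S_f[\phi_f] = \int d^d r \left[ \frac{r}{2}\phi_f^2 + \frac{1}{2}(\nabla\phi_f)^2 \right],$$

$$S_s[\phi_s] = \int d^d r \left[ \frac{r}{2}\phi_s^2 + \frac{1}{2}(\nabla\phi_s)^2 + \frac{\lambda}{4!}\phi_s^4 - h\phi_s \right],$$

$$S_c[\phi_s, \phi_f] = \frac{\lambda}{4} \int d^d r\, \phi_s^2 \phi_f^2.$$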

Several approximations related to the loop order of the expansion are already imposed at 
this level. We have neglected terms of $O(\phi_f^4)$ because their contraction leads to two loop 
diagrams. The same applies to terms of $O(\phi_s \phi_f^3)$ (exercise). Terms of $O(\phi_s^3 \phi_f)$ do not arise 
because the addition of a fast momentum and three slow momenta is incompatible with 
momentum conservation. 


Steps II and III 

To simplify the notation, let us rescale the momentum according to $\mathbf{q} \to \mathbf{q}/\Lambda$, implying 
that coordinates are measured in units of the inverse cutoff, $\mathbf{r} \to \mathbf{r}\Lambda$. With the coupling 
constants rescaled according to their engineering dimensions, $r \to r\Lambda^2$, $\lambda \to \lambda\Lambda^{4-d}$, the 
action remains unchanged, while the fast and slow momenta are now integrated over the 
dimensionless intervals $|\mathbf{q}_s| \in [0, b^{-1}]$ and $|\mathbf{q}_f| \in [b^{-1}, 1]$, respectively. We next construct 
an effective action by integration over the fast field: $e^{-S_{\rm eff}[\phi_s]} = e^{-S_s[\phi_s]} \langle e^{-S_c[\phi_s, \phi_f]} \rangle_f$. In 
performing the average over fast fluctuations, $\langle \cdots \rangle_f$, we shall (a) retain only contributions of 
one-loop order while (b) neglecting terms that lead to the appearance of $\phi_s^{n>4}$ contributions 
in the action. (For example, the contraction $\langle (\int \phi_s^2 \phi_f^2)^3 \rangle_f$ would lead to such a term.) To this 
level of approximation, one obtains 

$$e^{-S_{\rm eff}[\phi_s]} = e^{-S_s[\phi_s]} \exp\left[ -\langle S_c[\phi_s, \phi_f] \rangle_f + \frac{1}{2} \langle S_c[\phi_s, \phi_f]^2 \rangle_f^c \right],$$

where the superscript c denotes a connected average. (Exercise: It is instructive to check
the consistency of this expansion for yourself.) The two diagrams corresponding to the
contractions $\langle S_c[\phi_s,\phi_f]\rangle_f$ and $\langle S_c[\phi_s,\phi_f]^2\rangle_f^c$ are shown in parts (a) and (b) of the figure
below, respectively, where the external line segments indicate the passive $\phi_s$ amplitudes.
According to the standard rules of perturbation theory, the first of the two diagrams, (a),
evaluates to

$$\langle S_c[\phi_s,\phi_f]\rangle_f = \frac{\lambda}{4}\int d^d r\,\phi_s^2 \int_f \frac{d^d q}{(2\pi)^d}\,\frac{1}{r+q^2}.$$

We now consider the summation over fast momenta appearing in this expression. Using
the fact that we are in the near vicinity of the critical point and anticipating that we are
interested in no more than the expansion of the $\beta$-function for small values of the coupling,
we now expand the integrand to first order in $r$,
$\int_f \frac{d^d q}{(2\pi)^d}\frac{1}{r+q^2} = I_1 - rI_2$, where we have
introduced the shorthand notation,

$$I_\alpha \equiv \int_f \frac{d^d q}{(2\pi)^d}\,\frac{1}{q^{2\alpha}}. \qquad (8.28)$$

These integrals are straightforwardly computed by switching to polar coordinates,

$$I_\alpha = \Omega_d \int_{b^{-1}}^{1} dq\, q^{d-2\alpha-1} = \frac{\Omega_d}{d-2\alpha}\left(1 - b^{2\alpha-d}\right),$$

where $\Omega_d = (2\pi^{d/2}/\Gamma(d/2))/(2\pi)^d$ denotes the surface area of the $d$-dimensional unit sphere
(measured in units of $(2\pi)^d$). We thus find that, after the integration over fast modes, and the
standard rescaling operation, $q \to bq$, $\phi \to b^{(d-2)/2}\phi$, the quadratic part of the action takes
the form

$$S^{(2)}[\phi] = \frac{b^2}{2}\left[ r + \frac{\lambda\Omega_d}{2(d-2)}\left(1-b^{2-d}\right) - \frac{\lambda r\Omega_d}{2(d-4)}\left(1-b^{4-d}\right) \right]\int d^d r\,\phi^2. \qquad (8.29)$$
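The closed form for the shell integrals $I_\alpha$ can be checked against a direct numerical radial integration. The following is a minimal sketch, not part of the original derivation; the function names `omega`, `i_alpha_closed` and `i_alpha_numeric` are hypothetical, and the cutoff has been scaled to one as in the text.

```python
import math


def omega(d):
    """Surface area of the unit sphere in d dimensions, in units of (2 pi)^d."""
    return 2.0 * math.pi ** (d / 2.0) / math.gamma(d / 2.0) / (2.0 * math.pi) ** d


def i_alpha_closed(alpha, d, b):
    """Closed form Omega_d (1 - b^(2 alpha - d)) / (d - 2 alpha); needs d != 2 alpha."""
    return omega(d) * (1.0 - b ** (2 * alpha - d)) / (d - 2 * alpha)


def i_alpha_numeric(alpha, d, b, n=20000):
    """Midpoint-rule estimate of Omega_d * int_{1/b}^{1} dq q^(d - 2 alpha - 1)."""
    lo, hi = 1.0 / b, 1.0
    step = (hi - lo) / n
    total = 0.0
    for k in range(n):
        q = lo + (k + 0.5) * step
        total += q ** (d - 2 * alpha - 1)
    return omega(d) * total * step


if __name__ == "__main__":
    for alpha in (1, 2):
        print(alpha, i_alpha_closed(alpha, 3, 2.0), i_alpha_numeric(alpha, 3, 2.0))
```

The sketch avoids the marginal cases $d = 2\alpha$, where the closed form has to be replaced by its logarithmic limit $\Omega_d \ln b$.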


[Figure: the two one-loop diagrams, (a) and (b).]

Turning to the second diagram (b) in the figure, we notice that, owing to the presence of
four external legs, its contribution will be proportional to $\phi_s^4$. Further, momentum
conservation implies that the momenta carried by the internal lines of the diagram will depend on
both the fast “internal” momentum and the external momenta carried by the fields $\phi_s$.
However, we can simplify the analysis by neglecting the dependence on the latter from the
outset. The reason is that the integration over the internal momentum followed by Taylor
expansion in the slow momenta would generate expressions of the structure
$F(q_1,q_2,q_3)\phi(q_1)\phi(q_2)\phi(q_3)\phi(-q_1-q_2-q_3)$, where $q_{1,2,3}$
represent slow momenta and $F$ is some polynomial. Taking account of the small momenta
would thus generate derivatives acting on an operator of fourth order in $\phi$, a combination
that we saw above is irrelevant.

Neglecting the external momenta, diagram (b) leads to the result

$$\frac{1}{2}\langle S_c[\phi_s,\phi_f]^2\rangle_f^c = \frac{\lambda^2}{16}\int d^d r\,\phi_s^4 \int_f \frac{d^d q}{(2\pi)^d}\,\frac{1}{(r+q^2)^2} = \frac{\lambda^2 I_2}{16}\int d^d r\,\phi_s^4 + O(\lambda^2 r).$$

Evaluating the integral and rescaling, we find that the quartic contribution to the
renormalized action reads

$$S^{(4)}[\phi] = b^{4-d}\left( \frac{\lambda}{4!} - \frac{\lambda^2\Omega_d}{16}\,\frac{1-b^{4-d}}{d-4} \right)\int d^d r\,\phi^4.$$

Finally, there are no one-loop diagrams affecting the linear part of the action, i.e.

$$S^{(1)}[\phi] = -h\,b^{d/2+1}\int d^d r\,\phi$$

rescales according to its engineering dimension.

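Combining everything, we find that, to one-loop order, the coupling constants scale
according to the relations
$r \to b^2\left(r + \frac{\lambda\Omega_d}{2(d-2)}(1-b^{2-d}) - \frac{\lambda r\Omega_d}{2(d-4)}(1-b^{4-d})\right)$,
$\lambda \to b^{4-d}\left(\lambda - \frac{3}{2}\lambda^2\Omega_d\frac{1-b^{4-d}}{d-4}\right)$, and $h \to hb^{d/2+1}$.
We next set $d = 4-\epsilon$ and evaluate the right-hand sides
of these expressions to leading order in $\epsilon$. With $\Omega_{4-\epsilon} \approx \Omega_4 = 1/(8\pi^2)$, we thus obtain

$$r \to b^2\left[ r + \frac{\lambda}{32\pi^2}\left(1-b^{-2}\right) - \frac{\lambda r}{16\pi^2}\ln b \right],$$

$$\lambda \to (1+\epsilon\ln b)\left( \lambda - \frac{3\lambda^2}{16\pi^2}\ln b \right),$$

$$h \to h\,b^{3-\epsilon/2},$$

which, setting $b = e^\ell$, lead to the Gell-Mann-Low equations:

$$\frac{dr}{d\ell} = 2r + \frac{\lambda}{16\pi^2} - \frac{\lambda r}{16\pi^2}, \qquad \frac{d\lambda}{d\ell} = \epsilon\lambda - \frac{3\lambda^2}{16\pi^2}, \qquad \frac{dh}{d\ell} = \frac{6-\epsilon}{2}\,h. \qquad (8.30)$$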


These equations clearly illustrate the meaning of the $\epsilon$-expansion. According to the second
equation, a perturbation away from the Gaussian fixed point will initially grow at a rate
set by the engineering dimension $\epsilon$. While, on the level of the classical, zero-loop theory, $\lambda$
would grow indefinitely, the one-loop contribution $\sim \lambda^2$ stops the flow at a value $\lambda \sim \epsilon$.

[Figure: the $\beta$-function $\beta(\lambda)$ for the coupling constant $\lambda$.]

Equating the right-hand sides of Eq. (8.30) to zero (and temporarily ignoring the magnetic
field), we indeed find that besides the Gaussian fixed point $(r_1^*,\lambda_1^*) = (0,0)$ a non-trivial
fixed point $(r_2^*,\lambda_2^*) = (-\frac{1}{6}, \frac{16\pi^2}{3})\,\epsilon$ has appeared.
Notice that, in accord with the schematic considerations made at the beginning of the section,
the second fixed point is $O(\epsilon)$ and coalesces with the Gaussian fixed point as $\epsilon$ is sent to
zero. Plotting the $\beta$-function for the coupling constant $\lambda$ (see figure), we further find that,
for $\epsilon > 0$, $\lambda$ is relevant around the Gaussian fixed point but irrelevant at the non-trivial
fixed point.
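The fixed-point structure of Eq. (8.30) can be explored with a few lines of code. The sketch below is only an illustration under stated assumptions: it ignores the magnetic field, takes the one-loop flow at face value, and uses hypothetical function names (`flow`, `nontrivial_fixed_point`, `stability_eigenvalues`). Because the linearized flow is triangular, its eigenvalues are simply the diagonal derivatives.

```python
import math

C = 1.0 / (16.0 * math.pi ** 2)  # the one-loop factor 1/(16 pi^2) of Eq. (8.30)


def flow(r, lam, eps):
    """Right-hand sides (dr/dl, dlam/dl) of the one-loop flow, with h = 0."""
    return 2.0 * r + C * lam * (1.0 - r), eps * lam - 3.0 * C * lam ** 2


def nontrivial_fixed_point(eps):
    """Zero of the truncated flow with lam != 0 (no further expansion in eps)."""
    lam_star = eps / (3.0 * C)                   # = 16 pi^2 eps / 3
    r_star = -(eps / 3.0) / (2.0 - eps / 3.0)    # = -eps/6 + O(eps^2)
    return r_star, lam_star


def stability_eigenvalues(lam, eps):
    """Diagonal entries of the (triangular) linearized flow at coupling lam."""
    return 2.0 - C * lam, eps - 6.0 * C * lam


if __name__ == "__main__":
    eps = 0.1
    r_star, lam_star = nontrivial_fixed_point(eps)
    print("fixed point:", r_star, lam_star)
    print("flow there:", flow(r_star, lam_star, eps))
    print("eigenvalues:", stability_eigenvalues(lam_star, eps))
```

At the Gaussian fixed point the same function returns the eigenvalues $(2, \epsilon)$, so that $\lambda$ is relevant there for $\epsilon > 0$, while at the non-trivial fixed point it returns $(2-\epsilon/3, -\epsilon)$.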

To understand the full flow diagram of the system, one may linearize the $\beta$-function
around both the Gaussian and the non-trivial fixed point. Denoting the linearized mappings
(acting on the deviations $(r-r^*, \lambda-\lambda^*)$) by $W_{1,2}$, we find

$$W_1 = \begin{pmatrix} 2 & \frac{1}{16\pi^2} \\ 0 & \epsilon \end{pmatrix}, \qquad W_2 = \begin{pmatrix} 2-\frac{\epsilon}{3} & \frac{1+\epsilon/6}{16\pi^2} \\ 0 & -\epsilon \end{pmatrix}.$$

Figure 8.8 Phase diagram of the $\phi^4$-model as obtained from the $\epsilon$-expansion.

Figure 8.8 shows the flow in the vicinity of the two fixed points, as described by the matrices
$W_{1,2}$, as well as the extrapolation to a global flow chart. Notice that the critical surface of
the system - the straight line interpolating between the two fixed points - is tilted with
respect to the $r \sim$ temperature axis of the phase diagram. This implies that it is not the
physical temperature alone that decides whether the system will eventually wind up in the
paramagnetic ($r \gg 0$) or ferromagnetic ($r \ll 0$) sector of the phase diagram. Rather one has
to relate temperature ($\sim r$) to the strength of the nonlinearity ($\sim \lambda$) to decide on which side
of the critical surface we are. For example, for strong enough $\lambda$, even a system with $r$ initially
negative may eventually flow towards the disordered phase. This type of behavior cannot
be predicted from the mean-field analysis of the model (which would generally predict a
ferromagnetic state for $r < 0$). Rather it represents a non-trivial effect of fluctuations.
Finally notice that, while we can formally extend the flow into the lower portion of the
diagram, $\lambda < 0$, this region is actually unphysical. The reason is that, for $\lambda < 0$, the action is
fundamentally unstable and, in the absence of a sixth-order contribution, does not describe
a physical system.

What are the critical exponents associated with the one-loop approximation? Of the two
eigenvalues of $W_2$, $2-\epsilon/3$ and $-\epsilon$, only the former is relevant. As with the Gaussian fixed
point, it is tied to the scaling of the coupling constant, $r \sim t$, i.e. we have $y_t = 2-\epsilon/3$ and,
as before, $y_h = (d+2)/2 = (6-\epsilon)/2$. An expansion of the exponents summarized in Eq.
(8.22) to first order in $\epsilon$ then yields the list

$$\alpha = \frac{\epsilon}{6}, \quad \beta = \frac{1}{2}-\frac{\epsilon}{6}, \quad \gamma = 1+\frac{\epsilon}{6}, \quad \delta = 3+\epsilon, \quad \nu = \frac{1}{2}+\frac{\epsilon}{12}, \quad \eta = 0.$$
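The step from $(y_t, y_h)$ to the exponent list is mechanical and can be spelled out in exact rational arithmetic. The sketch below assumes that Eq. (8.22) comprises the standard scaling relations $\nu = 1/y_t$, $\alpha = 2 - d/y_t$, $\beta = (d-y_h)/y_t$, $\gamma = (2y_h-d)/y_t$, $\delta = y_h/(d-y_h)$ and $\eta = 2+d-2y_h$; the function names are hypothetical.

```python
from fractions import Fraction


def scaling_exponents(d, y_t, y_h):
    """Critical exponents from the thermal and magnetic scaling dimensions
    (standard scaling relations, assumed here to coincide with Eq. (8.22))."""
    d, y_t, y_h = Fraction(d), Fraction(y_t), Fraction(y_h)
    return {
        "alpha": 2 - d / y_t,
        "beta": (d - y_h) / y_t,
        "gamma": (2 * y_h - d) / y_t,
        "delta": y_h / (d - y_h),
        "nu": 1 / y_t,
        "eta": 2 + d - 2 * y_h,
    }


def one_loop_exponents(eps):
    """Scaling relations evaluated with y_t = 2 - eps/3, y_h = (6 - eps)/2, d = 4 - eps."""
    eps = Fraction(eps)
    return scaling_exponents(4 - eps, 2 - eps / 3, (6 - eps) / 2)


def first_order_exponents(eps):
    """The same exponents truncated at first order in eps (the list in the text)."""
    eps = Fraction(eps)
    return {
        "alpha": eps / 6,
        "beta": Fraction(1, 2) - eps / 6,
        "gamma": 1 + eps / 6,
        "delta": 3 + eps,
        "nu": Fraction(1, 2) + eps / 12,
        "eta": Fraction(0),
    }


if __name__ == "__main__":
    for name, value in first_order_exponents(1).items():
        print(name, value)
```

For small $\epsilon$ the two functions agree up to terms of $O(\epsilon^2)$; at $\epsilon = 1$ they differ noticeably (for instance $\delta = 4$ in the truncated list versus $\delta = 5$ without truncation), which gives a feeling for the size of the neglected higher-order terms.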

If we are now reckless enough to extend the radius of the expansion to $\epsilon = 1$, i.e. $d = 3$,
we obtain the fifth column of Table 8.1. Apparently the agreement with the experimental
results has improved - even in spite of the fact that we have driven the $\epsilon$-expansion well
beyond its range of applicability! (For $\epsilon = 1$, terms of $O(\epsilon^2)$ can, of course, no longer be
neglected!)

How can one rationalize the success of the $\epsilon$-expansion? Trusting in the principle
that good theories tend to work well beyond their regime of applicability, we might simply
speculate that nature seems to be sympathetic to the concept of renormalization and the
loop expansion. Of course, a more qualified approach to the question is to explore what
happens at higher order in the $\epsilon$-expansion. Needless to say, the price to be paid for this
ambition is that, at orders $O(\epsilon^{n>1})$, the analysis indeed becomes laborious. Nonetheless,
the success of the first-order expansion prompted researchers to drive the $\epsilon$-expansion up to
fifth order! The results of this analysis are summarized in the last column of Table 8.1. In
view of the fact that we are still extending a series beyond its radius of convergence,$^{32}$ the
level of agreement with the experimental data is striking. In fact, the exponents obtained by
the $\epsilon$-expansion even agree - to an accuracy better than one percent - with the exponents
of the two-dimensional model,$^{33}$ i.e. for a situation where the “small” parameter $\epsilon$ has to
be set to two.

However, it is important to stress that the $\epsilon$-expansion is not just a computational tool
for the calculation of exponents. On a more conceptual level, its merit is that it enables
one to explore the phase diagram of nonlinear theories in a more or less controlled manner.
In fact, the $\epsilon$-expansion not only is useful in the study of field theories close to the upper
critical dimension (i.e. close to the mean-field threshold) but can equally well be applied
to the analysis of systems in the vicinity of the lower critical dimension. In the following
section, we consider a problem of this type, i.e. we will apply an $\epsilon$-expansion around $d = 2$
to detect the onset of global thermal disorder in models with continuous symmetries.


8.5 RG analysis of the nonlinear σ-model

The scalar field theory encapsulates a wide class of systems encompassing a single- 
component order parameter. However, throughout the text, we have encountered problems 
where the order parameter involves more than one component, e.g. the complex field 
associated with condensation phenomena, the matrix field associated with the quantum 
disordered metallic system, or the field theories involving spin. In such cases, one very 
often finds that the low-energy content of the theory involves a projection which imparts 
a constraint to the field integral. In the context of condensation phenomena, we saw that, 
at low temperatures, one can neglect the massive amplitude fluctuations of the order 
parameter, while the collective fluctuations of the phase mode impacted significantly on the 
low-energy properties of the system. In this case, the phase degree of freedom is constrained 
by its topology to lie on the unit circle. Similarly, if we neglect the “high-energy” physics 
of local moment formation, classical and quantum spin theories are constrained by the 
normalization of the local spin. When subjected to an auxiliary constraint, theories that 
are otherwise free are known as nonlinear σ-models. The aim of the present section is to
apply methods of the RG to explore the critical properties of a general class of nonlinear
σ-models. Later, in the next section and in Chapter 9, we will consider the way topological
considerations can impact on the nature of the system.

32 Indeed, it is believed that we are dealing with a series that is only asymptotically convergent. That is, beyond
a certain order of the expansion, the agreement with the “true” exponents will presumably become worse.

33 The latter are known from the exact solution of the two-dimensional model, see L. Onsager, Crystal statistics
I. A two-dimensional model with an order-disorder transition, Phys. Rev. 65 (1944), 117-49.

Let us then consider the nonlinear σ-model defined by the partition function
$Z = \int Dg\, e^{-S[g]}$, where

$$S[g] = \frac{1}{\lambda}\int d^d r\, \mathrm{tr}\left[\nabla g\,\nabla g^{-1}\right], \qquad (8.31)$$

$g \in G$ takes values in some compact Lie group $G$ and the integration
$\int Dg = \int \prod_{\mathbf{r}} d\mu(g(\mathbf{r}))$
extends over the Haar measure $d\mu(g)$ of $G$. (Later in this section, we will specialize to the
case $G = O(3)$ relevant to the case of the three-component spin models, the Heisenberg
magnet. Other realizations of $G$ will be met in the next chapter.)
INFO The Haar measure $d\mu(g)$ of a compact Lie group $G$ is a (uniquely specified) integration
measure that is (a) unit normalized, i.e. $\int_G d\mu(g)\cdot 1 = 1$, and (b) invariant under left and right
multiplication by a fixed group element:

$$\int_G d\mu(g)\, f(g) = \int_G d\mu(g)\, f(gh^{-1}) = \int_G d\mu(g)\, f(h^{-1}g)$$

for any $h \in G$. Upon translation of the integration variable $g \to hg$ or $g \to gh$, these equations
assume the form $\int_G d\mu(gh)\, f(g) = \int_G d\mu(g)\, f(g) = \int_G d\mu(hg)\, f(g)$. Holding for any $f$, this implies
that $d\mu(hg) = d\mu(g) = d\mu(gh)$, which tells us that the Haar measure assigns equal volume density
to any point on the group manifold. Owing to this homogeneity property, integrations over
groups are almost always performed with respect to the Haar measure. Details on the explicit
construction of this measure can be found in textbooks on Lie group theory.$^{34}$


As we have seen above, the presence of a continuous symmetry identifies the lower critical
dimension of the theory at $d_c = 2$. At dimensions of $d_c$ and below, massless Goldstone
fluctuations of the field destroy long-range order at any non-zero temperature. However, in
the range $2 < d < 4$, one expects an ordered phase to persist at low temperatures, while the
critical characteristics of the phase in the vicinity of the transition are fluctuation dominated.
For reasons that will become clear, in the following we develop an $\epsilon$-expansion around the
lower critical dimension to detect the fixed point structure of the model. (Such an
$\epsilon$-expansion was proposed in a seminal work by Polyakov.$^{35}$) However, before turning to the
actual formulation of the RG analysis, we need to do a bit of preparatory work.

Alexander M. Polyakov 1945-
He has made several important contributions to quantum field theory from non-abelian gauge
theory to conformal field theory. His path integral formulation of string theory had profound
and lasting impacts in the conceptual and mathematical understanding of the theory. He
also played an important role in elucidating the conceptual framework behind renormalization
independent of Kenneth Wilson’s Nobel prize winning work. He formulated pioneering ideas in
gauge/string duality long before the breakthrough of AdS/CFT using D-branes.

34 See, e.g., J. Fuchs and C. Schweigert, Symmetries, Lie Algebras and Representations, a Graduate Course for
Physicists (Cambridge University Press, 1997).

As usual, let us split the matrix fields $g$ into “slow” and “fast” components. This is
achieved by defining $g(\mathbf{r}) = g_s(\mathbf{r})g_f(\mathbf{r})$, where $g_s(\mathbf{r})$ and $g_f(\mathbf{r})$ have momentum components
in the ranges $[0, \Lambda b^{-1}]$ and $[\Lambda b^{-1}, \Lambda]$, respectively. Substituting this decomposition into the
action, one obtains $S[g_sg_f] = S[g_s] + S[g_f] + S_c[g_s,g_f]$, where the coupling action

$$S_c[g_s,g_f] = \frac{2}{\lambda}\int d^d r\, \mathrm{tr}\left[g_s^{-1}\nabla g_s\; g_f\nabla g_f^{-1}\right]. \qquad (8.32)$$

In the next step, one must integrate over the fast field gf. Since, of course, this integration 
cannot be performed exactly, we will have to resort to an approximation scheme based 
on the loop expansion. Yet, before doing so, we need to provide a little more background 
information on field integration over group-valued variables in general. 


8.5.1 Field integrals over groups 

As with any other nonlinear integration space, to perform an integral over a group one first 
needs to introduce a set of suitable coordinates. Of course, with the choice of these variables 
one can exercise some freedom. However, in most applications, one will employ exponential 
coordinates, i.e. one may start out from $g = e^W$, where $W \in \mathfrak{g}$ lives in the Lie algebra of
the group, and then expand $W = i\sum_a \pi^a T^a$ in the Hermitian group generators.

INFO Recall that the generators $T^a$ of a Lie group are $M$ elements of the Lie algebra $\mathfrak{g}$
specified by the following two conditions. (i) The set $\{T^a | a = 1,\ldots,M\}$ is a basis of $\mathfrak{g}$.$^{36}$ (ii)
The commutators translate to the relation $[T^a,T^b] = -if^{abc}T^c$, where the so-called structure
constants $f^{abc}$ encode the geometry of the group. For example, the structure constants of the
three-dimensional ($M = 3$) groups SU(2) and O(3) are given by $f^{abc} = \epsilon^{abc}$, $a,b,c = 1,2,3$,
where $\epsilon^{abc}$ is the antisymmetric tensor. For SU(2), the generators $T^a = \frac{1}{2}\sigma^a$ can be identified
with (one half of) the Pauli matrices, for O(3) with the angular momentum operators familiar
from quantum mechanics.

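For later reference, we note that the generators of the group families U(N), SU(N) and O(N)
obey the completeness relations

$$\sum_{a=1}^{M} T^a_{ij}T^a_{kl} = \frac{1}{2}\delta_{il}\delta_{jk}, \qquad \mathrm{U}(N), \qquad (8.33)$$

$$\sum_{a=1}^{M} T^a_{ij}T^a_{kl} = \frac{1}{2}\delta_{il}\delta_{jk} - \frac{1}{2N}\delta_{ij}\delta_{kl}, \qquad \mathrm{SU}(N), \qquad (8.34)$$

$$\sum_{a=1}^{M} T^a_{ij}T^a_{kl} = \delta_{il}\delta_{jk} - \delta_{ik}\delta_{jl}, \qquad \mathrm{O}(N). \qquad (8.35)$$

35 A. M. Polyakov, Interactions of Goldstone particles in two dimensions. Applications to ferromagnets and massive
Yang-Mills fields, Phys. Lett. B 59 (1975), 79-81.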

36 The Lie algebra of an M-dimensional Lie group is a vector space, so it has a basis. The “vectors” of this space
are realized by ($N \times N$)-dimensional matrices $T^a$.

(Here, the upper summation limits are set by $M = N^2$ for $G = \mathrm{U}(N)$, $M = N^2-1$ for
$G = \mathrm{SU}(N)$, and $M = N(N-1)/2$ for $G = \mathrm{O}(N)$ - exercise: why?)

In general, the generators can be chosen so as to obey the normalization condition
$\mathrm{tr}(T^aT^b) = c\,\delta^{ab}$, where $c$ is a normalization constant. Interpreting
$\langle W|W'\rangle \equiv \mathrm{tr}(WW')$ as a scalar product
on the Lie algebra (exercise: convince yourself that $\langle\,|\,\rangle$ meets all the criteria required of a scalar
product), the equation above tells us that the set $\{T^a\}$ forms an orthonormal basis. In the cases
U(N), SU(N) and O(N), consistency with the orthonormalization relation above enforces the
normalization $c = 1/2$, $1/2$, and $1$, i.e.

$$\mathrm{tr}(T^aT^b) = \frac{1}{2}\delta^{ab}, \quad \mathrm{U}(N), \mathrm{SU}(N); \qquad \mathrm{tr}(T^aT^b) = \delta^{ab}, \quad \mathrm{O}(N). \qquad (8.36)$$

Integration over all $M$ real coefficients $\pi^a$ will cover the full group $G$. Yet there remains
one non-trivial problem to be solved: we need to understand how to express the Haar
measure $d\mu(g) \to d\pi\, J(\pi)$ in terms of the new integration variables. Here, the “flat measure”
$d\pi \equiv \prod_a d\pi^a$ comprises the integration over all Lie algebra coefficients while the function
$J(\pi)$ encapsulates both the geometry of the Haar measure and the Jacobian associated with
the transformation $g \to \sum_a \pi^aT^a$. Referring for a more detailed discussion of the function $J$
to textbooks on group theory, we here merely note that its Taylor expansion in $\pi$ starts as
$J(\pi) = 1 + O(\pi^4)$ where the $O(\pi^4)$ term will not play a role in the context of the one-loop
analysis below. Thus, at the level of approximation discussed in this text, the integration
measure of the fast-field components can be approximated by the “flat” measure $d\pi$.

Using the RG application above as an example, let us now discuss how integrals over
fluctuations in the group space are performed in practice. Substitution of the expansion
$g_f = e^W = 1 + W + W^2/2 + \cdots$ into $S[g_f]$ and $S_c[g_s,g_f]$ gives a series of contributions
$S = \sum_{n=0}^{\infty} S^{(n)}$ where $S^{(n)}$ is of $n$th order in the local field amplitudes $W$. As usual, we will
organize our analysis around the fully solvable sector of the theory, i.e. the action quadratic
in $W$. A straightforward substitution into $S[g_f]$ gives

$$S^{(2)}[W] = -\frac{1}{\lambda}\int d^d r\, \mathrm{tr}\left[\nabla W\,\nabla W\right] = \frac{1}{\lambda}\int d^d r\, \nabla\pi^a\nabla\pi^b\, \mathrm{tr}\left[T^aT^b\right] \overset{(8.36)}{=} \frac{1}{\lambda}\int d^d r\, \nabla\pi^a\nabla\pi^a = \frac{1}{2}\sum_{p\in f}\pi^a_p\,\Pi_p^{-1}\,\pi^a_{-p},$$

where we have used the normalization condition (8.36), $\sum_{p\in f}$ stands for a summation over
the fast momentum sector and we have introduced $\Pi_p \equiv \frac{\lambda}{2L^dp^2}$ as the propagator of the
fast fields $\pi^a$.

Below, we engage integrals of the type
$\langle \mathrm{tr}(F(W(\pi)))\,\mathrm{tr}(G(W(\pi)))\cdots\rangle_W$, where $F, G, \ldots$
are functions of the generators $W$ and
$\langle\cdots\rangle_W \equiv \mathcal{N}\int d\pi\, \exp(-S^{(2)}[\pi])(\cdots)$ denotes averaging
over the quadratic action. These integrals may be evaluated with the help of Wick’s
theorem and the completeness relation (8.35). Wick’s theorem states that we need to form
all possible pairings of $\pi$-variables or, equivalently, $W$-matrices. Each pairing is of the form
either $\langle \mathrm{tr}(AW)\,\mathrm{tr}(A'W)\rangle$ or $\langle \mathrm{tr}(AWA'W)\rangle$, where the matrices $A$ and $A'$ may contain
$W$-matrices themselves (however, when computing an individual pairing, these matrices are
temporarily kept fixed). Specifically, for $G = \mathrm{O}(N)$ the Gaussian integrals give$^{37}$

$$\langle \mathrm{tr}(AW_p)\,\mathrm{tr}(A'W_{p'})\rangle_W = -\langle\pi^a_p\pi^b_{p'}\rangle_W\, \mathrm{tr}(AT^a)\,\mathrm{tr}(A'T^b) = -\delta_{p,-p'}\Pi_p\, \mathrm{tr}(AT^a)\,\mathrm{tr}(A'T^a) \overset{(8.35)}{=} -\delta_{p,-p'}\Pi_p\left[\mathrm{tr}(AA') - \mathrm{tr}(AA'^T)\right],$$

$$\langle \mathrm{tr}(AW_pA'W_{p'})\rangle_W = -\langle\pi^a_p\pi^b_{p'}\rangle_W\, \mathrm{tr}(AT^aA'T^b) = -\delta_{p,-p'}\Pi_p\, \mathrm{tr}(AT^aA'T^a) \overset{(8.35)}{=} -\delta_{p,-p'}\Pi_p\left[\mathrm{tr}(A)\,\mathrm{tr}(A') - \mathrm{tr}(AA'^T)\right].$$

Integrals over $W$-matrices are computed by applying these contraction rules until all possible
pairings have been exhausted. Probably the best way to become acquainted with this
procedure is to work through an example.
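Before working through the exercise, it may help to see a completeness relation verified by brute force. The sketch below does this for the simplest non-abelian case, SU(2) with $T^a = \sigma^a/2$, i.e. relation (8.34) at $N = 2$; the O(N) contraction rules above use (8.35) in exactly the same way. It is an illustration only, in plain Python with nested lists, and the helper names are hypothetical.

```python
from itertools import product

# Pauli matrices as nested lists; the SU(2) generators are T^a = sigma^a / 2.
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
T = [[[entry / 2 for entry in row] for row in s] for s in SIGMA]


def delta(i, j):
    """Kronecker delta."""
    return 1.0 if i == j else 0.0


def completeness_defect(generators, n=2):
    """Largest deviation of sum_a T^a_ij T^a_kl from
    (1/2) delta_il delta_jk - (1/2n) delta_ij delta_kl, i.e. from Eq. (8.34)."""
    worst = 0.0
    for i, j, k, l in product(range(n), repeat=4):
        lhs = sum(t[i][j] * t[k][l] for t in generators)
        rhs = 0.5 * delta(i, l) * delta(j, k) - delta(i, j) * delta(k, l) / (2 * n)
        worst = max(worst, abs(lhs - rhs))
    return worst


def trace_of_product(a, b, n=2):
    """tr(a b) for two n x n matrices given as nested lists."""
    return sum(a[i][j] * b[j][i] for i in range(n) for j in range(n))


if __name__ == "__main__":
    print("completeness defect:", completeness_defect(T))
    print("tr(T^1 T^1) =", trace_of_product(T[0], T[0]))
```

Contracting the relation with two arbitrary matrices $A_{ji}A'_{lk}$ reproduces the structure of the first pairing rule: the product of traces on the left turns into single traces on the right.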

EXERCISE Consider the expression
$X \equiv \langle \mathrm{tr}(A_{q,p}W_{p+q}W_{-p})\, \mathrm{tr}(A'_{q',p'}W_{p'+q'}W_{-p'})\rangle_W$, where
$A, A' \in \mathfrak{o}(N)$ belong to the Lie algebra of O(N) (which implies, in particular, that they are
traceless). Show that the pair contraction of the first $W$-matrix with the second (the third and the
fourth) vanishes due to the tracelessness of $A$ and $A'$. Next perform the contractions (1-4)(2-3)
and (1-3)(2-4) to obtain the result
$X = (N-2)\,\Pi_p\Pi_{p+q}\, \mathrm{tr}\left[A_{q,p}A'_{-q,p+q} - A_{q,p}A'_{-q,-p}\right]$.


8.5.2 One-loop expansion 

With this background, let us now return to the actual RG program for the O(N)-model.
As a first step, we need to identify all contractions that lead to no more than one fast
momentum integration (one-loop order). Since the contraction of terms of $O(W^{n\ge 4})$ will
generate at least two fast integrations, it suffices$^{38}$ to consider the action $S[g_s,W]$ at order
$W^2$. Substitution of $g_f = 1 + W + W^2/2$ into the action (8.32) gives

$$S_c^{(2)}[g_s,W] = \frac{1}{\lambda}\int d^d r\, \mathrm{tr}\left(\Phi_\mu[\nabla_\mu W, W]\right) \simeq \frac{2iL^d}{\lambda}\sum_q\sum_{p\in f} p_\mu\, \mathrm{tr}\left[\Phi_{\mu,-q}W_{p+q}W_{-p}\right],$$

where we have introduced the abbreviation $\Phi_\mu = g_s^{-1}\partial_\mu g_s$ and, in the last representation,
neglected the small momentum in comparison with the fast, $2p+q \simeq 2p$. To obtain all
one-loop corrections, we have to expand the functional in powers of $S_c^{(2)}[g_s,W]$ and integrate
over $W$. However, since each power of $S_c^{(2)}$ comes with one derivative acting on a slow field,
and terms of more than two such derivatives are irrelevant, it suffices to consider terms of
order $O((S_c^{(2)})^2)$. To one-loop order, the RG step thus effects the replacement

$$S[g] \to S[g_s] - \ln\left(1 + \frac{1}{2}\left\langle (S_c^{(2)}[g_s,W])^2\right\rangle_W\right) \simeq S[g_s] - \frac{1}{2}\left\langle (S_c^{(2)}[g_s,W])^2\right\rangle_W.$$

37 Although we are primarily interested in the case N = 3, it will be instructive to monitor the RG flow for general
values of N.

38 Convince yourself that contributions to the action of $O(W^3)$ also do not contribute at the one-loop level. (A
good way to do this is to invent some prototypical diagrammatic code that keeps track of the momentum
flow.) As in our previous analysis of the $\phi^4$-model, the expansion of $S[g_s,W]$ to linear order in $W$ vanishes.
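(Momentum conservation implies that it is not possible to integrate a single fast field against a single slow
field.)

Figure 8.9 (a) Graphical visualization of the matrix vertex $\mathrm{tr}(AWW)$. The line segments
connecting the boxes represent the matrix indices. (b) One-loop diagram representing the contraction
$\langle \mathrm{tr}(\Phi WW)\, \mathrm{tr}(\Phi WW)\rangle \to N\,\mathrm{tr}(\Phi\Phi)$. The factor of $N$ stems from the free summation over the
central index $j$.

Written more explicitly,

$$\left\langle (S_c^{(2)}[g_s,W])^2\right\rangle_W = \sum_{q,q'}\sum_{p,p'\in f}\left\langle \mathrm{tr}(A_{q,p}W_{p+q}W_{-p})\, \mathrm{tr}(A_{q',p'}W_{p'+q'}W_{-p'})\right\rangle_W,$$

where $A_{q,p} = \frac{2iL^d}{\lambda}p_\mu\Phi_{\mu,-q}$, i.e. an expression of the type considered in the exercise above.
Drawing on that analysis (see Fig. 8.9), one obtains

$$\left\langle (S_c^{(2)}[g_s,W])^2\right\rangle_W = -\frac{8(N-2)L^{2d}}{\lambda^2}\sum_{p\in f}(\Pi_p)^2\, p_\mu p_\nu\sum_q \mathrm{tr}(\Phi_{\mu,q}\Phi_{\nu,-q}) = -C\int d^d r\, \mathrm{tr}[\Phi_\mu\Phi_\mu] = C\int d^d r\, \mathrm{tr}[\nabla g_s\nabla g_s^{-1}] = C\lambda S[g_s],$$

where the constant

$$C = \frac{2(N-2)}{d}\int_f \frac{d^d p}{(2\pi)^d}\,\frac{1}{p^2} = \frac{2(N-2)\Omega_d}{d(2\pi)^d}\int_{\Lambda/b}^{\Lambda} dp\, p^{d-3} = \frac{2(N-2)\Omega_d\left(\Lambda^{d-2} - (\Lambda/b)^{d-2}\right)}{(2\pi)^d\, d(d-2)} \overset{d=2+\epsilon}{\simeq} \frac{(N-2)\ln b}{2\pi}.$$

(Here the factor $(2\pi)^{-d}$ is displayed explicitly, so that $\Omega_d$ stands for the bare surface area of the
unit sphere, $\Omega_2 = 2\pi$.)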


Substituting this result back into the action, one obtains

$$S[g] \to \left(1 - \frac{(N-2)\ln b\,\lambda}{4\pi}\right)S[g_s] \to \left(1 - \frac{(N-2)\ln b\,\lambda}{4\pi}\right)b^{\epsilon}S[g],$$

where in the second step we performed the rescaling of the momenta (i.e. we rescaled
$q \to bq$, in such a way that the formerly “slow” momenta again take values up to the cutoff
$\Lambda$), and noted that $\epsilon = d-2$. As expected, the RG reproduces the action up to a global
scaling factor. Absorbing this factor in a renormalized coupling constant, $\lambda_r$, we obtain
$\lambda_r^{-1} = \left(1 - \frac{(N-2)\ln b\,\lambda}{4\pi}\right)b^{\epsilon}\lambda^{-1}$, or

$$\lambda_r = \left(1 - \frac{(N-2)\ln b\,\lambda}{4\pi}\right)^{-1}b^{-\epsilon}\lambda \simeq \ln b\;\lambda\left(-\epsilon + \frac{(N-2)\lambda}{4\pi}\right) + \lambda.$$

Here, in the last equality, we have made use of the fact that both $\lambda$ and $\epsilon$ are small.$^{39}$
Finally, differentiating this result one obtains the RG equation

$$\frac{d\ln\lambda}{d\ln b} = \frac{(N-2)\lambda}{4\pi} - \epsilon + O(\lambda^2,\epsilon^2,\lambda\epsilon). \qquad (8.37)$$

The figure below shows the RG flow for generic ($N > 2$) group dimensions and different
values of $\epsilon$. As expected, for dimensions two and below, the coupling constant inexorably
flows towards a phase with large $\lambda$ where fluctuations are large (and our perturbative
analysis loses its basis) - the “disordered” type of behavior pervasive in the physics of
low-dimensional systems with continuous symmetries. However, for dimensions $d > 2$, the
model exhibits a fixed point at finite coupling strength, the zero of the right-hand side of
Eq. (8.37), $\lambda_c = 4\pi\epsilon/(N-2)$. Linearizing in
the vicinity of the fixed point, one finds a thermal exponent $y_t = \epsilon$ from which, making use
of the scaling relations, one can deduce a correlation length exponent $\nu \approx 1/\epsilon$ and a heat
capacity exponent $\alpha \approx 2 - 2/\epsilon$.
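The fixed point and the thermal exponent can be read off numerically from the one-loop flow. The sketch below takes Eq. (8.37) at face value, drops the $O(\lambda^2,\epsilon^2,\lambda\epsilon)$ corrections, and locates the fixed point as the zero of its right-hand side; the function names are hypothetical.

```python
import math


def beta_lambda(lam, eps, n):
    """d lambda / d ln b = lambda * ((n - 2) lambda / (4 pi) - eps), cf. Eq. (8.37)."""
    return lam * ((n - 2) * lam / (4.0 * math.pi) - eps)


def fixed_point(eps, n):
    """Non-zero root of beta_lambda; requires n > 2."""
    return 4.0 * math.pi * eps / (n - 2)


def thermal_exponent(eps, n, step=1e-6):
    """Slope of beta_lambda at the fixed point (central difference)."""
    lam_c = fixed_point(eps, n)
    up = beta_lambda(lam_c + step, eps, n)
    down = beta_lambda(lam_c - step, eps, n)
    return (up - down) / (2.0 * step)


if __name__ == "__main__":
    eps, n = 0.2, 3
    y_t = thermal_exponent(eps, n)
    print("lambda_c =", fixed_point(eps, n))
    print("y_t =", y_t, " nu ~", 1.0 / y_t)
```

For $\epsilon \le 0$ the function `beta_lambda` is positive for every $\lambda > 0$ (and $N > 2$), which is the runaway to strong coupling described in the text; for $\epsilon > 0$ the slope at the fixed point equals $\epsilon$, so the fixed point is unstable and separates the weak- and strong-coupling phases.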

EXERCISE To complete the analysis of the O(3) model, one may explore the magnetic exponent.
Introducing a magnetic field perturbation into the action, i.e. $h\int d^d r\, \mathrm{tr}[g + g^{-1}]$, show that,
under renormalization, $h' = b^{y_h}h$, where $y_h = 1 + \frac{d}{2} - \frac{\epsilon}{2(N-2)} + O(\epsilon^2)$. From this result, use the
exponent identities to show that the correlation function exponent $\eta \approx \epsilon/(N-2)$.


Notice that, for $d = 2$ and $N = 2$, the coupling constant does not renormalize. This is
because, for O(2) - the abelian group of planar rotations - the action of the σ-model
simplifies to $S = \frac{2}{\lambda}\int d^2 r\, (\nabla\phi)^2$, where $\phi(\mathbf{r})$ is the field of local rotation
angles. This is a free action which is not renormalized by small fluctuations. (However, in
the following section, we see that the situation is not quite as simple as it seems. The O(2)
model admits large fluctuations - vortex configurations of the angular field - which do have
a non-trivial impact on its behavior. However, being topological in nature, these excitations
are beyond the scope of our present analysis.)

[Figure: the $\beta$-function $\beta(\lambda)$ as a function of $\lambda$, for $d < 2$ and $d > 2$.]


39 The smallness of $\lambda$ is needed to justify the perturbative loop expansion. In the opposite regime $\lambda \gg 1$, the
fields $g$ fluctuate wildly and perturbative expansions are not an option.

EXERCISE Repeat the analysis above for the SU(N)-model. First derive the intermediate
identities,

$$\langle \mathrm{tr}(AW_p)\,\mathrm{tr}(A'W_{p'})\rangle_W = -\delta_{p,-p'}\frac{\Pi_p}{2}\left[\mathrm{tr}(AA') - \frac{1}{N}\mathrm{tr}(A)\,\mathrm{tr}(A')\right],$$

$$\langle \mathrm{tr}(AW_pA'W_{p'})\rangle_W = -\delta_{p,-p'}\frac{\Pi_p}{2}\left[\mathrm{tr}(A)\,\mathrm{tr}(A') - \frac{1}{N}\mathrm{tr}(AA')\right],$$

where $\Pi_p = \lambda/(L^dp^2)$, and


?WV <tr(^, q W P+q W p , +q , W_ p , )) w to & r($„, q V 

Use these results to obtain the RG equation

$$\frac{d\ln\lambda}{d\ln b} = \frac{N\lambda}{8\pi} - \epsilon + O(\lambda^2,\epsilon^2,\lambda\epsilon). \qquad (8.38)$$

Notice that the right-hand side of the SU(N = 2) equation coincides with the right-hand side of
the O(N = 3) equation. This reflects the (local) isomorphism of the groups O(3) and SU(2).


8.6 Berezinskii-Kosterlitz-Thouless transition

The majority of theories discussed in earlier chapters were defined on essentially
structureless target spaces - the real line, the complex plane, or, more generally, vector spaces of
arbitrary dimensionality. Although we have encountered a number of models with more
complex target manifolds (such as the superconductor, the Luttinger action with fields defined
on the unit circle, and the nonlinear σ-model theories discussed in the previous section),
the global geometric structure of the target space did not seem to play such an important
role: the phase behavior of the O(N)-theory appears to differ little from an unconstrained
N-component vector $\phi^4$-theory. However, the large-scale geometry, or topology of the field
manifold, may have a striking influence on the long-range behavior of a system, and it is one
of the objectives of the present section to illustrate this phenomenon on a particular
example. (A more comprehensive discussion of the role of geometry and topology in quantum
field theory will be developed in the next chapter.)

Consider then a two-dimensional square lattice with a phase-like variable $\exp(i\theta_i) \in S^1$
defined on each of its sites $i$. Demanding that the Hamiltonian or action of the system be
periodic in all $\theta_i$ and minimal on homogeneous field configurations, the most elementary
action we can formulate reads

$$S[\theta] = -J\sum_{\langle ij\rangle}\cos(\theta_i - \theta_j), \qquad (8.39)$$

where $\sum_{\langle ij\rangle}$ denotes the usual sum over nearest neighbor sites of a lattice. Equation (8.39) is
known as the action of the two-dimensional XY-model. (Its incarnation as a continuum
field theory is the U(1) version of the nonlinear σ-model discussed above.) To motivate the
terminology, one may regard $\theta_i$ as an angle parameterizing the direction of a unimodular
vector (a classical “spin”) in the two-dimensional XY-plane. Indeed, the model Eq. (8.39)
can be viewed as a two-dimensional descendant of the classical Heisenberg model, i.e. as a
model where the Heisenberg spin is confined to a two-dimensional plane (a condition in fact
realized for a number of magnetic materials).

INFO Besides its natural occurrence in magnetism, the XY-model appears in many applications
in condensed matter physics. By way of an example, we mention here the physics of granulated
metals and the Josephson junction array.

Imagine then a system of small metallic (or superconducting) islands (see figure), which are
connected by poorly conducting tunneling barriers. (Arrays of this structure are believed to
mimic the mesoscopic morphology of structurally disordered metals. However, they can also be
manufactured artificially, with islands as small as several nanometers.) On a microscopic level,
the state of the system is essentially determined by the number $N_i$ of electronic charges
populating each island $i$. However, unless the inter-grain tunneling conductance, $g_T$, is small
(i.e. $g_T < e^2/h$, the fundamental quantum unit of conductance), temporal fluctuations of the
charge degrees of freedom are vast. It is then more favorable to characterize the state of the
system in terms of the variable canonically conjugate to the charge variable, i.e. the phase
$\theta_i$. (Remember that $[\theta_i,N_j] = -i\delta_{ij}$ form a canonically conjugate pair.) The model action
$S[\theta]$ then, of course, depends on the microscopic structure of the system. However, for the case
of a superconductor array, in the classical limit (where temporal fluctuations of the phase field
$\theta(t)$ are suppressed), and neglecting charging effects, the action collapses to that of the
XY-action shown above. (Recapitulate the structure of the single-junction Josephson action
discussed in Problem 6.7 to convince yourself that this is true.)



Amongst the class of classical spin models, the two-dimensional XY-model assumes a unique
status - a fact already hinted at in consideration of the RG in the previous section. To
understand why, it is helpful to explore the high- and low-temperature expansions of the
theory. In the high-temperature phase, $J \to 0$, one may expect that the classical partition
function for the lattice system can be developed as a series expansion in $J$,

$$Z = \int \prod_i \frac{d\theta_i}{2\pi}\, e^{J\sum_{\langle ij\rangle}\cos(\theta_i-\theta_j)} = \int \prod_i \frac{d\theta_i}{2\pi} \prod_{\langle ij\rangle}\left[1 + J\cos(\theta_i-\theta_j) + O(J^2)\right].$$

Here each term in the product can be represented by a “bond” that connects neighboring
sites $i$ and $j$. To the lowest order in $J$, each bond on the lattice contributes a factor of either
one or $J\cos(\theta_i - \theta_j)$. But, since $\int_0^{2\pi} d\theta_1 \cos(\theta_1 - \theta_2) = 0$,
any graph with a single bond emanating from a site vanishes. On the other hand, a site at which two
bonds meet yields a factor
$\int_0^{2\pi} (d\theta_2/2\pi) \cos(\theta_1 - \theta_2)\cos(\theta_2 - \theta_3) = (1/2)\cos(\theta_1 - \theta_3)$.
Therefore, in this way, the classical partition function can be presented as a closed “loop” expansion.
Similarly, the high-temperature expansion can be used to estimate the spin-spin correlation function
$\langle \mathbf{S}_0 \cdot \mathbf{S}_{\mathbf r}\rangle = \langle\cos(\theta_{\mathbf r} - \theta_0)\rangle$.
To leading order, only those graphs that join sites $0$ and $\mathbf r$ will survive, implying an
exponential decay of the correlation function,

$$\langle \mathbf{S}_0 \cdot \mathbf{S}_{\mathbf r}\rangle \sim \left(\frac{J}{2}\right)^{|\mathbf r|} = \exp\left[-\frac{|\mathbf r|}{\xi}\right],$$

with a correlation length $\xi^{-1} = \ln(2/J)$. Conversely, in the low-temperature phase, where
fluctuations of neighboring phases $\theta_i$ are strongly penalized, the lattice model can be safely
replaced by the continuum theory $\frac{J}{2}\int d^2r\,(\nabla\theta)^2$. Taking only harmonic
fluctuations of the field $\theta(\mathbf r)$, i.e. neglecting the topological constraint placed by the
geometry, the spin-spin correlation function can be inferred from the rules of Gaussian integration, to wit
$\langle \mathbf S(0)\cdot\mathbf S(\mathbf r)\rangle = \mathrm{Re}\,\langle e^{i(\theta(0)-\theta(\mathbf r))}\rangle = e^{-\langle(\theta(0)-\theta(\mathbf r))^2\rangle/2}$.
Since, in two dimensions, Gaussian fluctuations grow logarithmically,
$\langle(\theta(0)-\theta(\mathbf r))^2\rangle/2 = \ln(|\mathbf r|/a)/2\pi J$, where $a$ denotes a
short-distance cutoff (e.g. the lattice spacing), one can infer a spin-spin correlation function
that decays algebraically,

$$\langle \mathbf S(0)\cdot\mathbf S(\mathbf r)\rangle \simeq \left(\frac{a}{|\mathbf r|}\right)^{1/2\pi J},$$

a phase of quasi-long-range order.
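The two angular integrals that organize the high-temperature expansion are easily checked numerically. The following is only a minimal sketch: the function names are illustrative, the uniform-grid average stands in for the integral over one period, and the estimate $(J/2)^{r}$ for a chain of $r$ bonds is the leading-order weight implied by the correlation length $\xi^{-1} = \ln(2/J)$.

```python
import math

def angular_average(f, samples=720):
    """Average f(theta) over one period on a uniform grid.

    For the low trigonometric harmonics used here, the uniform-grid average
    agrees with the exact integral up to rounding error.
    """
    return sum(f(2 * math.pi * k / samples) for k in range(samples)) / samples

def single_bond(theta2):
    """Site with a single bond: average of cos(theta1 - theta2) over theta1."""
    return angular_average(lambda t1: math.cos(t1 - theta2))

def two_bonds(theta1, theta3):
    """Site where two bonds meet: average over the shared angle theta2."""
    return angular_average(
        lambda t2: math.cos(theta1 - t2) * math.cos(t2 - theta3)
    )

def high_temperature_correlation(J, r):
    """Leading-order weight (J/2)**r of the shortest chain of r bonds (assumed form)."""
    return (J / 2.0) ** r

if __name__ == "__main__":
    print(single_bond(0.3))                                  # vanishes
    print(two_bonds(0.4, 1.1), 0.5 * math.cos(0.4 - 1.1))    # the two agree
    # inverse correlation length extracted from the decay versus ln(2/J)
    print(-math.log(high_temperature_correlation(0.5, 3)) / 3, math.log(2 / 0.5))
```

The last line illustrates that a weight of $J/2$ per bond is equivalent to an exponential decay with $\xi^{-1} = \ln(2/J)$.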

The distinction between the nature of the asymptotic decays allows for the possibility
of a finite-temperature phase transition. However, these arguments are not specific to the
XY-model: any continuous spin model will exhibit exponential decay of correlations at high
temperature and, in two dimensions, a power law decay in a low-temperature Gaussian
approximation. Now, formally, to establish that “Gaussian” behavior persists at low temperatures,
one must prove that it is not modified by higher-order terms in the gradient
expansion (including those induced by the nonlinear constraint, $\mathbf S^2 = 1$). However, while
the results of the $(2 + \epsilon)$-expansion of the O($N$) theory show that the two-dimensional
model flows to a disordered phase for $N > 2$ (in accord with the Mermin-Wagner theorem),
for $N = 2$ the theory is non-committal. Indeed, in an extension of the perturbative
RG to higher loop order, the $\beta$-function remains zero. So we are left with the question,
is there indeed a phase transition that delineates between the high- and low-temperature
asymptotics, and what is the mechanism by which the quasi-long-range order is destroyed
at higher temperatures? Since the perturbative RG suggests that higher-order terms in the
gradient expansion are not relevant, it is necessary to search for other relevant operators.


8.6.1 Vortices and the topological phase transition 

The gradient expansion describes the energy cost of small deformations around the ground
state, and applies to configurations that can be continuously deformed to the uniformly
ordered state. In separate and seminal works,$^{40}$ Berezinskii, and Kosterlitz and Thouless,
proposed that the disordering is facilitated by the “condensation” of topological defects or
vortices. Since the angle describing the orientation of a spin is defined up to an integer
multiple of $2\pi$, it is possible to construct spin configurations in which the traversal of a closed
path will see the angle rotate by $2\pi n$. The integer $n$ defines a topological charge enclosed
by the path. The discrete nature of the charge makes it impossible to find a continuous
deformation that returns the state to the uniformly ordered configuration in which the
charge is zero. (More generally, topological defects arise in many models with a compact
group describing the order parameter - e.g., a domain wall defect (e.g. an instanton) in a
model with $Z_2$ symmetry, or a “skyrmion” configuration in an O(3) three-component spin
Heisenberg ferromagnet; see the next chapter for a discussion of topological defects.)

40 V. L. Berezinskii, Violation of long range order in one-dimensional and two-dimensional systems with a
continuous symmetry group. I. Classical systems, Sov. Phys. JETP 32 (1971), 493-500; J. M. Kosterlitz and
D. J. Thouless, Ordering, metastability, and phase transitions in two-dimensional systems, J. Phys. C 6 (1973),
1181-203.

The elementary defect, known in the present context as a vortex configuration, has a unit “charge”:
in completing a circle centered on the defect, the orientation of the spin changes by $\pm 2\pi$
(see Fig. 8.10). If the radius $r$ of the circle is sufficiently large, the variations in angle will
be small and the underlying lattice structure of the spin model can be neglected. By symmetry
$\nabla\theta$ has uniform magnitude and points along the azimuthal direction. The magnitude of the
distortion can be obtained by integrating around a path that encloses the defect,
$\oint \nabla\theta \cdot d\mathbf l = 2\pi n$, i.e. $\nabla\theta = \frac{n}{r}\,\mathbf e_z \times \mathbf e_r$,
where $\mathbf e_r$ and $\mathbf e_z$ are unit vectors respectively in the plane and perpendicular
to it. This continuum approximation fails close to the center (core) of the vortex where the
lattice structure becomes important.
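The statement that the charge is fixed by the net rotation of the angle along a closed path can be illustrated with a short numerical sketch. The model field $\theta = n \arctan(y/x)$ and the helper names below are assumptions made for illustration; the contour is a circle traversed counterclockwise, and angle differences between neighboring points are reduced to the interval $[-\pi, \pi)$ before being summed.

```python
import math

def vortex_angle(x, y, n=1, x0=0.0, y0=0.0):
    """Angle field theta = n * atan2(y - y0, x - x0) of a charge-n vortex at (x0, y0)."""
    return n * math.atan2(y - y0, x - x0)

def wrap(delta):
    """Reduce an angle difference to the interval [-pi, pi)."""
    return (delta + math.pi) % (2 * math.pi) - math.pi

def winding_number(theta, center=(0.0, 0.0), radius=5.0, steps=400):
    """Sum wrapped angle differences along a counterclockwise circle; return total / 2 pi."""
    cx, cy = center
    total = 0.0
    previous = theta(cx + radius, cy)
    for k in range(1, steps + 1):
        phi = 2 * math.pi * k / steps
        current = theta(cx + radius * math.cos(phi), cy + radius * math.sin(phi))
        total += wrap(current - previous)
        previous = current
    return round(total / (2 * math.pi))

if __name__ == "__main__":
    print(winding_number(lambda x, y: vortex_angle(x, y, n=1)))
    print(winding_number(lambda x, y: vortex_angle(x, y, n=-1)))
    # a contour that does not enclose the vortex core
    print(winding_number(lambda x, y: vortex_angle(x, y, n=1), center=(10.0, 0.0)))
```

The number of sampling points must be large enough that the true angle difference between neighboring points stays below $\pi$; otherwise the wrapping step misidentifies the charge.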

Although the defect-driven topological phase transition can be described within the framework
of the RG (see below), the nature of the transition can be understood from a simpler
and intuitive line of reasoning that one can find in the original paper by Kosterlitz and
Thouless. The energy cost of a single vortex of charge $n$ can be divided into contributions
from the core region as well as from the relatively uniform distortions away from the center.
The distinction between regions inside and outside the core is arbitrary (see below) and, for
simplicity, we shall use a circle of radius $a$ to distinguish the two, i.e. for a vortex of charge
$n$, the associated action is given by

$$S_n = S_n^{\rm core}(a) + \frac{J}{2}\int_a d^2r\,(\nabla\theta)^2 = S_n^{\rm core}(a) + \pi J n^2 \ln\left(\frac{L}{a}\right).$$

The dominant contribution to the action arises from the region outside the core and diverges
logarithmically with the system size $L$.$^{41}$ The large energy cost associated with the defects
inhibits their spontaneous formation at low temperatures and protects the integrity of the
quasi-long-range ordered phase. To explore the range over which vortices are suppressed,
one may explore the partition function for a configuration with just a single vortex of unit
charge $n = 1$,

$$Z_1 \approx \left(\frac{L}{a}\right)^2 \exp\left[-S_1^{\rm core}(a) - \pi J \ln\left(\frac{L}{a}\right)\right]. \tag{8.40}$$

Here the factor of $(L/a)^2$ results from the configurational entropy associated with the range
of possible vortex locations in an area of size $L^2$. The entropy and energy of a vortex both
grow as $\ln L$, and the free energy is dominated by one or the other. At low temperatures (i.e.
large $J$) the potential energy dominates over the entropy and $Z_1$, a measure of the weight
of configurations with a single vortex, vanishes in the limit $L \to \infty$. However, when
$J < J_c = 2/\pi$, the entropic contribution overwhelms the potential and one may expect the
spontaneous formation of vortices and a transition to the disordered phase.

41 Notice that, if the spin degrees of freedom have three components or more, the energy cost of a defect is
only finite. Following the arguments given in the text, one may reflect on the implications of this fact for the
inability of topological defects to effect a topological transition in systems with $N > 2$.

David J. Thouless, 1934-
co-recipient with John Michael Kosterlitz of the 2000 Lars Onsager Prize “for the introduction of the
theory of topological phase transitions, as well as their subsequent quantitative predictions by means
of early and ingenious applications of the renormalization group, and advancing the understanding of
electron localization and the behavior of spin glasses.”

Figure 8.10 Spin configurations of the two-dimensional XY-model showing vortices of charge ±1.
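The competition between the logarithmic energy and the logarithmic entropy of a single vortex can be made explicit in a few lines. This is a minimal sketch assuming the single-vortex weight $Z_1 \approx (L/a)^2 \exp[-S_1^{\rm core} - \pi J \ln(L/a)]$; the function name and the default core action of zero are illustrative choices.

```python
import math

J_C = 2.0 / math.pi  # coupling at which energy and entropy balance

def log_Z1(J, L_over_a, S_core=0.0):
    """ln Z_1 = (2 - pi J) ln(L/a) - S_core for a single vortex of unit charge."""
    return (2.0 - math.pi * J) * math.log(L_over_a) - S_core

if __name__ == "__main__":
    for J in (0.5, J_C, 1.0):
        # the sign of ln Z_1 at large L/a decides whether vortices are favored
        print(J, log_Z1(J, 1e6))
```

For $J > 2/\pi$ the logarithm becomes ever more negative with increasing system size, while for $J < 2/\pi$ it grows without bound, which is the entropic instability described in the text.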

In fact one might expect that this estimate of $J_c$ represents only a lower bound for the
stability of the system towards the condensation of topological defects. This is because pairs
(dipoles) of defects may appear at larger couplings. Consider a pair of charges $\pm 1$ separated
by a distance $d$ (see Fig. 8.10). Distortions far from the core, $|\mathbf r| \gg d$, can be obtained by
superposing those of the individual vortices, i.e.
$\nabla\theta = \nabla\theta_+(\mathbf r + \mathbf d/2) - \nabla\theta_+(\mathbf r - \mathbf d/2) \approx \mathbf d\cdot\nabla\left(\mathbf e_z \times \mathbf e_r/|\mathbf r|\right) \sim d/|\mathbf r|^2$,
where $\nabla\theta_+$ denotes the distortion of a single vortex of unit charge.
Integrating this distortion, one may infer that the energy cost of the
configuration is only finite, and hence dipoles appear with the appropriate Boltzmann weight
at any non-zero temperature. The low-temperature phase should therefore be visualized as
a “gas” of tightly bound dipoles, their density and size increasing with temperature. The
high-temperature phase constitutes a plasma of unbound vortices.

To a large extent, the argument above conveys much of the phenomenology of the
Berezinskii-Kosterlitz-Thouless (BKT) phase transition. However, one may justifiably question
the impact of harmonic fluctuations and dipole interactions in suppressing or even
destroying the transition. To explore these effects, one has to turn to a more rigorous line
of reasoning. To do so, we will once again draw on the RG.
8.6.2 RG analysis of the BKT transition

To prepare for the application of the RG, we must first reorganize the action to emphasize
the different channels of fluctuations. Fortunately, for the two-dimensional XY-model, the
different channels (harmonic and topological) can be neatly separated. To understand why,
it is helpful to draw an analogy, noting that the distortion field, $\mathbf u = \nabla\theta$, is similar
to the velocity of a fluid. In the absence of vorticity, the flow is potential, i.e.
$\mathbf u = \mathbf u_0 = \nabla\phi$, and $\nabla\times\mathbf u_0 = 0$. The topological charge can be
related to the vorticity $\nabla\times\mathbf u$ by noting that, for any closed path,
$\oint d\mathbf l\cdot\mathbf u = \int d^2r\,\mathbf e_z\cdot\nabla\times\mathbf u$, where the second integral
is over the area enclosed by the path. Since the left-hand side is an integer multiple of $2\pi$, one can set
$\nabla\times\mathbf u = 2\pi\,\mathbf e_z\sum_i n_i\,\delta^2(\mathbf r - \mathbf r_i)$, describing a
collection of vortices of charge $\{n_i\}$ at locations $\{\mathbf r_i\}$. The solution to this equation can
be obtained by setting $\mathbf u = \mathbf u_0 - \nabla\times(\mathbf e_z\psi)$, leading to

$$\nabla\times\mathbf u = \mathbf e_z\nabla^2\psi \quad\Rightarrow\quad \nabla^2\psi = 2\pi\sum_i n_i\,\delta^2(\mathbf r - \mathbf r_i). \tag{8.41}$$

Thus, the field $\psi$ behaves as the potential due to a set of point charges $\{2\pi n_i\}$ while the
solution, $\psi(\mathbf r) = \sum_i n_i\ln(|\mathbf r - \mathbf r_i|)$, is simply a superposition of the
potentials. Any two-dimensional distortion can therefore be written as
$\mathbf u = \mathbf u_0 + \mathbf u_1 = \nabla\phi - \nabla\times(\mathbf e_z\psi)$, where
$\mathbf u_0$ accommodates the harmonic field fluctuations while $\mathbf u_1$ includes the topological defects.
The corresponding action, $S[\mathbf u] = \frac{J}{2}\int d^2r\,\mathbf u^2$, can be decomposed as

$$S[\mathbf u] = \frac{J}{2}\int d^2r\left[(\nabla\phi)^2 - 2\nabla\phi\cdot\nabla\times(\mathbf e_z\psi) + (\nabla\times\mathbf e_z\psi)^2\right].$$

An integration by parts shows that the second term vanishes ($\nabla\cdot\nabla\times\mathbf u = 0$), while the
third term can be simplified by noting that $\nabla\psi$ and $\nabla\times(\mathbf e_z\psi)$ are orthogonal
vectors of equal length. Hence, making use of Eq. (8.41), the action associated with the topological
defects takes the form

$$S_t = \frac{J}{2}\int d^2r\,(\nabla\times\mathbf e_z\psi)^2 \approx -\frac{J}{2}\int d^2r\,\psi\nabla^2\psi = -2\pi^2 J\sum_{i,j} n_i n_j\,C(\mathbf r_i - \mathbf r_j),$$

where $C(\mathbf x) = \ln|\mathbf x|/2\pi$ is the two-dimensional Coulomb potential. Once again, the
unphysical divergence of the potential for $i = j$ must be regularized by the core energy, whence

$$S_t = \sum_i S_{n_i}^{\rm core} - 4\pi^2 J\sum_{i<j} n_i n_j\,C(\mathbf r_i - \mathbf r_j). \tag{8.42}$$
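As an illustration of the Coulomb gas form of the vortex action, the sketch below evaluates $S_t = \sum_i S^{\rm core}_{n_i} - 4\pi^2 J \sum_{i<j} n_i n_j C(\mathbf r_i - \mathbf r_j)$ for a given set of charges and positions. The function names, the representation of positions as coordinate pairs in units of the cutoff $a$, and the default of vanishing core actions are assumptions made for the example.

```python
import math
from itertools import combinations

def coulomb(r1, r2):
    """Two-dimensional Coulomb potential C(r1 - r2) = ln|r1 - r2| / (2 pi)."""
    return math.log(math.hypot(r1[0] - r2[0], r1[1] - r2[1])) / (2 * math.pi)

def vortex_action(charges, positions, J, core_action=None):
    """Core terms minus 4 pi^2 J times the pairwise Coulomb sum over i < j."""
    if core_action is None:
        core_action = lambda n: 0.0  # assumption: core actions set to zero
    core = sum(core_action(n) for n in charges)
    pairs = sum(
        charges[i] * charges[j] * coulomb(positions[i], positions[j])
        for i, j in combinations(range(len(charges)), 2)
    )
    return core - 4 * math.pi ** 2 * J * pairs

if __name__ == "__main__":
    # a vortex-antivortex pair at separation d costs 2 pi J ln(d/a)
    for d in (2.0, 10.0, 100.0):
        print(d, vortex_action([+1, -1], [(0.0, 0.0), (d, 0.0)], J=1.0))
```

For a neutral pair the action grows only logarithmically with the separation, which is the attraction that binds dipoles at low temperatures.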

Conveniently, the configuration space of the XY-model can thus be partitioned into
different topological regions: one set arises from the degrees of freedom associated with the
charges, $\{n_i\}$, while the other reflects the harmonic spin-wave excitations associated with
the field $\phi(\mathbf r)$. The partition function of the lattice model is then approximately given by
$Z \approx Z_{\rm s.w.}Z_t$ where

$$Z_{\rm s.w.} = \int D\phi\,\exp\left[-\frac{J}{2}\int d^2r\,(\nabla\phi)^2\right], \qquad Z_t = \sum_{\{n_i\}}\int\prod_i d^2r_i\; e^{-S_t},$$

and the $\mathbf r_i$ are dimensionless integration variables $\mathbf r_i = \mathbf x_i/a$, where
$\mathbf x_i$ is the position of the vortex center. (Exercise: Consider why combinatorial factors,
$(N!)^2$ for a configuration with $N$ vortices of each sign, are needed to prevent overcounting of
identical charge configurations.) To summarize, $Z_t$ describes a grand canonical ensemble of “charges”
with a two-dimensional Coulomb interaction. Note that our derivation of the action relied on an
integration by parts. The surface integral that was neglected in the process grows with the system
size as $\ln L\sum_i n_i$. Therefore, one must impose on the field configurations contributing to $Z_t$
the constraint of overall charge neutrality. In the following, to simplify our analysis further, we will
limit considerations to only elementary excitations with $n_i = \pm 1$.$^{42}$ Then, if we define the
fugacity of the vortices as $y_0 = \exp[-S_1^{\rm core}]$, the partition function

$$Z_t = \sum_{N=0}^{\infty}\frac{y_0^{2N}}{(N!)^2}\int\prod_{j=1}^{2N} d^2r_j\,\exp\left[4\pi^2 J\sum_{i<j} n_i n_j\,C(\mathbf r_i - \mathbf r_j)\right] \tag{8.43}$$

reduces to that of a neutral Coulomb plasma of $2N$ unit charges $n_i = \pm 1$.


EXERCISE The low-temperature physics of many two-dimensional systems is effectively reduced
to that of the XY-model or of the two-dimensional Coulomb plasma. For this reason, the equivalence
XY-model $\leftrightarrow$ Coulomb plasma demonstrated above represents an important result in its
own right. In fact, there exists a third important system falling into this universality class: the
two-dimensional sine-Gordon model defined by the action

$$S[\theta] = \frac{c}{2}\int d^2r\,(\nabla\theta)^2 - g\int d^2r\,\cos(\theta), \tag{8.44}$$

where $\theta$ is a real scalar field. To demonstrate its equivalence to the Coulomb gas model, expand
the partition function in powers of $g$ and show that

$$Z = \sum_{N=0}^{\infty}\frac{y_0^{2N}}{(N!)^2}\int\prod_{i=1}^{2N} d^2r_i\,\left\langle\exp\left(i\sum_{i=1}^{2N}(-)^i\,\theta(\mathbf r_i)\right)\right\rangle,$$

where $y_0 = g/2$ and the angular brackets denote averaging over the free action. (To prove that
positive and negative phases appear in equal numbers, consider the role of the integration over
the zero mode $\theta(\mathbf r) = {\rm const.}$) Using the fact that
$\langle\theta(\mathbf r)\theta(\mathbf r')\rangle = -C(|\mathbf r - \mathbf r'|)/c$, and neglecting
the infinite self-interaction of the fields at coinciding points, show that the partition function
becomes identical to Eq. (8.43) when $c = 1/8\pi^2 J$.

Since $Z_{\rm s.w.}$ and $Z_t$ involve independent degrees of freedom, they can be treated separately.
Further, as the Gaussian partition function $Z_{\rm s.w.}$ is everywhere analytic, any phase transition
of the XY-model must originate from the Coulomb gas. Now, as mentioned above, in the
low-temperature phase, charges appear only as tightly bound dipole pairs while, at high
temperatures, the dipoles dissociate, forming a plasma. The two phases can be distinguished
by examining the interaction between two external test charges at some large separation $X$.
In the absence of internal charges, the two particles interact by the bare Coulomb interaction
$C(X)$. The presence of internal charges provides a mechanism to screen the interaction,
$C(X) \to C(X)/\epsilon$, where $\epsilon$ is an effective dielectric constant. With this interpretation, the
transition with increasing temperature (or increasing $y_0$) can be viewed as a transition from
an insulating (dipolar) phase to a metallic (plasma) phase in which the external charges are
completely screened and their effective interaction decays exponentially.

42 A charge two vortex costs about as much energy as two unit-charge vortices. However, its entropic
contribution $\sim 2T\ln(L/a)$ to the free energy is much lower than the contribution $\sim 4T\ln(L/a)$ of
the two-vortex configuration. This is why vortices of high winding number are negligible in the
low-temperature limit.

To quantify this picture, one can compute the effective interaction between two external
charges at $\mathbf r$ and $\mathbf r'$ perturbatively in the fugacity $y_0$. To lowest order, the effects
of screening can be incorporated by including configurations with only two internal charges (positioned
at $\mathbf s$ and $\mathbf s'$). In this case, the effective interaction can be written as

$$e^{-S_{\rm eff}(\mathbf r-\mathbf r')} = e^{-4\pi^2 JC(\mathbf r-\mathbf r')}\,\frac{1 + y_0^2\int d^2s\,d^2s'\,e^{-4\pi^2 JC(\mathbf s-\mathbf s') + 4\pi^2 J[C(\mathbf r-\mathbf s)-C(\mathbf r-\mathbf s')-C(\mathbf r'-\mathbf s)+C(\mathbf r'-\mathbf s')]} + O(y_0^4)}{1 + y_0^2\int d^2s\,d^2s'\,e^{-4\pi^2 JC(\mathbf s-\mathbf s')} + O(y_0^4)},$$

from which one obtains

$$e^{-S_{\rm eff}(\mathbf r-\mathbf r') + 4\pi^2 JC(\mathbf r-\mathbf r')} = 1 + y_0^2\int d^2s\,d^2s'\,e^{-4\pi^2 JC(\mathbf s-\mathbf s')}\left(e^{4\pi^2 JD(\mathbf r,\mathbf r',\mathbf s,\mathbf s')} - 1\right) + O(y_0^4). \tag{8.45}$$

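Here, $D(\mathbf r,\mathbf r',\mathbf s,\mathbf s') = C(\mathbf r-\mathbf s) - C(\mathbf r-\mathbf s') - C(\mathbf r'-\mathbf s) + C(\mathbf r'-\mathbf s')$
denotes the interaction between the internal and external dipoles, while the direct interaction
$C(\mathbf s-\mathbf s')$ tends to keep the separation $\mathbf x = \mathbf s' - \mathbf s$ small. Defining the
center of mass $\mathbf X = (\mathbf s+\mathbf s')/2$, we can change variables to
$\mathbf s = \mathbf X - \mathbf x/2$ and $\mathbf s' = \mathbf X + \mathbf x/2$, and expand the dipole-dipole
interaction in small $\mathbf x$ as
$D(\mathbf r,\mathbf r',\mathbf s,\mathbf s') \simeq -\mathbf x\cdot\nabla_{\mathbf X}C(\mathbf r-\mathbf X) + \mathbf x\cdot\nabla_{\mathbf X}C(\mathbf r'-\mathbf X) + O(x^3)$.
To the same order

$$e^{4\pi^2 JD(\mathbf r,\mathbf r',\mathbf s,\mathbf s')} - 1 = -4\pi^2 J\,\mathbf x\cdot\nabla_{\mathbf X}\left(C(\mathbf r-\mathbf X) - C(\mathbf r'-\mathbf X)\right) + 8\pi^4 J^2\left[\mathbf x\cdot\nabla_{\mathbf X}\left(C(\mathbf r-\mathbf X) - C(\mathbf r'-\mathbf X)\right)\right]^2 + O(x^3).$$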

Substituting this expression into Eq. (8.45), and changing integration variables, one finds
that the term linear in $\mathbf x$ integrates to zero while the angular average of
$(\mathbf x\cdot\nabla_{\mathbf X}C)^2$ leads to $x^2(\nabla_{\mathbf X}C)^2/2$. Thus, to $O(x^4)$, one obtains

$$e^{-S_{\rm eff}(\mathbf r-\mathbf r')} = e^{-4\pi^2 JC(\mathbf r-\mathbf r')}\left[1 + y_0^2\int(dx\,2\pi x)\,e^{-4\pi^2 JC(x)}\,8\pi^4 J^2\,\frac{x^2}{2}\int d^2X\left(\nabla_{\mathbf X}\left(C(\mathbf r-\mathbf X) - C(\mathbf r'-\mathbf X)\right)\right)^2\right].$$

Using the identity $\nabla^2 C(\mathbf r) = \delta^2(\mathbf r)$, one may note that
$\int d^2X\left[\nabla_{\mathbf X}\left(C(\mathbf r-\mathbf X) - C(\mathbf r'-\mathbf X)\right)\right]^2 = 2\left(C(\mathbf r-\mathbf r') - C(0)\right)$.
Finally, absorbing the short-distance divergence into an appropriate
cutoff with $C(x) \to \ln(x/a)/2\pi$, one arrives at the expression

$$e^{-S_{\rm eff}(\mathbf r-\mathbf r')} = e^{-4\pi^2 JC(\mathbf r-\mathbf r')}\left[1 + 16\pi^5 J^2 y_0^2\,C(\mathbf r-\mathbf r')\int_1^\infty dx\,x^3 e^{-2\pi J\ln x} + O(y_0^4)\right],$$

where the variable $x$ has been rescaled to absorb the short distance cutoff. Exponentiating
the second term, one obtains the effective interaction
$S_{\rm eff}(\mathbf r-\mathbf r') = 4\pi^2 J_{\rm eff}\,C(\mathbf r-\mathbf r')$, where

$$J_{\rm eff} = J - 4\pi^3 J^2 y_0^2\int_1^\infty dx\,x^{3-2\pi J} + O(y_0^4). \tag{8.46}$$

From this result, to leading order in $y_0$, one can infer the dielectric constant of the medium
as $\epsilon = J/J_{\rm eff}$. However, the perturbative correction is small only as long as the integral
remains convergent at large $x$, i.e. for $J > 2/\pi$. The breakdown of the perturbation theory for
$J < J_c = 2/\pi$ occurs precisely at the point where the free energy of an isolated vortex changes sign.
Indeed, this breakdown of the perturbation theory is reminiscent of that encountered in the
$\phi^4$-theory for $d < 4$. This being so, one may propose a reorganization of the perturbation series
into an RG calculation for the parameters $J$ and $y_0$.
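The convergence criterion for the perturbative correction can be made concrete by evaluating the integral in closed form, $\int_1^\infty dx\, x^{3-2\pi J} = 1/(2\pi J - 4)$ for $J > 2/\pi$. The sketch below assumes the lowest-order result $J_{\rm eff} = J - 4\pi^3 J^2 y_0^2 \int_1^\infty dx\, x^{3-2\pi J}$; the function names are illustrative, and the divergent case is represented by an infinite value rather than an exception.

```python
import math

def screening_integral(J):
    """Integral of x**(3 - 2 pi J) from 1 to infinity; finite only for J > 2/pi."""
    exponent = 4.0 - 2.0 * math.pi * J
    if exponent >= 0.0:
        return math.inf
    return -1.0 / exponent  # equals 1 / (2 pi J - 4)

def J_eff(J, y0):
    """Lowest-order effective stiffness; returns -inf where the correction diverges."""
    if y0 == 0.0:
        return J  # no internal charges, no screening
    return J - 4.0 * math.pi ** 3 * J ** 2 * y0 ** 2 * screening_integral(J)

def dielectric_constant(J, y0):
    """epsilon = J / J_eff, meaningful only while the correction is small."""
    return J / J_eff(J, y0)

if __name__ == "__main__":
    for J in (2.0, 1.0, 0.7, 0.64):
        print(J, J_eff(J, 0.01), dielectric_constant(J, 0.01))
```

As $J$ approaches $2/\pi \simeq 0.637$ from above, the correction grows without bound for any fixed fugacity, which signals the failure of the naive expansion.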

The difficulty associated with the divergence at small $J$ can be overcome by employing a
renormalization procedure first used by José et al.$^{43}$ By breaking the integral in Eq. (8.46)
into two parts, i.e. $\int_1^\infty \to \int_1^b + \int_b^\infty$, the non-singular short-distance
contribution to the integral can be evaluated and incorporated into $J$. This procedure can be carried
out order by order in $y_0$ even though the coefficient of $y_0^2$ is formally divergent. This leads to a new
equation $J_{\rm eff}^{-1} = \tilde J^{-1} + 4\pi^3 y_0^2\int_b^\infty dx\,x^{3-2\pi J} + O(y_0^4)$, where
$\tilde J^{-1} = J^{-1} + 4\pi^3 y_0^2\int_1^b dx\,x^{3-2\pi J} + O(y_0^4)$. The variables in the remaining
integral can be rescaled ($x \to x/b$) to yield an equation for $J_{\rm eff}$ equivalent to Eq. (8.46) but
with shifted and rescaled parameters $\tilde J$ and $\tilde y_0$:

$$J_{\rm eff}^{-1} = \tilde J^{-1} + 4\pi^3\tilde y_0^2\int_1^\infty dx\,x^{3-2\pi\tilde J} + O(\tilde y_0^4),$$

where $\tilde y_0 = b^{2-\pi J}y_0$. By choosing an infinitesimal renormalization, i.e.
$b = e^{\ell} \simeq 1 + \ell$, one obtains the differential recursion relations

$$\frac{dJ^{-1}}{d\ell} = 4\pi^3 y_0^2 + O(y_0^4), \qquad \frac{dy_0}{d\ell} = (2 - \pi J)\,y_0 + O(y_0^3). \tag{8.47}$$

From these equations, one may note that the (inverse) coupling constant $J^{-1}$ increases
monotonically with $\ell$ while the recursion relation for $y_0$ changes sign at $J_c^{-1} = \pi/2$. At
high temperatures, when $J$ is small, $y_0$ is relevant, while at lower temperatures it becomes
irrelevant. The RG flows, shown in Fig. 8.11(a), separate the parameter space into two
regions. At low temperatures, and small $y_0$, flows terminate on a fixed line at $y_0 = 0$ and
$J_{\rm eff} \ge 2/\pi$. This is the insulating phase, in which only dipoles of finite size occur (hence the
vanishing of $y_0$ under coarse-graining). The strength of the effective interaction is given by
the point on the fixed line at which the flow terminates. Flows that do not terminate on the
fixed line asymptote to larger values of $J^{-1}$ and $y_0$ where perturbation theory eventually
breaks down. This is the signal of the high-temperature phase where vortices proliferate.

43 J. V. José, L. P. Kadanoff, S. Kirkpatrick, and D. R. Nelson, Renormalization, vortices, and symmetry-breaking
perturbations in the two-dimensional planar model, Phys. Rev. B 16 (1977), 1217-41.

Figure 8.11 Schematic RG flow diagram for the XY-model. (a) Far from the critical point, and
(b) close to the critical point.
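The two regimes of the flow can be explored by integrating the recursion relations $dJ^{-1}/d\ell = 4\pi^3 y_0^2$ and $dy_0/d\ell = (2 - \pi J)\,y_0$ numerically. The following is a minimal sketch using a plain Euler scheme; the step size, the maximal scale $\ell_{\max}$ and the fugacity cap at which the integration is abandoned (as a stand-in for the breakdown of perturbation theory) are all arbitrary illustrative choices.

```python
import math

def rg_step(J_inv, y, dl):
    """One Euler step of dJ^{-1}/dl = 4 pi^3 y^2 and dy/dl = (2 - pi J) y."""
    dJ_inv = 4.0 * math.pi ** 3 * y ** 2
    dy = (2.0 - math.pi / J_inv) * y
    return J_inv + dl * dJ_inv, y + dl * dy

def flow(J, y0, l_max=20.0, dl=1e-3, y_cap=1.0):
    """Integrate from (J, y0); stop early once y reaches y_cap (perturbation theory fails)."""
    J_inv, y = 1.0 / J, y0
    for _ in range(int(l_max / dl)):
        if y >= y_cap:
            break
        J_inv, y = rg_step(J_inv, y, dl)
    return 1.0 / J_inv, y

if __name__ == "__main__":
    print(flow(1.0, 0.01))  # low temperature: y -> 0, J settles above 2/pi
    print(flow(0.5, 0.01))  # high temperature: y grows until the cap is reached
```

In the first case the final stiffness can be compared with the perturbative estimate of Eq. (8.46); in the second, the value of $\ell$ at which the cap is reached sets the scale beyond which vortices are effectively free.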

The critical trajectory that separates the two regions of the phase diagram flows to a
fixed point at $(J_c^{-1} = \pi/2,\ y_0 = 0)$. To explore the critical behavior at the transition, one
must expand the recursion relations in the vicinity of this point. Setting $t = J^{-1} - \pi/2$ and
$y = y_0$, to lowest order, Eq. (8.47) simplifies to

$$\frac{dt}{d\ell} = 4\pi^3 y^2 + O(ty^2, y^4), \qquad \frac{dy}{d\ell} = \frac{4}{\pi}\,ty + O(t^2 y, y^3).$$

In contrast to those discussed in previous sections, the recursion relations here are inherently
nonlinear in the vicinity of the fixed point (see Fig. 8.11(b)), suggesting that the critical
behavior is likely to be novel. To uncover the structure, one may note that $t^2 - \pi^4 y^2$ is a
conserved quantity (since $d(t^2)/d\ell = 8\pi^3 ty^2 = \pi^4\,d(y^2)/d\ell$), i.e. the flow proceeds along
hyperbolae characterized by different values of $c = t^2 - \pi^4 y^2$. For $c < 0$, the focus of the
hyperbola is along the $y$-axis, and the flows proceed to $(t, y) \to \infty$. Conversely, hyperbolae with
$c > 0$ have foci along the $t$-axis, and have two branches in the half plane $y > 0$: the branches for
$t < 0$ flow to the fixed line, while those in the $t > 0$ quadrant flow to infinity. The critical trajectory
separating flows to zero and infinite $y$ corresponds to $c = 0$, i.e. $t_c = -\pi^2 y_c$. Therefore, a small
but finite fugacity $y_0$ reduces the critical temperature to $J_c^{-1} = \pi/2 - \pi^2 y_0 + O(y_0^2)$.
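The conservation of $c = t^2 - \pi^4 y^2$ along the linearized flow $dt/d\ell = 4\pi^3 y^2$, $dy/d\ell = (4/\pi)\,ty$ can be verified numerically. The sketch below uses a standard fourth-order Runge-Kutta step; the initial point, step size and function names are illustrative assumptions.

```python
import math

def rhs(t, y):
    """Right-hand sides of the linearized recursion relations near the fixed point."""
    return 4.0 * math.pi ** 3 * y * y, (4.0 / math.pi) * t * y

def linearized_flow(t, y, l_max=20.0, dl=1e-3):
    """Integrate the linearized flow with a fourth-order Runge-Kutta scheme."""
    for _ in range(int(round(l_max / dl))):
        k1 = rhs(t, y)
        k2 = rhs(t + 0.5 * dl * k1[0], y + 0.5 * dl * k1[1])
        k3 = rhs(t + 0.5 * dl * k2[0], y + 0.5 * dl * k2[1])
        k4 = rhs(t + dl * k3[0], y + dl * k3[1])
        t += dl * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0
        y += dl * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0
    return t, y

def invariant(t, y):
    """Conserved combination c = t^2 - pi^4 y^2."""
    return t * t - math.pi ** 4 * y * y

if __name__ == "__main__":
    t0, y0 = -0.5, 0.02            # c > 0 and t < 0: flows to the fixed line
    t1, y1 = linearized_flow(t0, y0)
    print(invariant(t0, y0), invariant(t1, y1))
    print(t1, -math.sqrt(invariant(t0, y0)))  # end point on the fixed line
```

For this initial point the trajectory ends on the fixed line at $t = -\sqrt{c}$, $y = 0$, as expected for a branch with $c > 0$ and $t < 0$.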

In terms of the original XY-model, the low-temperature phase is characterized by a
line of fixed points with $J_{\rm eff} = \lim_{\ell\to\infty}J(\ell) \ge 2/\pi$. Here the phase correlations
decay as a power law, i.e. $\langle\cos(\theta(\mathbf r) - \theta(0))\rangle \sim 1/|\mathbf r|^{\eta}$, with
$\eta = 1/2\pi J_{\rm eff} \le 1/4$. Since the parameter $c$ is positive in the low-temperature phase (where the
flow terminates on the fixed line), and vanishes at the critical point, we can set
$c = b^2(T_c - T)$ close to the transition. In other words, the trajectory of initial points
tracks a line $(t(T), y(T))$. The resulting $c = t^2 - \pi^4 y^2 \propto (T_c - T)$ is a linear measure of the
proximity of the phase transition. Under renormalization, such trajectories flow to a fixed
point at $y = 0$, and $t = -b\sqrt{T_c - T}$. Thus, in the vicinity of the transition, the effective
interaction parameter,

$$J_{\rm eff} = \frac{2}{\pi} - \frac{4}{\pi^2}\lim_{\ell\to\infty}t(\ell) \simeq \frac{2}{\pi} + \frac{4b}{\pi^2}\sqrt{T_c - T},$$

exhibits a square root singularity.
Figure 8.12 Measurement of the superfluid density $\rho_s(T_c)$ as a function of $T_c$ (in K). The
approximate linearity $\rho_s \sim T_c$ demonstrates the universality of the stiffness constant $J$ at the
transition. (Reprinted with permission from D. J. Bishop and J. D. Reppy, Study of the superfluid
transition in two-dimensional $^4$He films, Phys. Rev. Lett. 40 (1978), p. 1727-30. Copyright (1978) by
the American Physical Society.)


INFO The stiffness, $J_{\rm eff}$, can be measured directly in experiments on superfluid films. As
we have seen in Chapter 6, in the superfluid phase, the low-temperature physics of neutral superfluids
is described by a phase action with a stiffness $J = \beta\rho_s/m^2$ determined by the superfluid density
of the system. The density $\rho_s$ is measured by examining the changes in the inertia of a torsional
oscillator; the superfluid fraction, $\rho_s$, experiences no friction and does not oscillate.

Bishop and Reppy examined $\rho_s$ for a variety of superfluid films (of different thickness, $^4$He
concentration, etc.) wrapped around a torsional cylinder. By inferring the effective stiffness $J$ as
a function of temperature, they found that, for all films, it undergoes a universal jump of $2/\pi$ at
the transition while the behavior of $J$ for $T < T_c$ was consistent with a square root singularity.
They also observed (see Fig. 8.12) that the ratio $\rho_s(T_c)/T_c = J_c m^2 = 2m^2/\pi$ is a universal
quantity independent of the material parameters of the film.

In the high-temperature phase, the correlations decay exponentially. In this phase the
parameter $c = t^2 - \pi^4 y^2 = -b^2(T - T_c)$ is negative all along the hyperbolic trajectory. Here
the recursion relation $\frac{dt}{d\ell} = 4\pi^3 y^2 = \frac{4}{\pi}\left(t^2 + b^2(T - T_c)\right)$ can be
integrated to give

$$\frac{4}{\pi}\,\ell \simeq \frac{1}{b\sqrt{T - T_c}}\arctan\left(\frac{t}{b\sqrt{T - T_c}}\right).$$

The integration must be terminated when $t(\ell) \sim y(\ell) \sim 1$ since, beyond there, the perturbative
calculation is invalid. This occurs for a value $\ell^* \simeq \pi^2/8b\sqrt{T - T_c}$, where we have used
the fact that $\arctan(1/b\sqrt{T - T_c}) \simeq \pi/2$. The resulting correlation length is given by

$$\xi \simeq a\,e^{\ell^*} \simeq a\exp\left[\frac{\pi^2}{8b\sqrt{T - T_c}}\right], \tag{8.48}$$

i.e. in contrast to the usual behavior described in previous sections, the divergence of the
correlation length is not a power law, a consequence of the nonlinear nature of the recursion
relations in the vicinity of the fixed point.
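To appreciate how different this divergence is from a power law, one may tabulate $\ln(\xi/a) = \pi^2/8b\sqrt{T - T_c}$ for a few temperatures. In the sketch below the non-universal constant $b$, the cutoff $a$ and the function names are placeholders; the guard against overflow is a purely numerical convenience.

```python
import math

def log_xi_over_a(T, T_c, b):
    """ln(xi / a) = pi^2 / (8 b sqrt(T - T_c)) for T > T_c; infinite otherwise."""
    if T <= T_c:
        return math.inf
    return math.pi ** 2 / (8.0 * b * math.sqrt(T - T_c))

def correlation_length(T, T_c, b, a=1.0):
    """Correlation length on the high-temperature side of the transition."""
    log_xi = log_xi_over_a(T, T_c, b)
    if log_xi > 700.0:  # exp would overflow a double; treat as effectively infinite
        return math.inf
    return a * math.exp(log_xi)

if __name__ == "__main__":
    for dT in (0.16, 0.04, 0.01):
        print(dT, log_xi_over_a(1.0 + dT, 1.0, b=1.0), correlation_length(1.0 + dT, 1.0, b=1.0))
```

Reducing $T - T_c$ by a factor of four doubles $\ln\xi$, whereas for a power law divergence it would merely add a constant.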

Finally, vortices occur in bound pairs for distances smaller than $\xi$ while there can be an
excess of vortices of one sign or the other at larger separations. The interactions between
vortices at large distances can be obtained from the Debye-Hückel theory of polyelectrolytes.
According to this theory, the free charges screen each other, leading to a screened Coulomb
interaction $\exp(-x/\xi)\,C(x)$. On approaching the transition from the high-temperature side,
the singular part of the free energy

$$f_{\rm sing} \propto \xi^{-2} \propto \exp\left[-\frac{\pi^2}{4b\sqrt{T - T_c}}\right], \tag{8.49}$$

has only an essential singularity. All derivatives of this function are finite at $T_c$. Thus the
predicted heat capacity is quite smooth at the transition. Numerical results based on the
RG equations indicate a smooth maximum in the heat capacity at a temperature higher
than $T_c$ corresponding to the point at which the majority of dipoles unbind.


8.7 Summary and outlook 

This concludes our preliminary introduction to the theory of the “renormalization group.”
It should be emphasized that our approach to the subject was biased in that it introduced
the RG in a manner tailored to applications in quantum statistical physics; a more
particle-physics-oriented introduction (see Ryder$^{13}$) would have focused on the principal issues of
renormalizability, different regularization schemes, renormalization through counter-terms
and, of course, the many great triumphs renormalization group methods have had in QED
and QCD. However, even with regard to statistical/condensed matter physics, our discussion
has been far from complete (arguably the most serious omission being that of conformal
symmetries - see the Info block below). We trust, however, that the discussion was substantial
enough to convince the reader of the enormous power and versatility RG methods
have in disclosing “deep” physical information and to motivate further exploration of the
subject.

INFO While, for the most part, previous chapters were guided by symmetry principles, little
has been said so far about the role of symmetries in connection with renormalization. At the
same time it is quite obvious that, at a point of self-similarity, a physical system will have more
symmetries than the ubiquitous global scale invariance: it will, at least, also be rotationally
and translationally invariant. In fact, we expect these symmetries to be local symmetries, e.g.
provided the system is described by a short-range effective theory, a local rescaling of coordinates
$\mathbf r \to b(\mathbf r)\,\mathbf r$ should not cause any effect; a point on which you might like to reflect.

More generally, we expect self-similar systems to be invariant under general conformal mappings
$\mathbf r \to \mathbf r'(\mathbf r)$, i.e. mappings that leave angles between vectors locally invariant. (In a more
mathematical language, a mapping is conformal if it leaves the metric tensor $g_{\mu\nu}(\mathbf r)$ invariant, up
to an optional scale factor, i.e. $g_{\mu\nu}(\mathbf r') = \Omega(\mathbf r)\,g_{\mu\nu}(\mathbf r)$.) In general
dimensions, the three classes of transformations mentioned above, dilatations, rotations, and
translations, essentially$^{44}$ exhaust the list of conformal mappings. It turns out, however, that the
information gain due to the presence of these symmetries is rather restricted. However, a radically
different situation arises in $d = 2$. Introducing complex coordinates, $z = z^0 + iz^1$, it can be
shown$^{45}$ that every holomorphic mapping $z \to w(z)$ is automatically conformal. This means that
two-dimensional systems at a fixed point are invariant under a vast group of symmetry operations.
These symmetry conditions can be employed - the subject of conformal field theory - to obtain very
far-reaching information about the behavior of correlation functions. In fact, it is customary in
two-dimensional conformal field theory to characterize the behavior of a theory “indirectly,” by the
transformation characteristics of its correlation functions, rather than by specifying concrete microscopic
actions. At any rate, a discussion of the foundations and applications of conformal field theory,
and its ramifications in various branches of mathematical physics, is beyond the scope of the
present text. For a very accessible first introduction to the subject we refer to Cardy.$^{1}$ Readers
who want to learn more about the subject are referred to di Francesco et al.$^{46}$

In Section 8.6 we saw the way large-scale topological defects render the physical behavior of
a model, as seemingly innocuous as the two-dimensional XY-model, utterly non-trivial. Our
observations made in connection with the XY-model exemplify a very general phenomenon:
in physical systems admitting the formation of topologically stabilized excitations, a conspiracy
of large-scale geometry-related structures and local fluctuations leads to intriguing
physical effects. An introductory discussion of topological structures in quantum field theory,
and of their ramifications in concrete physical phenomena, is the subject of the next
chapter.


8.8 Problems 

8.8.1 Dissipative quantum tunneling: strong potential limit 

In Section 8.2, we considered a quantum particle subject to a periodic potential and a dissipative 
damping mechanism. A combination of perturbative methods and the renormalization group was applied 
to show that, for dissipation rates above a certain critical threshold, the external potential effectively 
grows. Eventually, however, the potential height will reach values at which a perturbative treatment is 
no longer valid. In this problem we explore the fate of the quantum particle in the high-potential limit. 
As a result, we obtain the upper portion of the flow diagram discussed in Section 8.2. 

Let us consider the action (8.11) in the limit where the potential strength c is large in 
comparison with the high-frequency cutoff A of the theory. To explicitly implement the 
condition tu n < A, we add a “kinetic energy term” /Skm[$] = SP f d T @ 2 to the action, where 
l is some fixed parameter of dimensionality [length]. The mass parameter is chosen such 


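The so-called special conformal transformation, $\mathbf{r} \to \mathbf{r}/r^2$, also belongs to the family of conformal mappings.

Under a change of coordinates, $z \to w$, the metric tensor changes according to $g_{\mu\nu} \to \frac{\partial z^\alpha}{\partial w^\mu}\frac{\partial z^\beta}{\partial w^\nu}\, g_{\alpha\beta}$. It is
straightforward to show (exercise) that the condition that the standard metric $g_{\mu\nu} = \delta_{\mu\nu}$ change only by a
global factor is equivalent to the Cauchy-Riemann equations $\frac{\partial w^1}{\partial z^1} = \frac{\partial w^2}{\partial z^2}$ and $\frac{\partial w^1}{\partial z^2} = -\frac{\partial w^2}{\partial z^1}$.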

P. di Francesco, P. Mathieu, and D. Senechal, Conformal Field Theory (Springer- Verlag, 1999). 
that at the maximum value of the frequency, $\omega_n = \Lambda$, the kinetic energy term becomes of
the same order as the dissipative term (i.e. for frequencies beyond $\Lambda$, the system is no longer
effectively described by Eq. (8.11)): $ml^2 \sim 1/\Lambda g$. We are thus led to consider the imaginary
time action

$$S[\theta] = \frac{1}{4\pi T g}\sum_n |\omega_n||\theta_n|^2 + \int d\tau\left[\frac{ml^2}{2}(\partial_\tau\theta)^2 + c\cos(\theta(\tau))\right],$$


describing a particle in a periodic potential and subject to a weak damping mechanism.
Assuming the damping to be weak, we begin by exploring the solutions to the equations
of motion of the undamped ($g = \infty$) problem. Varying the action, we obtain the equations
of motion $ml^2\partial_\tau^2\theta + c\sin(\theta) = 0$, i.e. the Newton equation of the mathematical pendulum.
Although these equations can be solved in closed form, there is actually no need to do so.
We need note only that extended periods where the classical particle rests at the maxima of
the potential are interrupted by occasional events where it rolls through one of the potential
minima (see the figure overleaf). Throughout, it will be convenient to characterize the profile
of a general trajectory $\theta$ in terms of its time-derivative $h = \partial_\tau\theta$ (see the bottom part of
the figure). For a function $\theta$ containing $n_+$ instanton events and $n_-$ anti-instanton events,
$h(\tau) = \sum_{i=1}^{n} e_i f(\tau - \tau_i)$, where $\tau_i$ is the time of the event, $e_i = +1$ ($-1$) for an instanton
(anti-instanton), $n = n_- + n_+$, and $f(\tau)$ is a function that is peaked around zero, has a
width ~ c and integrates to $2\pi$: $\int_{-\infty}^{\infty} d\tau\, f(\tau) = \theta(\infty) - \theta(-\infty) = 2\pi$.
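
The normalization of the event profile can be checked numerically. The sketch below assumes the standard kink of the undamped pendulum, $\theta(\tau) = \pi + 4\arctan(e^{\tau/w})$, whose derivative is $f(\tau) = (2/w)\,\mathrm{sech}(\tau/w)$; the profile, the width parameter `width`, and the function names are illustrative assumptions and are not taken from the text.

```python
import math

def instanton_profile(tau, width=1.0):
    """f = d(theta)/d(tau) for the assumed kink theta = pi + 4*atan(exp(tau/width))."""
    return 2.0 / (width * math.cosh(tau / width))

def integrate(func, lower, upper, steps=20000):
    """Composite trapezoidal rule on the interval [lower, upper]."""
    h = (upper - lower) / steps
    total = 0.5 * (func(lower) + func(upper))
    for k in range(1, steps):
        total += func(lower + k * h)
    return total * h

def winding(width=1.0, cutoff=40.0):
    """Integral of f over [-cutoff*width, cutoff*width]; approaches 2*pi."""
    return integrate(lambda t: instanton_profile(t, width),
                     -cutoff * width, cutoff * width)
```

Within this assumed profile, the integral is independent of the width, as it must be for a quantity that only counts the net change of $\theta$ across one event.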


(a) Show that the action of a general instanton trajectory is given by

$$S[\theta] = \frac{1}{4\pi T g}\sum_m\sum_{i,j} e_i e_j\, e^{i\omega_m(\tau_i - \tau_j)}\,\frac{|f_m|^2}{|\omega_m|} + nS_{\rm inst},$$

where $S_{\rm inst}$ is the action of a single instanton event.

(b) Apply a Hubbard-Stratonovich transformation to bring the partition function to the
form

$$Z = \sum_n \sum_{e_i = \pm 1} \frac{e^{-nS_{\rm inst}}}{n!}\int Dq\; e^{-\frac{g}{4\pi T}\sum_m |\omega_m||q_m|^2}\,\prod_{i=1}^{n}\int d\tau_i\; e^{i e_i q(\tau_i)},$$

where we have neglected the contribution from the fluctuation determinant. (Hint: Use
the fact that, in the frequency range of interest, $f_m \simeq f_{m=0} = 2\pi T$.)
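
As a reminder of the mechanics behind this step, the Hubbard-Stratonovich decoupling can be organized mode by mode with the standard Gaussian identity below. The shorthand $A_m$ for the coefficient multiplying $|h_m|^2$ and the unnormalized measure $d^2q_m$ are notational assumptions introduced here; factors that depend on the Fourier convention are not tracked.

```latex
% Single complex mode h_m, positive coefficient A_m:
e^{-A_m |h_m|^2}
  = \frac{1}{\pi A_m}\int d^2 q_m\,
    \exp\!\Big[-\frac{|q_m|^2}{A_m}
      + i\big(\bar q_m h_m + q_m \bar h_m\big)\Big].
% In the instanton action A_m \propto 1/(g|\omega_m|), so the auxiliary
% field q_m carries a weight \propto g\,|\omega_m|\,|q_m|^2:
% the dissipation strength enters the dual description inverted.
```

The identity follows from two real Gaussian integrals, one each for the real and imaginary parts of $q_m$.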

(c) Sum over all configurations of the “charges” $e_i$ to obtain

$$Z = \int Dq\,\exp\left[-\frac{g}{4\pi T}\sum_m |q_m|^2|\omega_m| + \gamma\int d\tau\,\cos(q(\tau))\right], \qquad (8.50)$$

where $\gamma = 2e^{-S_{\rm inst}}$. Intriguingly, we have arrived at a functional integral with an action
structurally equivalent to our starting point (8.11). The “only” difference is a change in
the coupling constants, $g \to g_{\rm dual} = 1/g$ and $c \to \gamma = 2e^{-S_{\rm inst}}$. In words: very large values
of the potential c map onto a very small potential in the “dual problem”; dissipation
strengths $g > 1$ map into $g_{\rm dual} < 1$ and vice versa. Use this information to convince
yourself of the validity of the upper portion of the flow diagram discussed in Section 8.2.
What we have achieved by the construction above is a duality transformation: a field theory
subject to a strong potential (more generally, a field theory at strong interactions, or
“strong coupling”) has been mapped onto a weakly perturbed field theory (a theory at “weak
coupling”). Duality transformations “boost” theories from one corner of their phase diagram
into another and, therefore, establish links between sectors of the problem at hand that
cannot be linked by perturbative means. This explains why the identification of its dual
partners represents an important step in the analysis of any field theory. The problem above
is special in that it is self-dual, i.e. the theory obtained after the transformation is
structurally equivalent to the starting theory.
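
The coupling-constant map of the duality can be written down in a few lines. This is a minimal sketch that reads the transformation as $g \to 1/g$ and $c \to \gamma = 2e^{-S_{\rm inst}}$; the function name `dual_couplings` and the idea of passing the single-instanton action as a plain number are assumptions made for illustration.

```python
import math

def dual_couplings(g, s_inst):
    """Return (g_dual, gamma) for dissipation strength g and instanton action s_inst."""
    if g <= 0:
        raise ValueError("dissipation strength g must be positive")
    # Assumed reading of the duality map: g -> 1/g, c -> 2*exp(-S_inst).
    return 1.0 / g, 2.0 * math.exp(-s_inst)
```

Applying the map twice to the dissipation strength returns the original value, which is the numerical counterpart of the statement that the problem is self-dual in g.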

Answer: 



(a) The result is obtained by substitution of $\theta_m = -\frac{i}{\omega_m} f_m \sum_i e_i e^{i\omega_m\tau_i}$ into the action.

(b) Substituting $q(\tau) = \sum_m e^{i\omega_m\tau} q_m$ into the Hubbard-Stratonovich transformed
representation and performing the Gaussian integral over $q_m$, we arrive at

$$Z = \sum_n \sum_{\{e_i = \pm 1\}} \frac{1}{n!}\prod_{i=1}^{n}\int d\tau_i\; e^{-S[\theta]},$$

i.e. the action of the previous line, integrated over all instanton coordinates and all
“charge configurations” $\{e_i = \pm 1\}$.

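(c) Starting from the representation derived in (b) and reorganizing terms, we obtain

$$Z = \int Dq\; e^{-\frac{g}{4\pi T}\sum_m |\omega_m||q_m|^2}\sum_n \frac{e^{-nS_{\rm inst}}}{n!}\prod_{i=1}^{n}\sum_{e_i = \pm 1}\int d\tau_i\; e^{i e_i q(\tau_i)} = \int Dq\; e^{-\frac{g}{4\pi T}\sum_m |\omega_m||q_m|^2}\sum_n \frac{1}{n!}\left[\gamma\int d\tau\,\cos(q(\tau))\right]^n.$$

Resumming the exponential series, we obtain Eq. (8.50).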


8.8.2 Quantum criticality 

So far, we have focused predominantly on scaling and critical phenomena associated with a thermally 
induced phase transition. However, as we have seen in previous chapters, the quantum system affords 
the possibility of a phase transition driven by some external parameter even in the zero-temperature 
system. For example, a transition from the superconducting to the normal phase of a metal can be 
driven by an external magnetic field. Similarly, the transition between a paramagnetic and a ferro-
or anti-ferromagnetic itinerant electron phase can be driven by changing the strength of the effective
interaction (through, e.g., external pressure). In both cases, the application of the external parameter 
leads to a substantial rearrangement of the ground state - a quantum phase transition. If this 
is continuous, as with a thermally induced transition, one can expect the appearance of characteristic 
critical phenomena analogous to, but potentially different from, the corresponding classical critical 
theory. The aim of the present problem is to explore the nature of the critical phenomena when the 
critical point is driven to zero temperature - a quantum critical point. 

Motivated by the coherent state formulation of the quantum partition function of an interacting
system, it is tempting to associate directly the quantum statistical mechanics of a
d-dimensional system with the classical statistical mechanics of a (d + 1)-dimensional system -
the time coordinate would seem to just add one more dimension (albeit of finite extent
$\beta$). However, in most applications, time and space enter the action in an anisotropic manner.
As a result, one must consider scalings along the time and space directions separately.
This provides for a rich variety of new phenomena associated with the quantum critical 
point. Although a general theoretical phenomenology of the quantum critical system was 
formulated by Hertz 47 soon after the development of the renormalization group method, 
it is only relatively recently that experimentalists have been able to access and explore in 
detail the quantum critical regime. Of these experimental developments, perhaps the most 
systematic has been in the exploration of metallic magnetism in the wide class of heavy 
fermion compounds. With this motivation in mind, let us then recall briefly our discussion 
of the Stoner transition in Problem 6.7. 

INFO Without going into detail, we note that heavy fermions are compounds containing rare-earth
elements such as Ce or Yb, or actinide elements such as U (examples including UBe13,
CeCu2Si2, and many more). Their inner shell conduction electrons often have effective masses
several hundred times as large as that of bare electrons. As a consequence, the Fermi energy
in heavy fermion materials is anomalously low (exercise). At low temperatures, many of these
materials are magnetically ordered, others show strong paramagnetic behavior, and some display
unconventional mechanisms of superconductivity.

Starting with the quantum partition function of the lattice Hubbard system, a field theory 
of the Stoner transition was derived. Specifically, it was shown that, after an appropriate 
rescaling of space and time, the quantum partition function of the interacting electron 
system could be presented as a functional field integral over a scalar magnetization field 
m(x), weighted by the Euclidean time action 48 

$$S[m] = \frac{T}{2L^d}\sum_q\left(\delta + q^2 + \frac{|\omega_n|}{\Gamma_q}\right)|m_q|^2 + \frac{u}{4}\int dx\, m^4(x).$$

In the quantum critical system, the parameter $\delta$ measures the distance from the critical
point while the Landau damping factor $\Gamma_q = \Gamma|q|$ for the ferromagnetic transition and


J. A. Hertz, Quantum critical phenomena, Phys. Rev. B 14 (1976), 1165-84. 

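Here, to facilitate the RG, it is helpful to adopt the Fourier convention $m(x) = \frac{T}{L^d}\sum_q m_q\, e^{i(\mathbf{q}\cdot\mathbf{r} - \omega\tau)}$,
$m_q = \int dx\, m(x)\, e^{-i(\mathbf{q}\cdot\mathbf{r} - \omega\tau)}$, where $\int dx \equiv \int d^dr\, d\tau$.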
$\Gamma_q = \Gamma$, a constant, in the antiferromagnetic case. Finally, to regularize the theory, one
may suppose that the momentum integration is cut off at the lattice scale $\Lambda = 1/a$. More
generally, in the absence of an external magnetic field, one may consider a generalization
from the scalar to a vectorial field m. However, keeping our discussion simple, we focus on a
system with uniaxial symmetry (such as a metamagnetic system 49) where the field is scalar
(uniaxial).

At the level of Landau mean-field theory, the magnetization field acquires a non-zero
expectation value when $\delta < 0$. In the classical system, our analysis of the $\phi^4$-theory (to
which the present action collapses at high temperatures) has shown that, in dimensions
$d > d_u = 4$, the critical system is controlled by the Gaussian fixed point while in dimensions
$1 < d < d_u$ the critical properties are controlled by a new fixed point. Here, in the
low-temperature system, one can expect the critical properties to be influenced by quantum
dynamical fluctuations. Drawing on the seminal work of Hertz and, later, Millis, 50 our
aim here is to explore the ramifications of scaling in the quantum critical system.


(a) As a warm-up exercise, show that, up to a temperature-independent constant, the free
energy of the Gaussian theory (i.e. $\delta > 0$, $u = 0$) takes the form


F Gauss = L d 


(Hint: Functional integration over the Gaussian field fluctuations leads to a formally
divergent expression. To extract the finite $\delta$-dependent contribution above, first differentiate,
$\partial_\delta F$, to later regain the $\delta$-sensitive part of F by integration, $\int d\delta\, \partial_\delta F$.)

(b) Although, as we have seen above, the free energy can be obtained directly from the functional
integral, to guide our analysis of the non-Gaussian theory it is helpful to develop
the RG on the Gaussian model first. Using the standard coarse-graining procedure,

$$m(q) = \begin{cases} m_s(q), & 0 < |q| < \Lambda/b,\\ m_f(q), & \Lambda/b < |q| < \Lambda,\end{cases}$$

show that the scaling exponent $y_\delta = 2$ and, from the temperature, identify the value of
the dynamical exponent z for the ferromagnetic and antiferromagnetic models. Focusing
on the Gaussian fixed point, show that the RG implies the scaling dimension $y_u = 4 - d - z$
from which one obtains the upper critical dimension $d_u = 4 - z$.

(c) To accommodate the effect of a general interaction U, one may make use of the pertur- 
bative RG expansion 


Z = 



49 As derived in Problem 6.7, the Stoner transition takes place from a paramagnetic to a ferromagnetic phase. 
However, experimentally, one often finds that, while the paramagnetic phase is stable at zero external magnetic 
field, the magnetization can change abruptly at some non-zero field - a metamagnetic transition. The develop- 
ment of a Landau expansion of the action in the vicinity of the metamagnetic critical point leads to a theory 
with uniaxial symmetry. 

50 A. J. Millis, Effect of a non-zero temperature on quantum critical points in itinerant fermion systems, Phys. 
Rev. B 48 (1993), 7183-96. 
where $U[m_s, m_f] = \frac{3u}{2}\int dx\, m_s^2 m_f^2$ and $\langle\cdots\rangle_f = \int Dm_f\, e^{-S^{(2)}[m_f]}(\cdots)$ denotes averaging
over the quadratic action of the fast field. Keeping terms to first order in the quartic
interaction, 51 obtain the RG equations

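$$\frac{dT(b)}{d\ln b} = zT(b),$$
$$\frac{d\delta(b)}{d\ln b} = 2\delta(b) + 12u(b)f^{(2)}[T(b)],$$
$$\frac{du(b)}{d\ln b} = (4 - d - z)u(b),$$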

where 

/( 2 )[T(6)] = A d n d r A f 1 - 
J o 7r 

denotes a dimensionless integral function of the temperature and $\Omega_d$ is the surface of
the d-dimensional unit ball divided by $(2\pi)^d$. (Hint: Notice that, for a given value of the
cutoff $\Lambda$, the integration domain in frequency-momentum space is set by $|\omega| < \Gamma_q$, $|q| < \Lambda$.
This means that the “shell” of fast fluctuations obtained by a gradual lowering of the
cutoff is of the form shown in the figure. Introduce the integration boundaries imposed
on the fast-field integration after you have analytically continued to real frequencies.)

Already, from these differential recursion relations, one can recover much of the
phenomenology of the quantum critical system. Integrating the last of the recursion relations,
one finds that for $d + z > 4$, the interaction $u(b) = ub^{4-(d+z)}$ initially decreases under
renormalization. Therefore, over at least some range of parameters, one may expect the theory
to be controlled by the Gaussian fixed point $T^* = r^* = 0$. However, it is evident from the
first of the scaling relations that the temperature increases under renormalization scaling as
$T(b) = Tb^z$. Crucially, referring to the expression for $f^{(2)}[T(b)]$ (and, indeed, its
higher-order descendants) - setting $\Lambda = \Gamma_\Lambda = 1$ for simplicity - one may see that the
character of the renormalization changes when $T(b) \sim 1$ (i.e. when $b \sim b_0 = T^{-1/z}$). Here,
the effect of the confinement in the Euclidean time direction begins to impact. Whether this
scaling regime is reached depends on the physical bare parameters.

To stay within the scaling regime, we require $\delta(b) < 1$. Therefore, defining $b_1$: $\delta(b_1) = 1$,
one may identify two regimes of behavior. Firstly, in a low-temperature regime where
$1 \ll b_1 \ll b_0$, the system is characterized by quantum critical behavior controlled by the T = 0
51 Comparing with the RG analysis of the standard $\phi^4$-model in Section 8.4.4, consider which fluctuation effects
are kept at this level of approximation and which are not.




= zT(b), 

= 26(b) + 12u(b)fW[T(b)], 
= (4 — d — z)u(b), 
fixed point. Here, approximating $f^{(2)}[T(b)] \simeq f^{(2)}[0]$, an integration of the scaling equation
gives

$$\delta(b) = b^2\left[r + \frac{12uf^{(2)}(0)}{z + d - 2}\right],$$

the renormalization of $\delta$ recording the effect of quantum dynamical and spatial fluctuations
on the critical point. Referred through $b_1$, this implies a “quantum critical regime” bounded
by the inequality $T \ll r^{z/2}$.
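
The zero-temperature flow can be checked numerically. The sketch below integrates $d\delta/d\ln b = 2\delta + 12uf^{(2)}$ and $du/d\ln b = (4-d-z)u$ with $f^{(2)}$ frozen at a constant value, and compares the result with the exact solution of this linear system, $\delta(b) = b^2[\delta + 12uf^{(2)}(1 - b^{2-d-z})/(d+z-2)]$, which reduces to the expression above at large b. The numerical value of `f2`, the starting couplings, and all function names are illustrative assumptions.

```python
import math

def flow_rhs(delta, u, d, z, f2):
    """Right-hand sides of d(delta)/dl and du/dl, with l = ln b and f2 constant."""
    return 2.0 * delta + 12.0 * u * f2, (4.0 - d - z) * u

def integrate_flow(delta0, u0, d, z, f2, l_max, steps=2000):
    """Fourth-order Runge-Kutta integration of the flow from l = 0 to l = l_max."""
    h = l_max / steps
    delta, u = delta0, u0
    for _ in range(steps):
        k1d, k1u = flow_rhs(delta, u, d, z, f2)
        k2d, k2u = flow_rhs(delta + 0.5 * h * k1d, u + 0.5 * h * k1u, d, z, f2)
        k3d, k3u = flow_rhs(delta + 0.5 * h * k2d, u + 0.5 * h * k2u, d, z, f2)
        k4d, k4u = flow_rhs(delta + h * k3d, u + h * k3u, d, z, f2)
        delta += h * (k1d + 2.0 * k2d + 2.0 * k3d + k4d) / 6.0
        u += h * (k1u + 2.0 * k2u + 2.0 * k3u + k4u) / 6.0
    return delta, u

def delta_closed_form(delta0, u0, d, z, f2, l):
    """Exact solution of the same linear flow, valid for d + z != 2."""
    b = math.exp(l)
    return b**2 * (delta0 + 12.0 * u0 * f2 * (1.0 - b**(2.0 - d - z)) / (d + z - 2.0))
```

For $d + z > 2$ the term $b^{2-d-z}$ dies out quickly, so the fluctuation correction to $\delta$ saturates at a constant shift while the overall factor $b^2$ keeps growing.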

However, if this inequality is reversed (i.e. $b_1 > b_0$), the scaling behavior must be separated
into regimes where $b < b_0$ ($T(b) < 1$) and $b > b_0$ ($T(b) > 1$). Noting that, for $T(b) \gg 1$,
$f^{(2)}[T(b)] \propto T(b)$, the recursion relations may be simplified (and, indeed, their nature
elucidated) by defining $v(b) = u(b)T(b)$, whence they take a more familiar form,

$$\frac{d\delta(b)}{d\ln b} = 2\delta(b) + Cv(b), \qquad \frac{dv(b)}{d\ln b} = (4 - d)v(b),$$

with C relating the constant of proportionality. These are nothing but the differential recursion
relations associated with the leading order of perturbation theory of the d-dimensional
classical $\phi^4$-theory with the familiar upper critical dimension $d_u = 4$. Integrating these
equations from $b_0$ to $b > b_0$, we obtain




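$$\delta(b) = \left(\frac{b}{b_0}\right)^2\left\{\delta(b_0) + \frac{Cv(b_0)}{2 - d}\left[\left(\frac{b}{b_0}\right)^{2-d} - 1\right]\right\}, \qquad v(b) = v(b_0)\left(\frac{b}{b_0}\right)^{4-d}.$$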


To determine the initial values $(\delta, v)(b_0)$, we need to integrate the low-temperature ($b < b_0$,
or $T(b) < 1$) scaling relations up to $b_0$. Setting $f^{(2)}(Te^{zx}) = f^{(2)}(0) + [f^{(2)}(Te^{zx}) - f^{(2)}(0)]$,
the constant $f^{(2)}(0)$ recovers the result above while the term in rectangular brackets effects
a temperature-dependent correction such that

$$\delta(b_0) = T^{-2/z}\left[r + BuT^{(d+z-2)/z}\right], \qquad v(b_0) = uT^{(d+z-4)/z},$$

where $B = \frac{12}{z}\int_0^1 dT\, T^{(2-d-2z)/z}\left[f^{(2)}(T) - f^{(2)}(0)\right]$.

We now have to distinguish between two very different scenarios. (i) If the scaling is halted
($\delta(b) = 1$) while $v(b)$ remains small, the perturbative expansion remains valid. However,
(ii) if $v(b)$ grows to order unity while $\delta(b)$ remains small, the scaling behavior moves into a
region potentially controlled by non-Gaussian fluctuations. From the equations above, the
condition for Gaussian behavior, i.e. $\delta(b) = 1$ and $v(b) \ll 1$, translates to the Ginzburg
criterion

uT (d+ z ~ 3)/z 

From the same result, one can infer a transition temperature

$$T_c(r < 0, u) = \left[\frac{|r|}{(B + C)u}\right]^{z/(d+z-2)}.$$


Notice that at the transition temperature we are always in regime (ii), i.e. a regime governed 
by strong non-Gaussian fluctuations. Gathered together, the results of this analysis are 
summarized in Fig. 8.13. 
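
To make the shape of the phase boundary concrete, the sketch below evaluates the power law governing the transition line, reading the transition temperature as $T_c = [|r|/((B+C)u)]^{z/(d+z-2)}$. The function names and the default value of the combined constant `b_plus_c` are placeholders chosen for illustration, not values taken from the text.

```python
from fractions import Fraction

def tc_exponent(d, z):
    """Exponent z/(d + z - 2) controlling T_c as a function of |r|."""
    return Fraction(z, d + z - 2)

def transition_temperature(r, u, d, z, b_plus_c=1.0):
    """T_c for r < 0; returns 0.0 on the disordered side r >= 0."""
    if r >= 0:
        return 0.0
    return (abs(r) / (b_plus_c * u)) ** float(tc_exponent(d, z))
```

In three dimensions this gives $T_c \propto |r|^{3/4}$ for the ferromagnetic case (z = 3) and $T_c \propto |r|^{2/3}$ for the antiferromagnetic case (z = 2).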
Figure 8.13 Phase diagram associated with a quantum critical point. The solid line denotes the
locus of the transition temperature while the hatched area denotes the Ginzburg region where the 
scaling behavior is controlled by critical fluctuations. 

INFO Although this completes our formal discussion of the renormalization properties of the 
quantum critical theory, it is instructive to question how these results can be translated to 
experiment. From the scaling behavior, one can infer directly a number of physical manifesta- 
tions. Of these, the most straightforward relates to the temperature dependence of the magnetic 
susceptibility. Focusing on the classical Gaussian regime, in the disordered phase, we have seen
that the mass scales as $\delta(b_0) = T^{-2/z}[r + BuT^{(d+z-2)/z}]$. The latter translates to a zero-field
susceptibility

$$\chi^{-1}(T) = \chi_0^{-1}\left(r + BuT^{(d+z-2)/z}\right).$$

Once the bare temperature crosses into the disordered quantum regime, one expects the
susceptibility to become temperature-independent.

Now, in the clean system, magnetic fluctuations present the dominant mechanism of low- 
temperature scattering. In this case, one can estimate their effect on the low-temperature resistiv- 
ity. Referring to the literature, an estimate of the scattering rate based on the Born approximation 
gives 

T = -to 

where g denotes the coupling between the electrons and spin fluctuations. When the spin
fluctuations dominate, one finds a resistivity $\rho(T) \sim T^{3-(d+z-2)/z}$. In the opposite regime,
the contribution from electron-electron scattering dominates and one recovers the Fermi-liquid
result $\rho(T) \sim T^2$.
Answer:


(a) For the Gaussian theory, integration over the fields m(q) and subsequent differentiation
with respect to $\delta$ gives the free energy density


[ ddq TV 1 -L d j 

f ddq Pi 

l' rq dui tanh(/3e/2) 

/ (2 ir) d j^ S + q 2 + \^A J 

(27T) dP J 

_ r 2m S + q 2 — 


L d f dd Q p f rq duJ tanh(ffe/2) u/T q 

J (2n) d J 0 7T (6 + q 2 ) 2 + (w/r g ) 2 ‘ 


To arrive at the first integral, we have converted the sum over positive frequencies $\omega_n$
into a contour integral running parallel to the real axis, surrounding the pole at $\omega_m = 0$
along an infinitesimal semicircle in the upper complex half plane, and closing it at
infinity through the upper complex half plane as usual. Noting that the infinitesimal
integral around zero cancels against the contribution $\omega_m = 0$, the integral reduces to the
principal value integral given above. (The integration is restricted to $|\omega| < \Gamma_q$ because
outside this interval the low-frequency approximation of the polarization operator no
longer applies.) Using the relation that
$\partial_\delta \tan^{-1}\left(\frac{\omega/\Gamma_q}{\delta + q^2}\right) = -\frac{\omega/\Gamma_q}{(\delta + q^2)^2 + (\omega/\Gamma_q)^2}$,
we arrive at the required result.

(b) Since, at the level of the Gaussian theory, fast and slow fluctuations decouple, the 
effective action of the slow fields is given by 

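$$S[m_s] = \frac{1}{2}\int^{\Lambda/b}\frac{d^dq}{(2\pi)^d}\; T\sum_{\omega_n}\left(\delta + q^2 + \frac{|\omega_n|}{\Gamma_q}\right)|m_s(q)|^2,$$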


i.e. the original action with lowered cutoffs. (The presence of a cutoff in the frequency
summation is implicit in the notation.) To restore the cutoffs to their original values, one
must implement the rescaling $q' = qb$, $\omega_n' = b^z\omega_n$, where z denotes the dynamical exponent.
Notice that the frequency scaling is tantamount to a redefinition of temperature,
$T' = b^zT$. The final step of the RG involves the renormalization of the magnetization
field, $m'(q') = m_s(q)/y$. As a result, the action assumes the form


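$$S[m'] = \frac{1}{2}\int^{\Lambda}\frac{d^dq'}{(2\pi)^d}\; T'\sum_{\omega_n'}\left(\delta + \frac{q'^2}{b^2} + \frac{|\omega_n'|\, b^{-z}}{\Gamma_{q'/b}}\right) b^{-(d+z)}y^2\,|m'(q')|^2.$$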


Requiring that the gradient term remain invariant, one finds that $y = b^{(d+z+2)/2}$.
Further, requiring that the damping term maintain the same form ensures that
$(1/\Gamma_{q'/b})\, b^{-z}\, b^{-(d+z)}y^2 = 1/\Gamma_{q'}$, fixing the dynamical exponent. In particular, for the
ferromagnetic system ($\Gamma_q = \Gamma|q|$), z = 3 while, for the antiferromagnetic system
($\Gamma_q = \Gamma$), z = 2. Finally, applied to the coefficient $\delta$, invariance of the action requires
$\delta' = y^2b^{-(d+z)}\delta = b^{y_\delta}\delta$ with $y_\delta = 2$. As expected, the Gaussian fixed point lies at
$T^* = \delta^* = 0$ and has the fixed point action
where $\Gamma_\Lambda$ sets the cutoff in frequency direction.

To identify the upper critical dimension, one must consider the effect of the quartic
interactions. Firstly, applied to the gradient component of the fixed action $\int dx\,(\nabla m)^2$,
the relation $m'(x') = m_s(x)/\eta$ allows us to deduce the field renormalization
$\eta = b^{(2-d-z)/2}$. Applied to the quartic interaction, the renormalization procedure leads to
the relation

$$\frac{u}{4}\int dx\, m_s^4(x) \to \frac{u\, b^{d+z}\eta^4}{4}\int dx'\, m'^4(x'),$$

from which one obtains $u' = b^{4-d-z}u$. Therefore, in dimensions $d + z > 4$ the interaction
coefficient diminishes under renormalization and the properties of the system are
governed by the mean-field exponents. Since the dynamical exponent z = 2 or 3, one
finds that the majority of physical systems fall above the upper critical dimension and
are controlled by the Gaussian fixed point.
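
The Gaussian power counting of this answer can be collected in a few helper functions. This is a minimal sketch: the parametrization of the damping as $\Gamma_q \propto |q|^a$ with an integer `damping_power` a (1 for the ferromagnet, 0 for the antiferromagnet) and all function names are assumptions introduced for illustration.

```python
from fractions import Fraction

def dynamical_exponent(damping_power):
    """z for Gamma_q ~ |q|**damping_power, from b**(2 - z + damping_power) = 1."""
    return 2 + damping_power

def field_rescaling_exponent(d, z):
    """Exponent of b in y = b**((d + z + 2)/2), fixed by the gradient term."""
    return Fraction(d + z + 2, 2)

def y_delta(d, z):
    """Scaling dimension of delta: delta' = y**2 * b**(-(d + z)) * delta."""
    return 2 * field_rescaling_exponent(d, z) - (d + z)

def y_u(d, z):
    """Scaling dimension of the quartic coupling at the Gaussian fixed point."""
    return 4 - d - z

def upper_critical_dimension(z):
    """Spatial dimension at which y_u changes sign."""
    return 4 - z
```

A negative value of `y_u(d, z)` signals an irrelevant interaction, which is the case for both the ferromagnetic and the antiferromagnetic model in d = 3.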

(c) The first term in the u-expansion, $(3u/2)\int dx\, m_s^2\langle m_f^2\rangle$, gives rise to a renormalization of
the coefficient $\delta$. Evaluating this contribution, and expressing the frequency summation
by a contour integral as in (a), we obtain

$$\langle m_f^2\rangle = \int_{d\Lambda}\frac{d^dq\, d\omega}{\pi(2\pi)^d}\,\coth\left(\frac{\omega}{2T}\right)\frac{\omega/\Gamma_q}{(\delta + q^2)^2 + (\omega/\Gamma_q)^2},$$

where the integral runs over the momentum shell $d\Lambda$ (see the figure on page 480). Taking
each component of the integral in turn, one obtains


(m f ) = ^A- - 
1 

+ - 


A d_1 fL 


L duj 


coth 


w/r A 



(V - ^ coth (—) 

1 (lx)* 

V q b z ) V 2 / 




2 ) («5 + A2)2 + ( w /r A )2 

1 

(5 + q 2 ) 2 + 1 ’ 


Noting that $\Lambda(1 - e^{-\ln b}) \simeq \Lambda\ln b + \cdots$ and $\Lambda(1 - e^{-z\ln b}) \simeq \Lambda z\ln b + \cdots$, the latter
can be expressed as $\langle m_f^2\rangle = f^{(2)}[T(b)]\ln b$, where, applying the rescaling $s = \omega/\Gamma_\Lambda$ or
$s = |q|/\Lambda$ as appropriate, one obtains the required expression for $f^{(2)}[T(b)]$. Therefore,
to first order in u, one obtains the scaling relation $\delta' = \delta(1 + 2\ln b) + 12uf^{(2)}[T(b)]\ln b$. At
first order in perturbation theory, the coupling constant u is not modified by fluctuation-induced
corrections, i.e. u changes according to its engineering dimension, $u' = u(1 + (4 - d - z)\ln b)$.
Similarly, $T' = b^zT$. Differentiating these expressions, we arrive at the
required differential recursion relations.


8.8.3 RG analysis of the nonlinear $\sigma$-model II

Previously, we have argued that, below the lower critical dimension, Goldstone mode fluctuations become
unbounded and induce a crossover of systems with spontaneously broken continuous symmetries into a
“disordered” state. Technically, the long-distance fluctuation behavior of Goldstone modes is described
by nonlinear $\sigma$-models with coset-space valued fields. In this problem, we apply RG methods to show
that, at and below the dimension two, these models indeed flow towards a disordered phase.
Consider a situation where the symmetry of a model is broken from a “large” compact group
G down to some subgroup H. On physical grounds, we expect that (i) the action of the
associated Goldstone modes $g \in G/H$ is space(-time) rotationally invariant and starts at
O(two derivative operators). By symmetry (Exercise: Think why!) (ii) the action must be
invariant under global transformations by elements of the full symmetry group, $S[g] = S[\tilde g g]$,
where $\tilde g \in G$ is constant. However, (iii) it must also be invariant under local transformations
$S[g] = S[gh]$, where $h(x) \in H$ is a field taking values in the “small” group H. (This last
condition simply states that g takes values in the (right) coset space G/H, i.e. is oblivious
to (right) transformations $g \to gh$.)

How would one construct such an action in practice? An obvious recipe goes as follows.
Take any reference ground state $|n_0\rangle$. (Think of $|n_0\rangle$ as an element of some vector space,
e.g. a “spin up” state in a magnetic system.) Defining the symmetry transformed states
$|n(x)\rangle = g(x)|n_0\rangle$, the unique G-invariant action is then given by $S[n] = c\int d^dx\,\langle\nabla n|\nabla n\rangle$.
This action obviously meets all three criteria above. However, by making explicit reference
to an arbitrarily chosen state ($|n_0\rangle$) it partially obscures the symmetries of the model.
Thus, before turning to the RG formulation, let us briefly discuss an alternative, and more
symmetry-oriented, formulation.

Let $\Lambda \in G$ be a group element that commutes with all $h \in H$, i.e. $h\Lambda h^{-1} = \Lambda$. 52 The
set $\{Q \equiv g\Lambda g^{-1} \,|\, g \in G\}$ is isomorphic to the coset space G/H (which is to say that two
elements g and $g' = gh$ differing by an element of the small group are represented by the
same Q). In the Q-language, the unique action meeting the criteria (i)-(iii) above (exercise:
check!) is given by

$$S[Q] = \frac{1}{\lambda}\int d^dr\,\mathrm{tr}(\nabla Q\nabla Q).$$

Below, we will explore the long-range behavior of the field integral $\int DQ\,\exp(-S[Q])$, where
DQ is the G invariant measure ($DQ = D(gQg^{-1})$) on the coset space.

INFO They being but different representations of the same coset space G/H it is, of
course, always possible to explicitly relate the n- to the Q-representation. Consider, for example,
the case G = SU(2), and H = U(1). Representing the group G through Euler angles
(see Section 3.3), $g(\phi, \theta, \psi) = \exp(i\phi S_3)\exp(i\theta S_1)\exp(i\psi S_3)$, define the (spin-coherent) states
$|n(\phi, \theta)\rangle = g(\phi, \theta)|\!\uparrow\rangle$, where $|\!\uparrow\rangle$ is the state of maximal eigenvalue $S_3 = S$, and we noted that
the $\psi$ degree of freedom (spanning the small group) leaves $|\!\uparrow\rangle$ invariant (up to a phase) and
can be omitted. In Section 3.3 we used this representation to identify the phase space of spin as
the coset space SU(2)/U(1), the 2-sphere. In the Q language, the same sphere is represented as
$Q = g\sigma_3g^{-1}$, where $S_i = \sigma_i/2$ are identified with the Pauli matrices. Again, the $\psi$ factor,
commutative with $\sigma_3$, drops out. The integration measure $dQ = d\phi\, d\theta\,\sin\theta$ reduces to the invariant
integration measure on the sphere.


52 It being a subgroup of the (matrix-)group G, we can always find a representation wherein the group H is
represented by block-diagonal matrices. In this representation, $\Lambda$ will be a diagonal matrix that contains identical
elements on all sub-blocks defining H.
(a) By way of example, let us consider the case (G, H) = (SU(2), U(1)), relevant to, e.g.,
the classical Heisenberg magnet. 53 Defining $g = g_sg_f$ where the fast field $g_f = \exp(iW)$,
$W = zE_{12} + \bar zE_{21}$, $z \in \mathbb{C}$ (explore the connection between this representation and
the Euler angle representation above), and $\Phi_\mu = g_s^{-1}\partial_\mu g_s$, show that the freedom to
transform $g_f$ by elements of the small group can be used to bring $\Phi_\mu$ to an off-diagonal
form, $[\Phi_\mu, \sigma_3]_+ = 0$. Show that the expansion of the action to second order (→ one-loop
order) in the fast generators W is given by $S[g_s, W] = S[g_s] + S_f[z, \bar z] + S_{fs}[g_s, z, \bar z]$,
where $S_f[z, \bar z] = \frac{8}{\lambda}\int d^dr\,\nabla\bar z\nabla z$ and

$$S_{fs}[g_s, z, \bar z] = -\frac{8}{\lambda}\int d^dr\,\mathrm{tr}\left(\Phi_\mu(zE_{12} - \bar zE_{21})\Phi_\mu(zE_{12} - \bar zE_{21}) + \Phi_\mu\sigma_3\Phi_\mu\sigma_3\, z\bar z\right).$$


(b) Integrate over the fields $z, \bar z$ to obtain the effective action $S[g_s] \simeq \frac{1}{\lambda_r}\int d^dr\,\mathrm{tr}(\nabla Q_s\nabla Q_s)$,
where $\lambda_r$ is the renormalized coupling constant. Differentiate this constant with respect
to the scaling parameter b to obtain the RG equation of the $\sigma$-model on the 2-sphere,

$$\frac{d\ln\lambda}{d\ln b} = -\epsilon + \frac{\lambda}{8\pi}, \qquad (8.51)$$

where $\epsilon = d - 2$. According to this equation, the coset $\sigma$-model behaves very much like its
group-valued cousin explored in Section 8.5: at dimensions d = 2 and below it flows towards a
disordered phase; at dimensions d > 2 there is a phase transition separating a disordered
regime from an ordered phase.


Answer: 


(a) Substituting $g = g_sg_f$ into the action and defining the shorthand notation $\Phi = g_s^{-1}\nabla g_s$,
we obtain $S[g_s, g_f] = \frac{1}{\lambda}\int d^dr\,\mathrm{tr}(\nabla_\Phi Q_f\nabla_\Phi Q_f)$, where $Q_f = g_f\sigma_3g_f^{-1}$ and the “covariant
derivative” $\nabla_\Phi Q = \nabla Q + [\Phi, Q]$. We next transform the integration variables
$W \to hWh^{-1}$, where $h \in H$. (Notice that this really is a transformation of integration
variables, i.e. the off-diagonal form of the generators is preserved.) This transformation
induces a change $\Phi \to h^{-1}\Phi h + h^{-1}\nabla h$ and we may choose h so as to remove the
matrix-diagonal components of $\Phi$. (Exercise: Explicitly construct a matrix h that does the job.
Why can the matrix h be defined only locally and why does the absence of a globally
defined h not matter in connection with a perturbative RG operation?)


53 Below the transition temperature, the symmetry of the Heisenberg magnet is broken from full rotational
symmetry, O(3), to rotations around the magnetization axis, O(2). Since O(3) is locally isomorphic to SU(2),
and O(2) ≃ U(1), the example above covers the physics of the magnetic system. (We prefer to use the
language of the unitary group, as it is somewhat easier to implement, technically.)
A straightforward expansion to second order in W, $Q_f = \sigma_3(1 - 2iW - 2W^2 + \cdots)$,
now leads to the expressions

$$S_f[W] = \frac{4}{\lambda}\int d^dr\,\mathrm{tr}(\partial_\mu W\partial_\mu W),$$
$$S_{fs}[g_s, W] = -\frac{8}{\lambda}\int d^dr\,\mathrm{tr}\left(\Phi_\mu W\sigma_3\Phi_\mu W\sigma_3 + \Phi_\mu\sigma_3\Phi_\mu\sigma_3W^2\right). \qquad (8.52)$$

Substituting the z-representation of the generators W, we arrive at the action given
above.

(b) Every field $\Phi_\mu$ contains a derivative acting on the slow field. Since we do not want to keep
terms with more than two derivatives, the effective action is given by $S_{\rm eff}[g_s] = S[g_s] +
\langle S_{fs}[g_s, z, \bar z]\rangle$. Switching to a momentum representation, $S_f[z, \bar z] = \frac{8}{\lambda}\sum_p \bar z_pz_p\,p^2$,
and doing the Gaussian integrals, we obtain

$$\langle S_{fs}[g_s, z, \bar z]\rangle = -\frac{I_1}{4}\int d^dr\,\mathrm{tr}([\Phi_\mu, \sigma_3]^2) = -\frac{I_1}{4}\int d^dr\,\mathrm{tr}(\nabla Q_s\nabla Q_s),$$

where the constant $I_1$ has been defined in Eq. (8.28). Substituting this result into the
effective action, we obtain

$$S_{\rm eff} = \left(\frac{1}{\lambda} - \frac{I_1}{4}\right)\int d^dr\,\mathrm{tr}(\nabla Q_s\nabla Q_s) = \frac{1}{\lambda_r}\int d^dr\,\mathrm{tr}(\nabla Q_s\nabla Q_s),$$

where we have rescaled momenta and $\lambda_r^{-1} = [\lambda^{-1} - \Omega_d(1 - b^{2-d})/4(d - 2)]\,b^{d-2}$. Substituting
$d = 2 + \epsilon$ and expanding in $\epsilon$, we obtain $\lambda_r = \lambda\left(1 - \epsilon\ln b + \frac{\lambda}{8\pi}\ln b\right)$. Differentiating
with respect to ln b, we obtain the RG equation (8.51).
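
The content of the one-loop flow can be explored numerically. The sketch below integrates $d\lambda/d\ln b = -\epsilon\lambda + \lambda^2/8\pi$ with $\epsilon = d - 2$, which is the flow implied by the expansion of $\lambda_r$ above; the function names, step count, and starting values are illustrative assumptions.

```python
import math

def beta(lam, eps):
    """One-loop flow d(lambda)/d(ln b) = -eps*lambda + lambda**2/(8*pi), eps = d - 2."""
    return -eps * lam + lam * lam / (8.0 * math.pi)

def run_coupling(lam0, eps, l_max, steps=4000):
    """Integrate the flow from l = 0 to l = l_max with fourth-order Runge-Kutta."""
    h = l_max / steps
    lam = lam0
    for _ in range(steps):
        k1 = beta(lam, eps)
        k2 = beta(lam + 0.5 * h * k1, eps)
        k3 = beta(lam + 0.5 * h * k2, eps)
        k4 = beta(lam + h * k3, eps)
        lam += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return lam

def fixed_point(eps):
    """Non-trivial zero of the one-loop flow, lambda* = 8*pi*eps (for eps > 0)."""
    return 8.0 * math.pi * eps
```

In d = 2 every positive starting value grows under the flow (towards the disordered, strong-coupling regime), whereas for d > 2 couplings below `fixed_point(eps)` shrink and couplings above it grow, which is the numerical signature of the phase transition mentioned in the problem.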


8.8.4 Scaling theory of the Anderson metal insulator transition 

In Section 6.5 we derived the field theory of the disordered electron gas. Specifically, we showed that 
the low-energy physics of the system is governed by a phase of a spontaneously broken continuous 
symmetry. This being so we should expect d = 2 to be the “lower critical dimension” of the disordered
electron gas. At and below d = 2 Goldstone mode fluctuations are likely to drive a nominally conducting 
system into a phase governed by unbounded Goldstone mode fluctuations - an insulator in fact. In 
higher dimensions the effect of fluctuations is comparatively weaker and metallic phases have a chance 
to prevail. In this problem we apply RG methods to obtain a semi-quantitative understanding of the 
Anderson transition between these two phases. 

INFO Before turning to the technology, let us discuss the notion of Anderson localization 
in metals on a more qualitative level. Firstly, it is important to draw a distinction between 
qualitatively different mechanisms of localization. Consider a d-dimensional electron system in 
the presence of static disorder. Evidently, the lowest energy states in a disordered system will 
be dictated by “optimal fluctuations” of the random impurity potential. These “tail states” can 
be interpreted as bound states localized in regions where the potential is unusually low and 
uniform. Instead, here we are interested in the localization properties of states with energies
greatly in excess of the typical magnitude of the impurity potential, i.e. $E\tau \gg 1$. Here, the
mechanism of localization can derive only from the accumulation of interference corrections akin
to the corrections discussed in Section 6.5 above.

Anderson proposed 54 that, depending on its energy E and on the degree of disorder, a given state
is either extended or localized. Following an argument due to Mott, it is also plausible
that extended and localized states at the same energy do not coexist - in such circumstances, the 
admixture of the former due to arbitrarily small perturbations would lead to the delocalization 
of the latter. Moreover, it is natural that states at the band edge are more susceptible to the for- 
mation of bound or localized states. These considerations suggest a profile for the DoS in which 
the low-lying localized states are separated from higher-energy extended states by a mobility 
edge. The transition signaled by the mobility edge is known as the Anderson transition. 

Following the pioneering work of Anderson, it was almost two decades before a consistent 
theory of localization emerged. Its basis was a simple yet powerful scaling hypothesis that followed 
naturally from a conjecture made by Edwards and Thouless: 55 the conductance $g(L)$ of a d-
dimensional fragment of metal of size $L^d$ ($L \gg \lambda_F$) is determined by the sensitivity of the
single-particle energy levels inside the sample to changes in the boundary conditions. Building
on this assumption, Abrahams et al. proposed 56 a one-parameter scaling hypothesis of
localization. This hypothesis states that the conductance of a system of size $bL$ is a function
of $g(L)$ only: $g(bL) = f_b(g(L))$. The important point is that $f_b$ does not explicitly depend on
the system size L. Independent of Thouless’ argument, this restriction can be made plausible by
dimensional analysis: since g is dimensionless, the only way for the parameter L to appear in
$f_b$ is through a dimensionless ratio L/a, where a is some compensating scale of dimensionality
[length]. This, however, would imply explicit dependence of the conductance on a microscopic 
reference scale, which conflicts with the presumption of universal scaling behavior. 

Differentiation of the scaling form with respect to b gives the Gell-Mann-Low equation
$d\ln g / d\ln L = \beta(g)$. The scaling function $\beta(g) = f'(g)/g$ is universal, i.e. independent of
the microscopic properties of the sample, such as the bare microscopic conductance and
$\ell/\lambda_F$. While these predictions
still have to find a truly rigorous mathematical proof, they definitely represent an important 
milestone in phenomenology. Moreover, the scaling picture fits neatly into perturbative micro- 
scopic approaches to localization, such as the one discussed below. However, before turning to 
quantitative calculations, let us try to determine the profile of the $\beta$-function on qualitative
grounds. 

For an ohmic conductor, $g(L) \sim L^{d-2}$, implying that the scaling function takes a constant
value $\beta = d - 2$. On the other hand, deep in the insulating regime, where states are exponentially
localized, $\beta(g) \sim \ln g$ (i.e. $g \sim e^{-L/\xi}$). A smooth interpolation between these limits (see Fig. 8.14)
suggests that, in two dimensions and below, all states are localized, while, above, there is a critical
conductance, $g_*$, marking a phase transition between a localized and a delocalized phase - the
Anderson transition.
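
The qualitative argument above can be illustrated numerically. The sketch below is not taken from the text: it assumes a toy interpolation, $\beta(g) = (d-2) - \ln(1 + c/g)$, chosen only because it reproduces the two limits quoted above ($\beta \to d-2$ for large g and $\beta \sim \ln g$ for small g). The constant c and the function names are hypothetical. Under this assumption, a zero of $\beta$ (an unstable fixed point $g_*$) exists only for d > 2.

```python
import math

def beta(g, d, c=1.0):
    """Toy scaling function (an assumption, not the microscopic result):
    beta -> d - 2 - c/g for g >> c and beta -> d - 2 + ln(g/c) for g << c."""
    return (d - 2) - math.log1p(c / g)

def critical_conductance(d, c=1.0):
    """Zero of the toy beta function; it exists only above two dimensions."""
    if d <= 2:
        return None
    return c / math.expm1(d - 2)

def flow(g0, d, ln_b=3.0, steps=3000, c=1.0):
    """Integrate d ln g / d ln L = beta(g) over a scale change ln_b (explicit Euler)."""
    u = math.log(g0)
    h = ln_b / steps
    for _ in range(steps):
        u += h * beta(math.exp(u), d, c)
    return math.exp(u)

if __name__ == "__main__":
    g_star = critical_conductance(3)
    print("d = 3 fixed point of the toy model:", g_star)
    print("start above g*:", flow(2 * g_star, 3), " start below g*:", flow(0.5 * g_star, 3))
    print("d = 2, start at g = 10:", flow(10.0, 2))
```

In this toy model, conductances starting above $g_*$ in d = 3 grow under rescaling (metal) while those below shrink (insulator); in d = 1 and d = 2 every starting value flows towards smaller g.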

At the time when this scaling behavior was proposed, it was not yet known that the low-energy 
theory of the disordered electron gas was a nonlinear $\sigma$-model. However, with the benefit of
hindsight, one may interpret the phenomenology outlined above as a ramification of the notorious 


54 P. W. Anderson, Absence of diffusion in certain random lattices, Phys. Rev. 109 (1958), 1492-505.

55 J. T. Edwards and D. J. Thouless, Numerical studies of localization in disordered systems, J. Phys. C 5 (1972),
807-20.

56 E. Abrahams, P. W. Anderson, D. C. Licciardello, and T. V. Ramakrishnan, Scaling theory of localization:
absence of quantum diffusion in two dimensions, Phys. Rev. Lett. 42 (1979), 673-6.

Figure 8.14 Scaling function of the dimensionless conductance, $\beta(g)$, in dimensions d = 1, 2, and 3.
At large values of the conductance, the scaling function approaches the asymptote $\beta(g) \to d - 2$
corresponding to ohmic behavior. For very small values of conductance, the scaling function
approaches $\beta(g) \to \ln g$, which is characteristic of insulating behavior. According to the weak
localization expansion, a localization transition is predicted in dimensions greater than two.

scaling behavior displayed by these models. Below, we show that the one-loop $\beta$-function of the
replica $\sigma$-model fully complies with the predictions of the one-parameter scaling theory.

(a) Recapitulate the construction of the replica nonlinear $\sigma$-model in Section 6.5. Convince
yourself that, in order to explore the physical properties of a non-interacting disordered
system, we actually do not need the full information contained in that model; the
model introduced in Section 6.5 has been constructed to compute correlation functions
$\langle G_n G_{n'} \rangle$ for arbitrary Matsubara frequencies $\omega_n$ and $\omega_{n'}$. However, sufficiently
far-reaching information on the conduction behavior of a non-interacting system (cf. our
discussion of linear response in Section 7.4) is contained in correlation functions
$\sim \langle G^+ G^- \rangle$ involving the product of a retarded and an advanced Green function,
$G^\pm = (E_F \pm (\omega/2 + i0) - H)^{-1}$, at fixed energies $E_F \pm \omega/2$. Retracing the derivation of the
$\sigma$-model, convince yourself that correlation functions of this type can be obtained from
the action Eq. (6.63), where the Q-matrices, $Q = g\Lambda g^{-1}$, are of dimension 4R (instead
of 2RM as in the Matsubara case) and $\Lambda = \{\sigma_3\,\delta^{\alpha\alpha'}\}$ assumes the role of the matrix
$\Lambda = \{\mathrm{sgn}(n)\,\delta_{nn'}\,\delta^{\alpha\alpha'}\}$, i.e. the effectively infinite-dimensional space of Matsubara frequencies has
been reduced to a two-component space discriminating between retarded and advanced
Green functions. (Here, $\alpha = (a, \sigma)$ are 2R-component indices comprising the replica
index a and the two-component “time-reversal” index $\sigma$ introduced in Section 6.5.)

(b) Focusing on the case of zero external frequency and introducing the abbreviation A -1 = 

we wish to explore the scaling behavior of the action 

S[Q] = \\ d d rtv(VQf. 

To do so, we follow the same logic as in the previous problem: we decompose $Q = g_s Q_f g_s^{-1}$
into fast and slow components and expand the former, $Q_f = \sigma_3(1 + 2iW - 2W^2 + \cdots)$.
The matrices W anti-commute with $\sigma_3$, and live in the Lie algebra sp(4R), i.e.
they obey the condition $W = -\sigma_2^{\rm tr} W^T \sigma_2^{\rm tr}$, where the Pauli matrices act in time-reversal
space. These two conditions are encapsulated in the representation (block structure in
advanced/retarded space)

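$$W = \begin{pmatrix} & B \\ B^\dagger & \end{pmatrix}, \qquad B^\dagger = -\sigma_2^{\rm tr} B^T \sigma_2^{\rm tr} \equiv -B^\tau, \qquad (8.53)$$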

where the matrices B are of dimension 2R and we have used the abbreviated notation
$B^\tau \equiv \sigma_2^{\rm tr} B^T \sigma_2^{\rm tr}$. Substituting this representation into the action, we obtain
$S[g_s, W] = S[g_s] + S_f[W] + S_{fs}[g_s, W]$, where $S_f$ and $S_{fs}$ are given by Eq. (8.52).

Perform the Gaussian integrals over the matrices W. First show that {B pa gB^, B , a ,) = 
— a °e/<fp2 q ’ q ' ■ Use this result, and the symmetries of the B matrices, to verify the aux- 
iliary identities 

tr(B p ABl,A’) ,|^_tr(T)tr(A'), (8.54) 

tt(B p AB p ,A') = ^^tr (AA IT ). (8.55) 

Use these relations to show that (S'[5 , s ,fU]) = — ^ / d d rtr(VQ) 2 , where I\ has been 
defined in Eq. (8.28). (Hint: Use the fact that = —<!>,, (exercise). Identify and neglect 
terms that vanish in the replica limit.) Adding this contribution to the core action 
S[g s ] we obtain the same renormalized coupling constant as in the previous problem. 
Thus, the RG equation of the $\sigma$-model of disordered systems (possessing spin
rotation and time reversal invariance, to be precise) is again given by Eq. (8.51). In $2 + \epsilon$
dimensions, the constant $\lambda \propto (D\nu)^{-1} \propto g^{-1}$ is inversely proportional to the physical conductance
and Eq. (8.51) describes the scaling of the latter. In accord with the phenomenological
expectations summarized above, the scaling equation depends only on the conductance
itself (one-parameter scaling) and predicts an (Anderson) transition point for d > 2. For
further discussion of the scaling behavior of the replica $\sigma$-model and its ramifications in
the physics of disordered systems we refer to, e.g., Efetov. 57

Answer: 

(a) Consider the Grassmann action $\int \bar\psi\,(E_F + (\omega + i0)\sigma_3/2 - H)\,\psi$, where $\psi = \{\psi^{\pm a}\}$ is a
2R-component Grassmann field and $\sigma_3$ acts in the space of ±-indices. By introducing
suitably devised sources, this action can be employed to compute correlation functions
involving an advanced and a retarded Green function. Recapitulating the derivation of
the nonlinear $\sigma$-model, we notice that nothing changes except that the infinite vector
of Matsubara frequencies $\omega_n$ gets replaced by the two-dimensional matrix $(\omega + i0)\sigma_3/2$.
Since the structure of the diagonal saddle-point, $\Lambda$, is solely determined by the sign of the


57 K. B. Efetov, Supersymmetry in Disorder and Chaos (Cambridge University Press, 1996).

imaginary part of the frequency vector, $\Lambda = \{\mathrm{sgn}(n)\,\delta_{nn'}\,\delta^{\alpha\alpha'}\}$ gets replaced by $\Lambda = \{\sigma_3\,\delta^{\alpha\alpha'}\}$.
The Q matrices reduce to $Q = g\Lambda g^{-1}$, where $g \in \mathrm{Sp}(4R)$.

(b) Switching to an index representation, S[B,B^] = Ylaa' From this rep- 

resentation, one might naively conclude that (BqaffB^ g, a ,) = gI )i q2 d ao > Spp' <5 q . q ' with 
a right-hand side twice as large as the correct result. However, thanks to the relation 
B = — a^B T G 2 , the matrix elements of B and are not independent complex vari- 
ables. Using the symmetry relation (exercise), it is straightforward to show that all 
complex integration variables appear in pairs, i.e. the coupling constant effectively dou- 
bles. On the same basis, some matrix elements (Baa.a's, where a = (a + 1 mod2) turn 
out to be real rather than complex. In either case, the Gaussian integration obtains a 
result twice as small as for independent complex variables. This explains the factor 1/16 
in the given formula. 

The auxiliary identity (8.54) readily follows by representing the trace as a sum 
over indices and using the Gaussian formula for the matrix indices above. To prove 
Eq. (8.55), we first need to relate the (BB) contraction to a (BB^) contraction: 


(tr(B p AB_ p ,A')) 


{trC(*?A'B p A*?Xo?B- P '<$$ 

{{o$A% M B pPl (A*$%^B_ p ,af)s a ) 

-((afA'UB^iAafUBl^) = 

-ie = jdv tl(AA ‘ T) - 


Using these contraction rules to compute the second term in Eq. (8.52), (tr($ M f73$ M cr3 
(BB^Eu +B^ BE 22 ))) oc tr(l) a R, we obtain an expression that vanishes in the replica 
limit. As for the first term, noting that = (g~ 1 d f j,g s ) = d p glg~ lT = d p g~ 1 g s = — 
we obtain 


~\J d d r (tr^lUaa^fUas)) = -| j d d r (tr(<t> fl (BE i2 - B*E 21 ) 
x <t> fl (BE V2 - B'E 21 ))) 

► 4 J d^r^^BE^BE^ =hj d d r tr(^ 2 (^) 21 ) 

= 4/ d d r tr([^,^||^,:<t3i). 


Here, the >” indicates that we are omitting terms that lead to traces of <h i! = 0. 
We also note that, by virtue of the time reversal relation, contractions ~ (BB) and 
~ (B^Bt) lead to identical results. Expressing 4> ;j in terms of the original variables Q s , 
we arrive at the formula above.

8.8.5 Kondo effect: poor man's scaling


In Problem 2.4 we saw that the low-energy properties of a metal, when it is coupled to a magnetic 
Anderson impurity, are encapsulated by an effective sd-Hamiltonian Eq. (2.51) involving itinerant carriers 
coupled to the spin of the magnetic impurity by an exchange coupling J. Applying a perturbative 
analysis, in Problem 5.5 we saw that the effect of the magnetic impurity fluctuations was to impart a 
contribution to the scattering rate which diverged at low temperatures - a phenomenon compatible with 
the observed increase in low-temperature resistivity. However, it being perturbative in character, little 
could be said about the effect of the magnetic impurity in the strong coupling regime. In the following, 
we will implement a perturbative RG scheme to monitor the flow of the coupling constant and explore 
the low-temperature properties of the Kondo impurity system. 

Our starting point is the sd-Hamiltonian where, anticipating the capacity for anisotropy, 
we treat the bare components of the exchange coupling as independent parameters, 

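$$H_{sd} = \sum_{k\sigma} \epsilon_k c^\dagger_{k\sigma} c_{k\sigma} + \sum_{kk'} \left[ J_z S^z \left( c^\dagger_{k\uparrow} c_{k'\uparrow} - c^\dagger_{k\downarrow} c_{k'\downarrow} \right) + J_+ S^+ c^\dagger_{k\downarrow} c_{k'\uparrow} + J_- S^- c^\dagger_{k\uparrow} c_{k'\downarrow} \right],$$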

where S denotes the spin of the local moment formed by the d-electron system. 

In the following, we will implement an adaptation of the RG procedure which monitors 
the effect of high-energy (fast) degrees of freedom on the low-energy (slow). Our analysis 
is taken from the original paper by Anderson. 58 (For a discussion of the generalization of 
the RG scheme and a much more lengthy discussion of the subject as a whole, we refer to 
the review by Hewson. 59 ) To this end, let us divide the conduction band into high-lying 
electron/hole states that occupy an energy shell $D/b < |\epsilon_k| < D$, where D denotes the bare
bandwidth and b > 1, and the remaining states $0 < |\epsilon_k| < D/b$. To eliminate the high-lying
excitations that fall within the band edge, we may implement a general procedure
that mirrors the one exploited in the construction of the sd-Hamiltonian from the Anderson
model (Problem 2.4). Specifically, let us write the total wavefunction of the Hamiltonian
$|\psi\rangle$ as the sum of terms $|\psi_0\rangle$, $|\psi_1\rangle$, and $|\psi_2\rangle$, where $|\psi_1\rangle$ describes the component in which
there are no conduction electrons/holes in the upper/lower band edge, $|\psi_0\rangle$ has at least one
hole in the lower band edge, and $|\psi_2\rangle$ has at least one electron in the upper band edge.
Following the discussion in Problem 2.4, eliminating $|\psi_0\rangle$ and $|\psi_2\rangle$ from the Schrödinger
equation $\sum_{n=0}^{2} H_{mn} |\psi_n\rangle = E |\psi_m\rangle$, one obtains

$$\left[ H_{10} \frac{1}{E - H_{00}} H_{01} + H_{11} + H_{12} \frac{1}{E - H_{22}} H_{21} \right] |\psi_1\rangle = E |\psi_1\rangle.$$

(a) Taking into account the process that scatters a conduction electron into the upper band
edge, show that the term

$$H_{12} \frac{1}{E - H_{22}} H_{21} |\psi_1\rangle$$

is associated with eight possible contributions, which can be grouped in pairs. (You may
find it helpful to enumerate these possibilities diagrammatically.) In particular, show
that there exists a contribution

$$J_+ J_- \sum_{k_s' k_s k_f} S^- c^\dagger_{k_s'\uparrow} c_{k_f\downarrow} \frac{1}{E - H_{22}} S^+ c^\dagger_{k_f\downarrow} c_{k_s\uparrow},$$

where the wavevectors $k_f$ ($k_s$) index states which lie within (outside) the band edge.

58 P. W. Anderson, A poor man’s derivation of scaling laws for the Kondo problem, J. Phys. C 3 (1970), 2436-41.
59 A. C. Hewson, The Kondo Problem to Heavy Fermions (Cambridge University Press, 1993).

(b) Focusing on this contribution alone (for now), rearrange the operators to show that it
takes the form

$$J_+ J_- \sum_{k_s' k_s} \nu_0 D (1 - 1/b)\, S^- S^+ c^\dagger_{k_s'\uparrow} c_{k_s\uparrow} \frac{1}{E - D + \epsilon_{k_s} - H_0},$$

where $\nu_0$ denotes the (assumed constant) density of states at the band edge and
$H_0 = \sum_{k\sigma} \epsilon_k c^\dagger_{k\sigma} c_{k\sigma}$ is the single-particle Hamiltonian of the band electrons. Now, measuring
the energy with respect to the ground state of the non-interacting electron gas, $H_0$ may
be approximated by zero. Finally, noting that, for spin S = 1/2, $S^- S^+ = 1/2 - S^z$, the
contribution takes the form

$$J_+ J_- \sum_{k_s' k_s} \nu_0 D (1 - 1/b)(1/2 - S^z)\, c^\dagger_{k_s'\uparrow} c_{k_s\uparrow} \frac{1}{E - D + \epsilon_{k_s}}.$$

We leave it as an unanswered part of the exercise to confirm that the parallel contribution
from the process in which a hole is created in the lower band edge leads to the expression

$$J_+ J_- \sum_{k_s' k_s} \nu_0 D (1 - 1/b)(1/2 + S^z)\, c_{k_s'\downarrow} c^\dagger_{k_s\downarrow} \frac{1}{E - D - \epsilon_{k_s}}.$$


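(Try to use arguments based on particle/hole symmetry rather than developing a first
principles analysis.) Following a similar procedure, one may confirm that the second
class of spin conserving terms lead to the contributions

$$\frac{J_z^2}{4} \sum_{k_s' k_s \sigma} \nu_0 D (1 - 1/b)\, c^\dagger_{k_s'\sigma} c_{k_s\sigma} \frac{1}{E - D + \epsilon_{k_s}}, \qquad \frac{J_z^2}{4} \sum_{k_s' k_s \sigma} \nu_0 D (1 - 1/b)\, c_{k_s'\sigma} c^\dagger_{k_s\sigma} \frac{1}{E - D - \epsilon_{k_s}}.$$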

This completes the analysis of contributions to the effective Hamiltonian for $|\psi_1\rangle$
which leave the electron and impurity spin unchanged. The four remaining contributions 
involve a spin flip. Following a similar procedure to the one outlined above, one may 
identify two further contributions 

• cd - , 

k'k s s 

^ £ -odo - ms^ 


where we have used the identity $S^z S^+ = S^+/2$. One may confirm that the corresponding
terms with the order reversed generate an equal contribution.

When combined with terms from a spin-reversed process, altogether one finds that
the components of the exchange constant become renormalized according to

$$J_\pm(b) = J_\pm - J_z J_\pm \nu_0 D (1 - 1/b) \left( \frac{1}{E - D + \epsilon_{k_s}} + \frac{1}{E - D - \epsilon_{k_s}} \right),$$

$$J_z(b) = J_z - J_+ J_- \nu_0 D (1 - 1/b) \left( \frac{1}{E - D + \epsilon_{k_s}} + \frac{1}{E - D - \epsilon_{k_s}} \right).$$

Now, since we are interested in the renormalization of the low-energy properties, we
may ignore E against D. Moreover, since the typical internal excitation energy $\epsilon_{k_s} \ll D$,
we may approximate these equations by the differential recursion relations

$$\frac{dJ_\pm}{d\ell} = 2\nu_0 J_z J_\pm, \qquad \frac{dJ_z}{d\ell} = 2\nu_0 J_\pm^2,$$

where, as usual, $b = e^\ell$.

(c) Starting with these differential recursion relations, show that the scaling trajectory obeys
the relation

$$J_z^2 - J_\pm^2 = \mathrm{const.}$$

One may note that, on renormalization, the coupling constant $J_z$ always increases.
Therefore, in a ferromagnetic model, $J_z < 0$, with bare coupling constants such that
$|J_\pm| \le |J_z|$, $J_\pm$ will scale to zero. Conversely, when the bare constants obey $|J_\pm| > |J_z|$
(or whenever $J_z > 0$ and $J_\pm \neq 0$), both $J_z$ and $J_\pm$ flow to strong coupling, where the
perturbation theory breaks down.

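For $J_z = J_\pm \equiv J > 0$, an integration of the scaling equation $dJ/d\ell = 2\nu_0 J^2$ gives

$$\frac{1}{J} - \frac{1}{J(\ell)} = 2\nu_0 \ell = 2\nu_0 \ln\frac{D}{D(\ell)}, \qquad D(\ell) = D e^{-\ell}.$$

Therefore, we may set

$$D \exp\left[ -\frac{1}{2\nu_0 J} \right] = D(\ell) \exp\left[ -\frac{1}{2\nu_0 J(\ell)} \right] \equiv T_K,$$

where $T_K$ denotes the Kondo temperature.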
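
The invariance of the Kondo scale under the flow can be checked numerically. The sketch below is illustrative only: it integrates the isotropic equation $dJ/d\ell = 2\nu_0 J^2$ with a hand-written fourth-order Runge-Kutta step and compares $D \exp[-1/(2\nu_0 J)]$ before and after the rescaling $D \to D e^{-\ell}$. The function names and the parameter values are assumptions made for the example.

```python
import math

def rk4_step(f, y, h):
    """One classical Runge-Kutta step for the autonomous equation dy/dl = f(y)."""
    k1 = f(y)
    k2 = f(y + 0.5 * h * k1)
    k3 = f(y + 0.5 * h * k2)
    k4 = f(y + h * k3)
    return y + h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0

def run_coupling(J0, nu0, ell, steps=2000):
    """Integrate dJ/dl = 2 nu0 J^2 from l = 0 to l = ell (valid for ell < 1/(2 nu0 J0))."""
    f = lambda J: 2.0 * nu0 * J * J
    J, h = J0, ell / steps
    for _ in range(steps):
        J = rk4_step(f, J, h)
    return J

def kondo_scale(D, J, nu0):
    """The combination D exp[-1/(2 nu0 J)] that the flow is expected to conserve."""
    return D * math.exp(-1.0 / (2.0 * nu0 * J))

if __name__ == "__main__":
    D0, J0, nu0, ell = 1.0, 0.1, 1.0, 3.0          # illustrative values
    J_ell = run_coupling(J0, nu0, ell)
    print("J(l):", J_ell, " closed form:", J0 / (1.0 - 2.0 * nu0 * J0 * ell))
    print("T_K from bare parameters:   ", kondo_scale(D0, J0, nu0))
    print("T_K from running parameters:", kondo_scale(D0 * math.exp(-ell), J_ell, nu0))
```

The running coupling diverges at $\ell = 1/(2\nu_0 J)$, i.e. when the running bandwidth $D(\ell)$ reaches $T_K$, which is where the perturbative scheme stops being reliable.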


Answer: 


(a) Focusing on terms which scatter conduction electrons into the band edge, one may
identify

$$H_{21} = \sum_{k_s, k_f} \left[ J_z S^z \left( c^\dagger_{k_f\uparrow} c_{k_s\uparrow} - c^\dagger_{k_f\downarrow} c_{k_s\downarrow} \right) + J_+ S^+ c^\dagger_{k_f\downarrow} c_{k_s\uparrow} + J_- S^- c^\dagger_{k_f\uparrow} c_{k_s\downarrow} \right],$$

where the wavevectors $k_f$ span the range of single-particle band energies $\epsilon_k$ within the
band edge while the components $k_s$ span the range of lower energies. Now, to evaluate
the contribution of the term $H_{12}(E - H_{22})^{-1} H_{21}$ to the effective Hamiltonian for $|\psi_1\rangle$
we must identify all processes that are closed. Since the action of the Hamiltonian must
return us to the subspace in which the band edge states remain unoccupied, processes
involved in a virtual occupation of the band edge are constrained by the conservation
of spin. From $H_{12} H_{21}$, one may identify eight allowed processes,

(1) $J_- S^- c^\dagger_{k_s'\uparrow} c_{k_f\downarrow} \times J_+ S^+ c^\dagger_{k_f\downarrow} c_{k_s\uparrow}$,

(2) $-J_z S^z c^\dagger_{k_s'\downarrow} c_{k_f\downarrow} \times J_+ S^+ c^\dagger_{k_f\downarrow} c_{k_s\uparrow}$,

(3) $J_z S^z c^\dagger_{k_s'\uparrow} c_{k_f\uparrow} \times J_- S^- c^\dagger_{k_f\uparrow} c_{k_s\downarrow}$,

(4) $J_z S^z c^\dagger_{k_s'\uparrow} c_{k_f\uparrow} \times J_z S^z c^\dagger_{k_f\uparrow} c_{k_s\uparrow}$,

and the four counterparts related by symmetry. Of the four kinds of process, terms
(2) and (3) involve a spin-flip process while terms (1) and (4) preserve the electron
spin orientation. Each of these terms can be represented diagrammatically. When combined
with the energy denominator, the particular contribution specified in the question
corresponds to the first process (1).

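(b) Since the band edge occupancy of the reference state $|\psi_1\rangle$ is zero, one can set

$$\sum_{k_f} c^\dagger_{k_s'\uparrow} c_{k_f\downarrow} \frac{1}{E - H_{22}} c^\dagger_{k_f\downarrow} c_{k_s\uparrow} = \nu_0 D (1 - 1/b)\, c^\dagger_{k_s'\uparrow} c_{k_s\uparrow} \frac{1}{E - D + \epsilon_{k_s} - H_0},$$

where $H_0$ denotes the single-particle Hamiltonian of the band electrons,
$\sum_{k_f} = \int_{D/b}^{D} d\epsilon\, \nu(\epsilon) \simeq \nu_0 D (1 - 1/b)$, and we have set $\epsilon_{k_f} \simeq D$. As a result, one obtains the
required formula.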

(c) Dividing the differential recursion relations, one obtains 

$$\frac{dJ_\pm}{dJ_z} = \frac{J_z}{J_\pm}.$$

Integrating this equation, one obtains the required scaling trajectory. 
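
As a numerical cross-check of the scaling trajectory, the sketch below integrates the anisotropic flow and monitors $J_z^2 - J_\pm^2$. It assumes the standard poor man's scaling form of the recursion relations, $dJ_z/d\ell = 2\nu_0 J_\pm^2$ and $dJ_\pm/d\ell = 2\nu_0 J_z J_\pm$, which reduces to the isotropic equation $dJ/d\ell = 2\nu_0 J^2$ quoted in the problem. The function name, the integrator and the parameter values are choices made for this illustration.

```python
def flow_anisotropic(Jz, Jpm, nu0, ell, steps=20000):
    """Integrate dJz/dl = 2 nu0 Jpm^2, dJpm/dl = 2 nu0 Jz Jpm with classical RK4."""
    def rhs(jz, jp):
        return 2.0 * nu0 * jp * jp, 2.0 * nu0 * jz * jp

    h = ell / steps
    for _ in range(steps):
        a1, b1 = rhs(Jz, Jpm)
        a2, b2 = rhs(Jz + 0.5 * h * a1, Jpm + 0.5 * h * b1)
        a3, b3 = rhs(Jz + 0.5 * h * a2, Jpm + 0.5 * h * b2)
        a4, b4 = rhs(Jz + h * a3, Jpm + h * b3)
        Jz += h * (a1 + 2.0 * a2 + 2.0 * a3 + a4) / 6.0
        Jpm += h * (b1 + 2.0 * b2 + 2.0 * b3 + b4) / 6.0
    return Jz, Jpm

if __name__ == "__main__":
    # Ferromagnetic Jz with |Jpm| < |Jz|: Jpm is expected to scale to zero.
    jz, jp = flow_anisotropic(-0.2, 0.1, 1.0, 50.0)
    print("ferromagnetic:", jz, jp, "invariant:", jz * jz - jp * jp)
    # |Jpm| > |Jz|: both couplings grow; stop before the divergence of the flow.
    jz, jp = flow_anisotropic(0.1, 0.2, 1.0, 2.5)
    print("strong coupling:", jz, jp, "invariant:", jz * jz - jp * jp)
```

In the first run the flow ends on the line of fixed points $J_\pm = 0$ at $J_z = -\sqrt{J_z^2 - J_\pm^2}$ (evaluated with the bare values), while in the second both couplings grow along the same hyperbola.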
Topology


In this chapter we discuss low-energy theories with non-trivial forms of long-range order. We learn
how to detect the presence of topologically non-trivial structures, and to understand their physical
consequences. Topological terms ($\theta$-terms, Wess-Zumino terms, and Chern-Simons terms) are introduced
as contributions to the action, affecting the behavior of low-energy field theories through the topology 
of the underlying field configurations. Applications discussed in this chapter include persistent currents, 
quantum spin chains, and the quantum Hall effects. 


In the preceding chapters we encountered a wide range of long-range orders, or, to put it 
more technically, different types of mean-fields. Reflecting the feature of (average) transla- 
tional invariance, the large majority of these mean-fields turned out to be spatially homo- 
geneous. However, there have also been a number of exceptions: under certain conditions, 
mean-field configurations with long-range 1 spatial textures are likely to form. One mecha- 
nism driving the formation of inhomogeneities is the perpetual competition of energy and 
entropy: being in conflict with the (average) translational invariance of extended systems, a 
spatially non-uniform mean-field is energetically costly. On the other hand, this very “dis- 
advantage” implies a state of lowered degree of order, or higher entropy. (Remember, for 
example, instanton formation in the quantum double well: although energetically unfavor- 
able, once it has been created it can occur at any “time,” which brings about a huge entropic 
factor.) It then depends on the spatio-temporal extension of the system whether or not an 
entropic proliferation of mean-field textures is favorable. 

A second mechanism behind the formation of inhomogeneities can be the topological 
structure of the order parameter field; does the mean-field accommodate solutions that simply
cannot be continuously deformed back into a uniform state? The XY-model discussed
at the end of the preceding chapter conveniently illustrates this principle: a vortex cannot be
eliminated by any smooth deformation of the field. One might argue that this irreducibility 
is associated with the behavior of the core region of the vortex, where local order breaks 
down and the mean-field theoretical description simply does not apply (i.e. the vortex can- 
not be removed by manipulations of the phase field alone). However, an alternative, and 
more generally valid, explanation of the phenomenon is that a vortex represents a field con- 
figuration characterized by a non-vanishing winding number, i.e. an integer different from 


1 For example, sometimes a system may find it energetically favorable to develop a micro-texture optimally
adjusted to the structure of the underlying Hamiltonian (a prominent example being charge density wave
formation in one-dimensional systems). Yet even these structures exhibit a discrete translational invariance.

zero. But a non-vanishing integer cannot be continuously “deformed” to zero. (The vortex
example also illustrates that the two principles of topology and entropic proliferation tend 
to cooperate. Once the energy necessary to form a mean-field of non-vanishing winding 
number has been invested, the system can benefit from the freedom to place the core region 
anywhere in the system.) 

The remarks above touch upon but two of many interesting aspects of systems with 
topologically non-trivial order parameters. In the next section we employ a trivial example 
(namely a free particle on a ring) to set the stage for the discussion of further phenomena 
hinging on topological concepts. Specifically, we introduce the concept of a topological term, 
i.e. an operator which affects the low-energy behavior of a theory solely on the basis of 
the topology of its fields. Turning to the more systematic development of the theory, we 
then introduce homotopy as the key mathematical tool whereby fields can be topologically 
classified. This discussion provides the conceptual platform on which the rest of the chapter 
is based. It is followed by a discussion of different classes of topological terms ($\theta$-terms,
Wess-Zumino terms, Chern-Simons terms), along with a number of applications. It turns 
out that, whenever such terms are present in a theory, they tend to massively affect its 
long-range behavior. At the same time, topological terms are notoriously easy to overlook 
in “standard” schemes of distilling low-energy theories from their microscopic origins. For 
this reason, some emphasis is placed on purely operational aspects; i.e. tricks that prevent 
one from missing the presence of a topological term! 

Before getting started, it is worthwhile emphasizing that we are about to plunge into a 
wide subject area that simply cannot be satisfactorily covered in a single chapter. 2 Consequently,
our discussion is example-oriented and often regrettably superficial (with regard
to both physical depth and, especially, mathematical structures). In fact, the aim of the 
present text is to demystify the subject of topology in field theory, to arouse the interest of 
readers and to motivate them to proceed to more profound and substantial discussions in 
the literature! 


9.1 Example: particle on a ring 

Consider the problem of a free quantum particle of charge e confined to one
dimension and subject to periodic boundary conditions - a particle on a
ring (see figure). To make the problem somewhat more interesting, let us
assume that the ring is threaded by a magnetic flux $\Phi$. Measuring the coordinate of the
particle in terms of an angular variable $\phi \in [0, 2\pi]$, the free Hamiltonian of the system thus
takes the form ($\hbar = e = c = 1$),

$$\hat{H} = \frac{1}{2}\left( -i\partial_\phi - A \right)^2, \qquad (9.1)$$


where $A = \Phi/\Phi_0$ denotes the vector potential corresponding to the magnetic field (exercise),
and $\Phi_0 = hc/e = 2\pi$ represents the magnetic flux quantum. (Here, for notational simplicity,
we have set the radius of the ring and the particle mass to unity.) Periodicity implies that we
are working on a Hilbert space of wavefunctions $\psi$ subject to the condition $\psi(0) = \psi(2\pi)$.

2 For an advanced text specifically targeted on topological considerations in physics, and the underlying
mathematical structures, we refer to M. Nakahara, Geometry, Topology and Physics (IOP Publishing, 2003).

Of course, the problem defined by Eq. (9.1) is embarrassingly simple. One may readily 
verify that the eigenfunctions and spectrum of the Hamiltonian are given by 

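$$\psi_n(\phi) = \frac{1}{\sqrt{2\pi}} \exp(in\phi), \qquad \epsilon_n = \frac{1}{2}\left( n - \frac{\Phi}{\Phi_0} \right)^2, \qquad n \in \mathbb{Z}. \qquad (9.2)$$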

On the other hand, this very simplicity is somewhat deceptive; 3 we shall see in a moment 
that many of the concepts of topological quantum field theory find a preliminary realization 
in the problem above. 
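
A short numerical illustration of Eq. (9.2) may be helpful here. The sketch below assumes the spectrum $\epsilon_n = \frac{1}{2}(n - \Phi/\Phi_0)^2$ with integer n, lists the lowest levels, and checks that the spectrum as a whole is unchanged when the flux is increased by one flux quantum, even though the flux itself is not quantized. The function names and the cutoff `n_max` are hypothetical.

```python
def levels(flux, n_max=50):
    """Sorted energies (n - flux)^2 / 2 for |n| <= n_max; flux is Phi/Phi_0."""
    return sorted(0.5 * (n - flux) ** 2 for n in range(-n_max, n_max + 1))

def ground_state_energy(flux):
    """Lowest level: half the squared distance of the flux from the nearest integer."""
    frac = flux - round(flux)
    return 0.5 * frac * frac

if __name__ == "__main__":
    print("lowest levels at Phi/Phi_0 = 0.3:", levels(0.3)[:4])
    print("lowest levels at Phi/Phi_0 = 1.3:", levels(1.3)[:4])
    print("ground state at half a flux quantum:", ground_state_energy(0.5))
```

Shifting the flux by $\Phi_0$ merely relabels the levels, $n \to n + 1$; at half-integer flux the two lowest levels become degenerate.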

To explore these connections, let us reformulate the system in the language of the imag- 
inary time path integral (cf. Problem 3.5): 


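$$Z = \int_{\phi(\beta) - \phi(0) \in 2\pi\mathbb{Z}} D\phi\; e^{-\int_0^\beta d\tau\, L(\phi, \dot\phi)}, \qquad (9.3)$$

where the boundary condition $\phi(\beta) - \phi(0) \in 2\pi\mathbb{Z}$ expresses the fact that the phase is defined
only up to integer multiples of $2\pi$, and the Lagrangian is given by

$$L(\phi, \dot\phi) = \frac{1}{2}\dot\phi^2 - iA\dot\phi. \qquad (9.4)$$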

EXERCISE Verify by Legendre transformation that the Hamiltonian corresponding to this
Lagrangian is given by Eq. (9.1). Obtain the spectrum Eq. (9.2) from the path integral, i.e.
represent the partition function in the form $Z = \sum_n \exp(-\beta\epsilon_n)$. (Hint: You may find the
Hubbard-Stratonovich decoupling of the quadratic term useful.)


Suppose that we were unaware of the exact solution of the problem. Our canonical approach 
to controlling the integral would be to subject the theory to a saddle-point analysis. The 
saddle-point (alias Euler-Lagrange) equations of the action $S[\phi] = \int_0^\beta d\tau\, L(\phi, \dot\phi)$,

$$\frac{\delta S[\phi]}{\delta\phi(\tau)} = 0 \;\Rightarrow\; \ddot\phi = 0,$$

have two interesting properties. (i) The vector potential does not enter the equations.
On the other hand, we saw above that it does have a physical effect (the spectrum
explicitly depends on A). We need to understand how these two seemingly contradictory
observations can be reconciled with each other. (ii) There exists a whole family of
solutions, $\phi_W(\tau) = 2\pi W \tau/\beta$. The action of these configurations, $S[\phi_W]|_{A=0} = \frac{1}{2\beta}(2\pi W)^2$,
varies discontinuously with W, i.e. by analogy with other cases where we found saddle-point


3 Note that the literature is full of erroneous statements on even this simple system. The most frequent of these is
that the periodic boundary conditions of the problem force the flux to be quantized in integer multiples of the
flux quantum, $\Phi = n\Phi_0$. This is, of course, incorrect. Even for non-integer $\Phi/\Phi_0$, the wavefunctions are perfectly
periodic. The statement is probably triggered by flux-quantization in superconducting systems (a phenomenon
that relies on energetic, and not topological, considerations).

Figure 9.1 Mappings $\phi : S^1 \to S^1$ of different winding numbers (panels labeled W = 1 and W = 2).

configurations defined by an integer index (cf. instanton solutions in the quantum double 
well), we may expect the different solutions to be separated by huge energy barriers. 
However, the present problem is special insofar as, besides energy, there is a much more 
“profound” principle separating configurations of different W. 

To understand this point, let us note that, mathematically, the field $\phi$ is a mapping

$$\phi : S^1 \to S^1, \qquad \tau \mapsto \phi(\tau),$$

from the unit circle $S^1$ (imaginary time with periodic boundary conditions 4 ) into another
circle ($\phi$ is a phase!). Mappings of this type can be assigned a winding number,
W, i.e. the number of times $\phi(\tau)$ winds around the unit circle as $\tau$ progresses from 0 to
$\beta$: $\phi(\beta) - \phi(0) = 2\pi W$ (see Fig. 9.1).
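
To make the notion of a winding number concrete, the following sketch computes W for a discretized trajectory whose phase is stored modulo $2\pi$. It sums the phase increments between neighboring time slices, each mapped back to the interval $[-\pi, \pi)$, which is reliable as long as the sampling is fine enough that no true increment exceeds $\pi$. The sample path (a uniform winding plus a smooth periodic wobble) and all names are assumptions made for the illustration.

```python
import math

def winding_number(phases):
    """Winding number of a closed path given by phases (mod 2 pi) on a time grid."""
    total = 0.0
    for a, b in zip(phases, phases[1:]):
        # Map each increment to [-pi, pi) so that jumps caused by the mod 2 pi
        # storage do not contribute.
        total += (b - a + math.pi) % (2.0 * math.pi) - math.pi
    return round(total / (2.0 * math.pi))

def sample_path(W, n=400, wobble=0.3):
    """phi = 2 pi W s + wobble * sin(2 pi s), stored mod 2 pi, with s = tau/beta = k/n."""
    return [
        (2.0 * math.pi * W * k / n + wobble * math.sin(2.0 * math.pi * k / n)) % (2.0 * math.pi)
        for k in range(n + 1)
    ]

if __name__ == "__main__":
    for W in range(-2, 3):
        print("constructed with W =", W, "-> measured", winding_number(sample_path(W)))
```

The smooth wobble changes the path but not the measured integer, which is the discrete counterpart of the statement that a continuous deformation cannot change W.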

Indeed, it is not possible to change W by a continuous deformation of $\phi$. Since continuity
is a paradigm implicit in field integration, the integration over all functions $\phi(\tau)$ can be
organized as an integration over functions $\phi(\tau)$ of different winding numbers, or different
topological sectors:

$$Z = \sum_W \int_{\phi(\beta) - \phi(0) = 2\pi W} D\phi\; e^{-\int_0^\beta d\tau\, L(\phi, \dot\phi)} = \sum_W e^{2\pi i W A} \int_{\phi(\beta) - \phi(0) = 2\pi W} D\phi\; e^{-\frac{1}{2}\int_0^\beta d\tau\, \dot\phi^2}. \qquad (9.5)$$

Here, we have noted that the A-dependent term in the action,

$$S_{\rm top}[\phi] \equiv -iA \int_0^\beta d\tau\, \dot\phi = -iA\left( \phi(\beta) - \phi(0) \right) = -2\pi i W A,$$

involves only the index of the topological sector of $\phi$, so that $e^{-S_{\rm top}} = e^{2\pi i W A}$. The
representation (9.5) makes the topological aspects of the problem particularly transparent.
Specifically, one may note that:


4 Strictly speaking, imaginary time should be identified with a “circle” of circumference $\beta$. However, for our present
purposes, all that matters is that the periodic boundary conditions render the interval $[0, \beta]$ isomorphic to a
circle.

> The functional integral assumes the form of a sum of integrals over disjoint topological
sectors.

> The contribution to the action, $S_{\rm top}$, is our first example of a topological term. (More
precisely, it belongs to the class of “$\theta$-terms.”)

> Since $S_{\rm top}$ is sensitive only to the topological sector of a field configuration, it cannot
affect the equations of motion. This is because these probe how the action responds to
an infinitesimal variation of the field configuration, an operation that cannot change the
winding number. (You may wish to ponder this point since it will be important.)

> However, the topological term does affect the outcome of the functional integration: it
plays the role of a W-dependent “phase,” weighting the contribution of different sectors.

> The fact that $S_{\rm top}$ knows only about the topological class of a field configuration implies
that it is impervious to any changes in the metric of the base manifold of the theory (in
our case, imaginary time). For example, we might decide to measure time in different
units, i.e. $\tau \to \alpha\tau \equiv \tau'$, where $\alpha$ is some scaling factor. This transformation leaves the
topological term invariant.

> In particular, it remains form invariant under a change from imaginary to real time, 
r — > —it = t. In both representations, 5 t op [ 0 ] = ^tiAW is purely imaginary. This, in 
fact, is a hallmark of topological terms; in both Euclidean and Minkowski space-time, 
their contribution to the action is always imaginary. 


While formulated for the (almost trivial) example of the particle on the ring, all of these 
features generalize to much more involved settings. However, to discuss these generalizations 
in a sensible manner, we need to provide somewhat more mathematical background. This 
will be the subject of the next section. 
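
The sector decomposition (9.5) can be checked numerically. The following Python sketch assumes the ring Hamiltonian $H = \frac{1}{2}(-i\partial_\phi - A)^2$ with eigenvalues $\epsilon_n = \frac{1}{2}(n - A)^2$ (this spectrum is not restated in the present section, so it should be read as an assumption), and it assumes that the Gaussian integral over sector $W$ is normalized to $\sqrt{2\pi/\beta}\, e^{-2\pi^2 W^2/\beta}$. The function names are illustrative only.

```python
import numpy as np

def z_spectral(beta, A, n_max=200):
    """Partition function as a sum over the assumed eigenvalues (n - A)^2 / 2."""
    n = np.arange(-n_max, n_max + 1)
    return float(np.sum(np.exp(-0.5 * beta * (n - A) ** 2)))

def z_winding(beta, A, w_max=200):
    """The same quantity written as a sum over winding sectors W, cf. Eq. (9.5)."""
    W = np.arange(-w_max, w_max + 1)
    # Assumed weight of the free (A = 0) path integral restricted to sector W.
    sector_weight = np.sqrt(2.0 * np.pi / beta) * np.exp(-2.0 * np.pi ** 2 * W ** 2 / beta)
    # Topological phase attached to sector W.
    phase = np.exp(2j * np.pi * W * A)
    return complex(np.sum(phase * sector_weight))

if __name__ == "__main__":
    beta, A = 2.0, 0.3
    print(z_spectral(beta, A), z_winding(beta, A))
```

If the assumptions hold, the two sums should agree to numerical precision, and both should be unchanged under $A \to A + 1$, which reflects the fact that the topological phase only depends on $A$ mod 1.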


INFO Owing to its simplicity, the system above frequently appears as an effective model in condensed
matter physics. Examples we have encountered previously include the Josephson junction
(where the presence of a condensate induces an additional cosine potential) and the physics of normal
metal granules subject to strong charging (with the additional complication of dissipative damping
of the $\phi$-fluctuations). Here we briefly touch upon the physics of persistent currents as an example
where the topological aspects of the problem play a particularly important role.

Consider then a ring-shaped conductor subject to a magnetic flux $\Phi$. According to a prediction by
Byers and Yang,5 the magnetic field induces an equilibrium current

$$I(\Phi) = -\frac{\partial F(\Phi)}{\partial \Phi},$$

periodic in $\Phi$ with period $\Phi_0$.
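
As a plausibility check of the flux periodicity, the sketch below fills the lowest levels of free spinless fermions on a clean ring. It assumes units $\hbar = m = 1$, single-particle levels $\epsilon_n = \frac{1}{2}(2\pi/L)^2 (n - \Phi/\Phi_0)^2$, and zero temperature (so that the free energy reduces to the ground-state energy). None of these specifics is fixed by the text above, and the function names are hypothetical.

```python
import numpy as np

def ground_state_energy(flux, n_particles=11, length=1.0, n_max=100):
    """Sum of the n_particles lowest levels; flux is measured in units of Phi_0."""
    n = np.arange(-n_max, n_max + 1)
    levels = 0.5 * (2.0 * np.pi / length) ** 2 * (n - flux) ** 2
    return float(np.sort(levels)[:n_particles].sum())

def persistent_current(flux, d_flux=1e-5, **kwargs):
    """I = -dE/dPhi from a central finite difference (units where Phi_0 = 1)."""
    e_plus = ground_state_energy(flux + d_flux, **kwargs)
    e_minus = ground_state_energy(flux - d_flux, **kwargs)
    return -(e_plus - e_minus) / (2.0 * d_flux)

if __name__ == "__main__":
    for f in (0.2, 1.2, 2.2):
        print(f, ground_state_energy(f), persistent_current(f))
```

Shifting the flux by one flux quantum merely relabels the levels, so both the energy and the current should repeat with period $\Phi_0$. The finite-difference current is only meaningful away from level crossings, where the energy has a kink.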


EXERCISE Remembering that a vector potential enters the free energy as $\sim \int \mathbf{A}\cdot\mathbf{j}$, derive
the Byers and Yang formula. Show that, at zero temperature, the persistent current flowing in
a perfectly clean one-dimensional metal of non-interacting fermions assumes the form of a $\Phi_0$-periodic
sawtooth function, $I(\Phi) = \frac{2\pi v_F}{L}[\Phi/\Phi_0]$, where $[x] = x - n$ and $n$ is the largest integer
smaller than $x$. (Hint: For zero temperature, the free energy of a non-interacting system of particles
is equal to the sum of all single-particle energies up to the Fermi energy. Notice that the current is
carried by the last occupied state, i.e. the currents $-\partial_\Phi \epsilon_n$ carried by all levels beneath the Fermi
energy cancel.)

5 B. Byers and C. N. Yang, Theoretical considerations concerning quantized magnetic flux in superconducting
cylinders, Phys. Rev. Lett. 7 (1961), 46-9.

Figure 9.2 Power spectrum of the persistent current carried by a single gold loop (diameter
$O(1\,\mu\mathrm{m})$, $L/\ell \sim 10$). The data shows a peak at $h/e$ and a small satellite modulation at $h/2e$. The
magnitude of the current is some two orders of magnitude larger than that predicted by theory.
(Reprinted with permission from V. Chandrasekhar, R. A. Wells, M. J. Brady, et al., Magnetic
response of a single, isolated gold loop, Phys. Rev. Lett. 67 (1991), 3578-81. Copyright (1991) by
the American Physical Society.)

For a long time it was believed that this statement was largely academic since
surely even a moderate concentration of impurities 6 would be sufficient to render the current
undetectable. However, a simple consideration shows that this need not be the case:
a gauge transformation $\psi(\phi) \to e^{iA\phi}\psi(\phi)$ removes $A$ from the Hamiltonian while changing
the boundary conditions to $\psi(0) = e^{2\pi i A}\psi(2\pi)$. In the gauge-transformed picture, the
presence of the magnetic field thus amounts to a twist in the boundary conditions of the
wavefunctions, and the persistent current is a measure of the sensitivity of the spectrum to
this twist. Now, there is no reason to believe that a wavefunction in a disordered system
should be less sensitive to its boundary conditions than that of a clean system. 7 Indeed,
it is shown in Problem 9.7.1 that even rings of circumference $L \gg \ell$ may carry a sizeable
persistent current.

In a series of beautiful experiments conducted in the early 1990s (see, e.g., Fig. 9.2), 
persistent currents in both “ballistic” ($L/\ell \sim 1$) and “dirty” ($L/\ell \sim 100$) environments were
indeed observed experimentally. Frustratingly, the measured current appears to be some two 
orders of magnitude larger than the theories of non-interacting particles would predict. This 
disturbing discrepancy led to the formulation of a plethora of theories of persistent currents 
in interacting/disordered systems. However, to date, the discrepancy between theory and 
experiment remains unresolved. 


6 More precisely, it was believed that a sizeable current might only be observed if the circumference of the ring 
were smaller than the scattering mean free path. The artificial fabrication of rings of that quality requires 
semiconductor technology which was not available at the time. With regard to molecular rings, equilibrium 
currents had been predicted in the 1930s. However, the fields needed to drive currents in molecules ($O(10^5\,\mathrm{T})$)
cannot be generated by the laboratory magnets currently available. 

7 What does diminish this sensitivity is mechanisms destroying the coherence of wavefunctions (thermal noise, 
etc.) and strong localization, i.e. a wavefunction confined to a finite portion of the ring will not sense changes in 
the boundary conditions. In fact, sensitivity with respect to changes in the boundary conditions has been used 
(largely by theorists) as a popular test for localization.

9.2 Homotopy
9.2.1 Generalities

To appreciate the generalization of the structures above to more complicated contexts, we 
need to introduce a few more mathematical concepts. In Chapter 1 (see page 6), we defined
fields as mappings

$$\phi : M \to T, \qquad z \mapsto \phi(z),$$

from a “base manifold” $M$ - usually $\mathbb{R}^d$ or a subset thereof - into some “target space” $T$. By
now we have gained enough experience to determine what are the “most frequent” realizations
of target spaces: indeed we have mostly been interested in the long-range behavior of
Goldstone modes, i.e. long-range modes induced by a mechanism of symmetry breaking. In
such cases (cf. page 257), $T = G/H$ will be a coset space obtained by dividing the symmetry
group of the problem, $G$, by some subgroup $H \subset G$ stabilizing the mean-field around which
the Goldstone modes fluctuate. Usually (but certainly not invariably) $G$ will be one of the
compact classical groups $\mathrm{U}(N)$, $\mathrm{O}(N)$, and $\mathrm{Sp}(N)$, and $H$ some subgroup thereof.

INFO Why is the word “topology” so often deployed in this chapter? In the example above, 
the most relevant characteristic of a field configuration was its winding number, i.e. a quantity 
that does not change under any continuous deformation of a field, no matter how “large” this
deformation is. More generally, in this chapter we are concerned with features of field theories that
essentially rely on the concept of continuity, but do not involve the notion of “distance” or, more 
formally, of a metric. 

The most general mathematical structure for which the notion of continuity can be defined is a
topological space: 8 a mapping $\phi : X \to Y$ between two topological spaces is called continuous
if, for any open set $U \subset Y$, the set $\phi^{-1}(U) \subset X$ is open in $X$. This definition is not tied to
the existence of a metric. 9 This remark is not entirely academic: there are prominent physical
spaces - the phase space of classical mechanics, for example - which do not possess a canonical 
metric but for which the notion of a continuous mapping certainly exists. 

Although our discussion below does not rely on the in-depth mathematics of differential topol- 
ogy, the frequent use of the attribute “topological” emphasizes that what matters in this chapter 
is “continuity minus metric.” 

In fact, practical considerations also enable us to be a bit more specific as to the structure 
of the base manifold. Suppose our microscopic parent theory is defined on some simply
connected manifold $M \subset \mathbb{R}^d$. As we shall be typically concerned with some kind of thermodynamic
limit, $M$ can be thought of as an “infinitely large” object. On the level of
the low-energy theory of the system, this requires that “sensible” field configurations must 


approach a constant value on the boundary of $M$: i.e. $\phi|_{\partial M} = \mathrm{const.}$ (lest the action become
infinite).

8 As a reminder, let $X$ be a set and $\mathcal{J} = \{Y_i \subset X \,|\, i \in I\}$ a collection of its subsets. The pair $(X, \mathcal{J})$ is called a
topological space if and only if (a) $\{\}, X \in \mathcal{J}$, (b) for $J \subset I$, $\bigcup_{i \in J} Y_i \in \mathcal{J}$, and (c) for any finite subset $J \subset I$,
$\bigcap_{i \in J} Y_i \in \mathcal{J}$. The elements of $\mathcal{J}$ are called open subsets of $X$.

9 By contrast, in elementary mathematics courses, the continuity of a mapping $\phi$ is frequently defined via a metric,
i.e. one relates the distance between two image points $\phi(x)$ and $\phi(y)$ to that between the arguments $x$ and $y$.



As far as our effective field theory is concerned, we may thus identify
all boundary points with a single point at “infinity.” Geometrically, this
implies that $M$ can be compactified to a large sphere (see the figure).
However, as the radius of this sphere carries no significance (indeed, technically,
it can be removed by a global rescaling of coordinates), we can,
just the same, identify $M \simeq S^d$ with the $d$-dimensional unit sphere. The
upshot of all of these considerations is that, in practice, we shall
mostly be interested in fields

$$\phi : S^d \to G/H,$$

mapping a unit sphere into some coset space.

We now turn to the discussion of topological aspects of such mappings.
By analogy to our discussion of the previous section (where we considered
mappings $\phi : S^1 \to S^1 \simeq \mathrm{O}(2)$), let us consider two fields $\phi_1$ and $\phi_2$ as
topologically equivalent if they can be continuously deformed into each
other. Technically, this condition amounts to the existence of a continuous
mapping (a homotopy in the language of mathematics)

$$\hat\phi : S^d \times [0, 1] \to G/H, \qquad (x, t) \mapsto \hat\phi(x, t),$$

such that $\hat\phi(\,\cdot\,, 0) = \phi_1$ and $\hat\phi(\,\cdot\,, 1) = \phi_2$. (Notice that $\hat\phi$ represents a
mapping from $(d+1)$-dimensional space into $G/H$, a fact that will become
important when we turn to the discussion of Wess-Zumino terms below.) We denote the
equivalence class of all fields topologically equivalent to a given representative $\phi$ by $[\phi]$.
(In the example discussed in the previous section, those equivalence classes would contain
all fields of a given winding number.) The set of all topological equivalence classes $\{[\phi]\}$ of
mappings $\phi : S^d \to T$ is called the $d$-th homotopy group, $\pi_d(T)$.


INFO Some readers might wonder in what sense $\pi_d(T)$ carries a group structure (rather than
just being a set). To understand this point, it is convenient to deform our base manifold from
a sphere to a $d$-dimensional unit cube $I^d = [0, 1]^d$. As far as topology is concerned, this is
a permissible operation if and only if we identify the boundary $\partial I^d$ of the cube
with a single point on the sphere. For example, it is convenient to choose this point to be
the representative of the “infinitely large” boundary of our original base manifold. This choice
requires that $\phi|_{\partial I^d} = \phi_* = \mathrm{const.}$ be the constant field configuration approached at the “physical
infinity.” 10


10 The assumption of a unique asymptotic configuration $\phi_*$ is less of a restriction than it may seem; any (constant)
boundary field $\phi$ can be converted to $\phi_*$ by a global transformation acting on the field (an operation that does
not leave the equivalence class).






Figure 9.3 Concatenation of two two-dimensional fields into a single one. 


The purpose of the cube construction is that it presents us with ways to glue two fields $\phi_1$
and $\phi_2$ together to form a new field $\phi_3 = \phi_1 * \phi_2$ (see Fig. 9.3). For example, we might define

$$\phi_3(x_1, x_2, \ldots, x_d) = \begin{cases} \phi_1(2x_1, x_2, \ldots, x_d), & x_1 \in [0, 1/2], \\ \phi_2(2x_1 - 1, x_2, \ldots, x_d), & x_1 \in [1/2, 1]. \end{cases}$$

One might justly object that, due to its explicit coordinate dependence, this is not an acceptable
mathematical definition. However, as far as our discussion of homotopy is concerned, this
objection is immaterial. Indeed, we may define a group operation on $\pi_d(T)$ by

$$[\phi_1] * [\phi_2] = [\phi_3]. \tag{9.6}$$

This definition is canonical in the sense that any other coordinate convention in our concatenation
operation above would not leave the equivalence class of $\phi_3$ (a point on which you might like to
dwell). Similarly one may verify that “*” obeys all the criteria for a group mapping. In short, 
the very possibility to “concatenate” fields induces a well-defined group operation on the set of 
topological equivalence classes. 
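
For the simplest case of maps $S^1 \to S^1$, the concatenation rule and the resulting group law (winding numbers add) can be tried out numerically. The sketch below is an illustration under stated assumptions: loops are stored as angle-valued functions on $[0, 1]$ whose end points map to the same base point, and the helper names `loop`, `concatenate` and `winding_number` are hypothetical.

```python
import numpy as np

def loop(W):
    """A representative map [0, 1] -> S^1 (stored as an angle) with winding number W.
    Both end points are mapped to the base point (angle 0 mod 2*pi)."""
    return lambda x: 2.0 * np.pi * W * np.asarray(x, dtype=float)

def concatenate(phi_1, phi_2):
    """phi_3 = phi_1 * phi_2: traverse phi_1 on [0, 1/2], then phi_2 on [1/2, 1]."""
    def phi_3(x):
        x = np.asarray(x, dtype=float)
        return np.where(x <= 0.5, phi_1(2.0 * x), phi_2(2.0 * x - 1.0))
    return phi_3

def winding_number(phi, samples=4001):
    """Accumulate the phase increments of exp(i*phi) along the loop."""
    x = np.linspace(0.0, 1.0, samples)
    z = np.exp(1j * phi(x))
    increments = np.angle(z[1:] / z[:-1])  # each increment lies in (-pi, pi]
    return int(np.rint(increments.sum() / (2.0 * np.pi)))

if __name__ == "__main__":
    print(winding_number(concatenate(loop(2), loop(-1))))
```

A smooth deformation of a loop, for example adding a small periodic wiggle to the angle, should leave the computed winding number unchanged, which is the numerical counterpart of the statement that the group operation acts on equivalence classes rather than on individual fields.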


9.2.2 Examples of homotopies 

The general group theoretical analysis of homotopies is a mathematical subject that reaches 
far beyond the scope of the present text. Here we restrict ourselves to the discussion of a 
few examples of practical interest. In simple cases, the homotopy group can be identified 
by common sense reasoning. For example, in the previous section we saw that mappings
$S^1 \to S^1$ can be classified in terms of winding numbers: $\pi_1(S^1) = \mathbb{Z}$. Similarly, it is clear
that any mapping $S^1 \to S^2$ - a closed curve on the 2-sphere - is continuously contractible
to a point: $\pi_1(S^2) = 0$. The same applies, by definition, to any curve in a simply connected
space. Prominent examples of such spaces are the higher-dimensional spheres $S^{d>1}$ and
$\mathrm{SU}(N)$: $\pi_1(S^{d>1}) = \pi_1(\mathrm{SU}(N)) = 0$. By contrast, the first homotopy groups of non-simply
connected spaces are categorically non-trivial. For example, curves on the $d$-dimensional
torus $T^d$ are classified by (exercise: convince yourself of the veracity of this statement)

$$\pi_1(T^d) = \underbrace{\mathbb{Z} \times \cdots \times \mathbb{Z}}_{d}.$$

Turning to higher dimensions, it becomes more and more difficult to identify homotopy
classes simply by invoking one’s imagination. One of the last intuitively accessible examples
is $\pi_2(S^2) = \mathbb{Z}$: maps of the 2-sphere into itself can be classified according to how often they
wrap around the sphere. This statement generalizes to $\pi_d(S^d) = \mathbb{Z}$, while $\pi_k(S^{d>k}) = 0$.
Interestingly, the situation can be non-trivial for mappings $S^k \to S^{d<k}$. For example, Hopf
has shown that $\pi_3(S^2) = \mathbb{Z}$. For a summary of these, and a few more results, see Table 9.1.

Table 9.1 Homotopy groups of a number of frequently encountered mappings. A blank entry
means that no general statements can be made.

           S^1    S^2                    S^{d>2}      T^d                     SU(2)               SU(N)
k = 1      Z      0                      0            Z x ... x Z (d times)   0                   0
k = 2      0      Z                      k < d: 0     0                       0                   0
k > 2      0      (but pi_3(S^2) = Z)    k = d: Z     0                       pi_3(SU(2)) = Z
                                         k > d:

Far from being complete, the list of examples in Table 9.1 fails to cover a number of homotopies
of outstanding relevance. For example, in condensed matter physics (unlike particle
physics where space and time are intertwined by relativistic covariance) we are typically
confronted with a situation where time is separately compactified to a circle - imaginary
time with periodic boundary conditions. On top of that, the finite-action arguments outlined
above motivate a compactification $\mathbb{R}^d \to S^d$ of real space to a $d$-dimensional sphere. This
implies that, in quantum statistical field theory, one often encounters the base manifold
$M \simeq S^1 \times S^d$ (instead of $S^{d+1}$ as found in particle physics). Clearly the construction of
homotopy groups $\pi(S^1 \times S^d, T)$ corresponding to mappings $S^1 \times S^d \to T$ is more complex
than that involved in the definition of the group structures above. However, rather than
dwelling on the near-endless field of homotopy theory, we now return to field theory and
explore general implications of the homotopic classification scheme.


9.3 $\theta$-Terms

Returning to field theory considerations, let us address the question of what can be learned 
from the concepts introduced in the previous section. Each field $\phi : M \to T$ can be uniquely
assigned to a certain homotopy class. Consequently, the functional integration defining a
field theory can be organized as

$$Z = \sum_{W \in G} \int D\phi_W\, e^{-S[\phi]},$$

where $G$ is the homotopy group and $\int D\phi_W$ denotes integration over the homotopy class
defined by a given element $W \in G$ - a “topological sector” of the theory. It may happen
that the action of our field theory,

$$S[\phi] = S_0[\phi] + S_{\rm top}[\phi],$$

contains a topological action component: i.e. a contribution $S_{\rm top}[\phi] \equiv F(W)$ that depends
only on the topological class of the field $\phi$ (which, for integer-valued $W$, is sometimes called
the topological charge of the configuration). In such cases, one may pull the function
$F(W)$ in front of the functional integral to write

$$Z = \sum_{W \in G} e^{-F(W)} \int D\phi_W\, e^{-S_0[\phi]}. \tag{9.7}$$

Figure 9.4 The combination of two fields with winding numbers $W_1$ and $W_2$ leads to a new field
with winding number $W_1 + W_2$.

In the following, we would like to understand in what way the topological action depends 
on the index W. This question can also be addressed by the type of reasoning ubiquitous in 
this chapter. Suppose we are given two fields $\phi_1$ and $\phi_2$ which are constant everywhere save
for two well-localized regions of variation somewhere in space-time. Let us assume that the
two regions where the fields vary are “infinitely” far away from each other (see Fig. 9.4).
Now, suppose we had glued these fields together (e.g. by the prescription formulated in
the previous section) to form a new field $\phi_1 * \phi_2$. The infinitely large separation of the
two constituent fields implies that they are completely “uncorrelated,” i.e. the action of
the composite field $S[\phi_1 * \phi_2] = S[\phi_1] + S[\phi_2]$ is obtained simply by adding the actions
of the constituents. In particular, $F(W_1 + W_2) = S_{\rm top}[\phi_1 * \phi_2] = S_{\rm top}[\phi_1] + S_{\rm top}[\phi_2] =
F(W_1) + F(W_2)$, where in the first equality we have used the fact that the composite field
has topological index $W_1 + W_2$.

The identity $F(W_1 + W_2) = F(W_1) + F(W_2)$ tells us that the topological action is linear
in the topological index. For example, consider the simple (and at the same time most
frequently encountered) case where $\pi(M, T) \simeq \mathbb{Z}$, i.e. $W_i \in \mathbb{Z}$ are just numbers. The linearity
then uniquely determines the topological action,

$$F(W) = i\theta W,$$

up to a constant $\theta$ (which we choose to be real lest the action become ill-defined at large
values of the topological charge).
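
The step from additivity to linearity can be spelled out explicitly. The short derivation below is a sketch that assumes only the additivity of $F$ for integer arguments; naming the single free constant $F(1) \equiv i\theta$ is a convention.

```latex
\begin{align*}
F(0) &= F(0 + 0) = 2F(0)
  &&\Rightarrow\quad F(0) = 0, \\
F(W) &= F\big((W - 1) + 1\big) = F(W - 1) + F(1)
  &&\Rightarrow\quad F(W) = W\,F(1) \quad (W > 0,\ \text{by induction}), \\
0 &= F\big(W + (-W)\big) = F(W) + F(-W)
  &&\Rightarrow\quad F(-W) = -W\,F(1).
\end{align*}
% Hence F(W) = W F(1) for all integers W; writing F(1) = i\theta gives F(W) = i\theta W.
```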

These considerations tell us that the factor $\exp(-F(W)) = \exp(-i\theta W)$ weighting the
different sectors assumes the form of a phase. Relatedly, the constant $\theta$ is usually referred
to as a topological angle. (Since $W$ is integer, $\theta$ is defined only mod $2\pi$ - an angular
variable.) For historical reasons 11 the topological action $S_{\rm top}$ is commonly referred to as a $\theta$-term.
Notwithstanding its simplicity, a drawback of this representation is that it explicitly
relies on the decomposition of the field integral into a sum over distinct topological sectors.
It would be much more desirable to work with a representation $S_{\rm top}[\phi]$ directly in terms of
the field (rather than through its winding number). This would enable us to formulate the
field integral more directly as $\int D\phi \exp(-S_0[\phi] - S_{\rm top}[\phi])$, without the necessity to explicitly
sum over winding numbers.

Indeed, it is almost always possible to represent the topological action as an integral over 
a topological Lagrangian density, 

$$S_{\rm top}[\phi] = \int d^dx\, \mathcal{L}_{\rm top}[\phi, \partial_\mu \phi].$$

Unfortunately, no canonical recipe for the construction of this representation exists. However,
this is not as serious a problem as one might suspect. For one thing, the Lagrangian
densities of the “usual suspects” of topologically non-trivial field theories are known. Conversely,
it is usually straightforward to deduce whether or not a given term in the action is
a $\theta$-term in disguise. To illustrate these points, we next discuss the concept of a topological
Lagrangian density on a few relevant examples.

In fact, we have already encountered the simplest representative of a $\theta$-term in our analysis
of the particle on a ring:

$$S_{\rm top}[\phi] = \frac{i\theta}{2\pi} \int_0^\beta d\tau\, \dot\phi = iW\theta,$$

where $\theta = 2\pi A$ was proportional to the magnetic flux through the ring. We reiterate the
key features of this term: it does not affect the equations of motion (a small distortion, or
variation, of the field does not change its topological index, i.e. it leaves $S_{\rm top}$ invariant),
and it is invariant under arbitrary coordinate reparameterizations $\tau \mapsto s \equiv s(\tau)$. As a less
simple example illustrating these features we now discuss a two-dimensional field theory.


9.3.1 A case study: $\pi_2(S^2)$

Consider a field theory in a two-dimensional compactified space with the two-sphere as a 
target manifold. Important examples falling into this category include the theory of quantum 
spin chains (see Section 9.3.3), the two-dimensional classical Heisenberg model, and the field 
theory of the integer quantum Hall effect (see Section 9.3.4). Technically, the fields of this 
theory are mappings 

$$\mathbf{n} : S^2 \to S^2, \qquad x \mapsto \mathbf{n}(x), \qquad |\mathbf{n}| = 1,$$

and the relevant homotopy group is $\pi_2(S^2) \simeq \mathbb{Z}$. Let us suppose that the topological action
of this field theory is given by

$$S_{\rm top}[\mathbf{n}] = \frac{i\theta}{4\pi} \int dx_1\, dx_2\, \mathbf{n} \cdot (\partial_1 \mathbf{n} \times \partial_2 \mathbf{n}). \tag{9.8}$$

11 One of the first major applications where terms of this type played a dominant role was ’t Hooft’s analysis of
SU(2) gauge field instantons in (3 + 1)-dimensional compactified space-time. (G. ’t Hooft, Magnetic monopoles
in unified gauge theories, Nucl. Phys. B79 (1974), 276-84.) There he systematically labeled the topological
angle by $\theta$; hence the name $\theta$-term.

How does one verify an assertion of this type? A first condition for $S_{\rm top}$ to be of topological
nature is an insensitivity to small variations of the field $\mathbf{n}$. Suppose then we vary $\mathbf{n}$ by a small
amount, i.e. $\mathbf{n}(x) \to \mathbf{n}(x) + \epsilon^a(x) R^a \mathbf{n}(x)$, where the functions $\epsilon^a$, $a = 1, 2, 3$, are infinitesimal
and $R^a$ are the generators of rotations around the three coordinate axes. By integrating a
few times by parts, it is straightforward to show that the variation of $S_{\rm top}$ assumes the form

$$\delta S_{\rm top}[\mathbf{n}] = \frac{3i\theta}{4\pi} \int dx_1\, dx_2\, \epsilon^a R^a \mathbf{n} \cdot (\partial_1 \mathbf{n} \times \partial_2 \mathbf{n}).$$

However, $R^a \mathbf{n}$ is perpendicular to $\mathbf{n}$ while (exercise) $(\partial_1 \mathbf{n} \times \partial_2 \mathbf{n})$ lies parallel, i.e. $\delta S_{\rm top} = 0$.
(As a corollary we note that $S_{\rm top}$ will not affect the equations of motion.)

The invariance of $S_{\rm top}$ implies that we can evaluate its value on any convenient test field
configuration $\mathbf{n}_0$; each field that can be reached by continuous deformation of $\mathbf{n}_0$ will have
the same topological action. Consider, for example, the family of field configurations

$$\mathbf{n}^{(W)} : \mathbb{R}^2 \to S^2, \qquad (x_1, x_2) \mapsto \left( \phi = W \tan^{-1}\left(\frac{x_2}{x_1}\right),\ \theta = 2 \tan^{-1}\left(\frac{a}{x_1^2 + x_2^2}\right) \right),$$

where we used polar coordinates to parameterize $\mathbf{n}^{(W)}(\theta, \phi)$. For historical reasons, 12 these
field configurations are commonly referred to as skyrmions. (Notice that the suffix “-on”
indicates that we are dealing with highly stable [particle-like] excitations.)

A coarse visualization of the simplest skyrmion configuration, $\mathbf{n}^{(1)}$, is shown in the figure: a texture
of unit vectors varying on a scale set by the parameter $a$. Skyrmions of higher winding numbers,
$\mathbf{n}^{(W>1)}$, are difficult to visualize. However, it is straightforward to verify (by substitution
of the unit vector $\mathbf{n}(\phi, \theta)$ into the integral (9.8)) that their topological charge is given by
$W$. Equivalently, the topological action reads $S_{\rm top}[\mathbf{n}^{(W)}] = i\theta W$.

EXERCISE Verify these statements. Show that the topological charge is insensitive to coordinate 
changes on both the target and the base manifold. Try to invent other simple field configurations 
of non-vanishing topological charge. 
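
The substitution can also be carried out numerically. The sketch below assumes the skyrmion profile $\phi = W \tan^{-1}(x_2/x_1)$, $\theta = 2\tan^{-1}(a/(x_1^2 + x_2^2))$ and evaluates $\frac{1}{4\pi}\int dx_1\, dx_2\, \mathbf{n} \cdot (\partial_1 \mathbf{n} \times \partial_2 \mathbf{n})$ by finite differences on a square grid. The grid size, the box width and the function names are illustrative choices, not part of the text. The overall sign of the result depends on orientation conventions (it flips, for instance, under $x_2 \to -x_2$), so only the magnitude is compared with $W$.

```python
import numpy as np

def skyrmion(x1, x2, W=1, a=1.0):
    """Unit-vector field with polar angle theta and azimuthal angle phi (assumed profile)."""
    phi = W * np.arctan2(x2, x1)
    # arctan2 avoids a division by zero at the origin: theta = pi there, 0 at infinity.
    theta = 2.0 * np.arctan2(a, x1 ** 2 + x2 ** 2)
    return np.stack([np.sin(theta) * np.cos(phi),
                     np.sin(theta) * np.sin(phi),
                     np.cos(theta)])

def topological_charge(W=1, a=1.0, half_width=8.0, points=601):
    """(1/4 pi) * integral of n . (d1 n x d2 n), approximated on a finite grid."""
    x = np.linspace(-half_width, half_width, points)
    h = x[1] - x[0]
    x1, x2 = np.meshgrid(x, x, indexing="ij")
    n = skyrmion(x1, x2, W, a)            # shape (3, points, points)
    d1 = np.gradient(n, h, axis=1)        # finite-difference derivative along x1
    d2 = np.gradient(n, h, axis=2)        # finite-difference derivative along x2
    density = np.sum(n * np.cross(d1, d2, axis=0), axis=0)
    return float(density.sum() * h * h / (4.0 * np.pi))

if __name__ == "__main__":
    for W in (1, 2, 3):
        print(W, topological_charge(W))
```

Because the integrand decays rapidly away from the skyrmion core, truncating the plane to a finite box introduces only a small error; changing the scale `a` (within the box) should leave the result essentially unchanged, in line with the metric independence of the topological charge.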

INFO A word on semantics: depending on the context, topologically non-trivial field configurations
are described as solitons, instantons, skyrmions, etc. While there seem to be no
discernible systematics in this scheme, a rule of thumb is that topological excitations in dynamical
quantum theories (i.e. theories where the base manifold represents space-time) are called
“instantons.” By contrast, “solitons” are topological solutions of classical equations of motion.
However, this rule is also sometimes broken.

The ubiquitous presence of the suffix “-on” reflects a widespread tendency in physics to associate
excitations that are protected from ordinary decay mechanisms (presently, by the presence
of topological indices) with different kinds of “particles.”

12 See T. H. R. Skyrme, A nonlinear field theory, Proc. Roy. Soc. London A 260 (1961), 127-38, where these
excitations appeared in the context of an effective model of nuclear matter.


9.3.2 Functional integration and topological textures: generalities 

How can our present understanding of topologically non-trivial field configurations be
extended to a working scheme of field integration? As we saw above, the different topological
sectors of the theory essentially lead their “own lives.” It is then an obvious idea to try to
carry out the “canonical program” of field integration (analysis of mean-field configurations
followed by integration over fluctuations) in each sector separately. Our first step would thus be to
seek solutions of the equation

$$\frac{\delta S[\phi]}{\delta \phi}\bigg|_{\phi \in \phi_W} = \frac{\delta S_0[\phi]}{\delta \phi}\bigg|_{\phi \in \phi_W} = 0. \tag{9.9}$$

(The equality of the two variations expresses the fact that the topological action does not change under field
variation.) In our prototypical example of Section 9.1, the solutions of these equations were
readily identified as $\phi_W(\tau) = 2\pi W \tau/\beta$. However, in general, finding solutions of Eq. (9.9)
for $W \neq 0$ is a task more complicated than the analysis of the $W = 0$ mean-field. (This is
because field configurations with $W \neq 0$ generally exhibit some non-trivial spatial variations,
i.e. we cannot rely on the standard homogeneity assumptions.)

INFO However, there exists an elegant trick whereby the identification of topologically
non-trivial mean-field configurations can sometimes be drastically simplified. Consider the
expression

$$0 \le \frac{1}{2} \int d^2x\, (\partial_\mu \mathbf{n} + \epsilon_{\mu\nu}\, \mathbf{n} \times \partial_\nu \mathbf{n}) \cdot (\partial_\mu \mathbf{n} + \epsilon_{\mu\nu'}\, \mathbf{n} \times \partial_{\nu'} \mathbf{n}) = \int d^2x\, \left( \partial_\mu \mathbf{n} \cdot \partial_\mu \mathbf{n} - \epsilon_{\mu\nu}\, \mathbf{n} \cdot (\partial_\mu \mathbf{n} \times \partial_\nu \mathbf{n}) \right),$$

where the inequality simply expresses the fact that we are integrating a positive definite quantity.
Now, we know that the second term in the latter integral yields just $-8\pi$ times the topological
charge, i.e.

$$W \le \frac{1}{8\pi} \int d^2x\, \partial_\mu \mathbf{n} \cdot \partial_\mu \mathbf{n} \equiv S_0[\mathbf{n}].$$

We thus conclude that $W$ represents a lower bound for the action of field configurations of
topological charge $W$. This limit is reached for extremal configurations

$$\partial_\mu \mathbf{n} + \epsilon_{\mu\nu}\, \mathbf{n} \times \partial_\nu \mathbf{n} = 0, \tag{9.10}$$

on which the integral above vanishes. Since any continuous variation of these fields leads to
a non-vanishing integral, they must be stationary field configurations. We have thus managed
to reduce the identification of stationary phase configurations to the solution of the first-order
differential equation (9.10).

In passing we note that Eq. (9.10) is most elegantly solved by introducing complex coordinates
$z = x_1 + i x_2$ and representing $\mathbf{n}$ in terms of a stereographic projection:

$$n_1 + i n_2 = \frac{2w}{1 + |w|^2},$$

where $w \in \mathbb{C}$. Straightforward substitution then shows that Eq. (9.10) assumes the form
$\partial_{\bar z} w(z) = 0$ (exercise). This means that any meromorphic function

$$w = \prod_{i=1}^{W} \frac{z - a_i}{z - b_i}$$

solves the equation. To understand the identification of the order of the product, $W$, with the
topological charge, notice that the inverse $z(w)$ will be a $W$-valued function, i.e. $W$ indeed
measures the number of times the sphere ($w$-space) is covered by the compactified plane ($z$-space).
(Exercise: Verify that, up to a different choice of the coordinate axes, the skyrmion $\mathbf{n}^{(W)}$
above is one of these extremal configurations.)

As an attractive by-product, the scheme above automatically yields the action $S_0[\phi] = W$ of
the extremal configurations $\phi$. A slightly modified variant of the same trick helps to find other
topological mean-field configurations; e.g. the famous SU(2) instantons central to the analysis of
$\theta$-vacua in QCD (for a pedagogical discussion of this subject see, e.g., Ryder 13).


At this stage, the further course of action seems to be clear. We ought to compute the action 
of the mean-field solutions of a given topological index, and then integrate over fluctuations. 
In practice, however, things turn out to be not quite as straightforward. To understand what 
is going on, let us revisit a problem we discussed back in Chapter 3, namely the motion of a 
quantum particle in a steep periodic potential. The topological nature of that system follows 
from the fact that, (i) by virtue of the periodicity of the potential, V(x) = V(x + a), we 
can identify the configuration space with a ring of periodicity a and, (ii) with exponentially 
large probability, at times $t \to \pm\infty$ the quantum particle will rest in any one of the minima
of the potential (see Fig. 9.5). This latter fact implies that the two points $t = \pm\infty$ forming
the boundary of the base manifold can be identified (in analogy to our discussion of higher-dimensional
compactification above). Within the path integral approach to the problem, we
are thus integrating over mappings $S^1 \to S^1$; the spatial distance (in units of $a$) our particle
traverses in the course of time translates to the winding number of the mappings.

13 L. H. Ryder, Quantum Field Theory (Cambridge University Press, 1996).

Figure 9.5 Snapshot of two typical field configurations of the U(1) model. Dashed: minimal field
configuration with $W = 4$ instantons. Solid: field with 7 instantons and $7 - W = 3$ anti-instantons.

This number is a topological invariant, i.e. it does not change under any continuous 
deformation of the path. However, what may well happen is that, on its way from the 
minimum at x = 0 to another one at x = Wa, the particle includes one or several detours 
(see Fig. 9.5) . As is evident from the figure, any of those non-direct paths can be continuously 
deformed to a straight tunneling path $0 \to a \to 2a \to \cdots$. To connect to the notions
of topology, suppose that the individual tunneling events forming a path of total winding
number $W$ are widely separated in time. By analogy with our discussion in Section 9.2,
we can then imagine the full path as the result of the superposition of $V > W$ single-instanton
($W = 1$) field configurations and $V - W$ anti-instantons ($W = -1$). The homotopic
equivalence of a path with a non-vanishing number of anti-instantons to the direct path (no
anti-instantons) amounts to the fact that an instanton and an anti-instanton annihilate,
$1 + (-1) \to 0$, when their temporal coordinates approach each other; only the difference
between the numbers of instantons and anti-instantons is a topological invariant. 

For steep potentials, instantons and anti-instantons are widely separated in time, imply- 
ing that correlations/annihilations are vanishingly improbable. For obvious reasons, such 
configurations are referred to as dilute instanton gases. However, for sufficiently shallow 
potentials, instantons begin to proliferate in number. The increase in the instanton “den- 
sity” gives rise to correlation effects or “interactions” between the instantons. (Technically, 
the interaction of two nearby tunneling/anti-tunneling events no longer is just the sum of 
the two partial actions, but contains correlation terms.) Of course, these instanton liquids 
are much more difficult to describe than dilute instanton gases. 

Clearly, these phenomena are not limited to the one-dimensional example above. For 
general $\pi(M, T) \simeq \mathbb{Z}$, instantons exist as “particles” ($W > 0$) and “anti-particles” ($W < 0$).
Only the difference in the number of particles and anti-particles is a topological quantum 
number. The general plan of an instanton analysis will, therefore, typically take the following 
form: 

> Solve mean-field equations - find the instantons/anti-instantons.

> Analyze the dilute instanton phase.

> Identify correlations between instantons and explore when the diluteness assumption
breaks down.

> If possible, try to understand the physics of the correlated instanton system.

In Chapter 3, we exemplified this program on the U(1) example (albeit not emphasizing
the underlying topology). A pedagogical discussion of the generalization to the $S^2$-instanton
gas (skyrmions/anti-skyrmions) can be found in the text by Polyakov.14 However, at this
stage, we will not pursue further the complexities of the dense instanton systems. Rather,
we shall turn to the - long overdue - discussion of examples where $S^2$-instanton formation
plays an important physical role.

14 A. M. Polyakov, Gauge Fields and Strings (Harwood, 1987).


9.3.3 Spin chains 

In Section 2.2 we applied the Holstein-Primakoff transformation to explore the dispersion
relation $\epsilon(p)$ of excitations in ferro- and antiferromagnetic spin systems. Within this
approach we found that, for antiferromagnetic systems, the dynamics of long-wavelength
spin-wave excitations is characterized by a linear dispersion relation

$$\epsilon(p) \sim v_s |p|, \tag{9.11}$$

where $v_s$ is the spin-wave velocity. However, let us recall that this result was based on a
crude semi-classical expansion valid only to leading order in 1/S, where S is the magnitude
of the spins. Yet, what happens when a 1/S expansion is seemingly unjustified? Do interaction
processes between the elementary spin-waves significantly renormalize the observable
excitation spectrum? At any rate, we are dealing with a “strong coupling problem” for
which no obvious approximation scheme is in sight.
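To make the linear small-momentum behavior of Eq. (9.11) concrete, the following sketch compares a spin-wave dispersion of the form $\epsilon(k) = 2JS|\sin(ka)|$ with its long-wavelength limit $v_s|k|$, $v_s = 2aJS$ (the velocity quoted later in this section). The sine form is the standard linear spin-wave result for the nearest-neighbor antiferromagnetic chain and enters here as an assumption, since the present section does not restate it; `J` denotes the magnitude of the exchange constant, and all function names and parameter values are purely illustrative.

```python
import math

def spin_wave_energy(k, J=1.0, S=0.5, a=1.0):
    """Assumed linear spin-wave dispersion 2 J S |sin(k a)| of the antiferromagnetic chain."""
    return 2.0 * J * S * abs(math.sin(k * a))

def linear_approximation(k, J=1.0, S=0.5, a=1.0):
    """Long-wavelength form v_s |k| with spin-wave velocity v_s = 2 a J S."""
    v_s = 2.0 * a * J * S
    return v_s * abs(k)

if __name__ == "__main__":
    # The ratio approaches unity as the wavevector decreases.
    for k in (0.5, 0.1, 0.01):
        ratio = spin_wave_energy(k) / linear_approximation(k)
        print(f"k a = {k:5.2f}   eps / (v_s |k|) = {ratio:.6f}")
```

Within this leading-order picture the linear regime is controlled by $ka \ll 1$ only; whether it survives for small S is precisely the question raised above.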

In situations like this it is usually a good idea to identify an exactly solvable reference
system where the physics is known. In the present context, the S = 1/2 antiferromagnetic
spin chain plays this role. Indeed, we have seen (in Problem 2.4)15 that, at small wavevectors,
the S = 1/2 antiferromagnet is equivalently described as a system of one-dimensional
chiral fermions. We know that the excitations of the fermions (charge density waves) have
a linear dispersion, i.e. they obey a one-dimensional variant of Eq. (9.11). This seems quite
reassuring: both $S \gg 1$ and the exactly solvable point S = 1/2 are characterized by a linear
dispersion. It is then, perhaps, not too bold to speculate that, in the analytically inaccessible
intermediate regime $S \sim 1$, interactions will also not corrupt the linear dispersion of
antiferromagnetic spin chains.

Surprisingly, though, this expectation does not conform with experimental observation.
Neutron scattering experiments on one-dimensional spin 1/2 antiferromagnets have indeed
shown that, in the vicinity of the Néel ordering wavevector $q = \pi/a$, the dispersion is linear.
However, spin S = 1 chains show altogether different behavior! It turns out that these
systems do not support low-energy magnetic excitations at all (see Fig. 9.6). More generally,
the cumulative experimental finding is that antiferromagnetic chains of half-integer spin do
support a relativistic low-energy excitation, while chains of integer spins are gapped.


15 To be precise, in Problem 2.4 we applied a Jordan-Wigner transformation to represent the S = 1/2 chain
in terms of a half-filled system of one-dimensional fermions. The fermion system contained an interaction
term whose strength was determined by the anisotropy $\Delta$ of the magnetic correlations. In the XY-limit, $\Delta = 0$
(vanishing coupling of the z-components), the fermion system becomes free. (That is, both the fermion
system and its equivalent partner, the spin system, support long-range excitations.) For general $\Delta$, bosonization
techniques can be applied to map the problem onto a two-dimensional sine-Gordon model. This model falls into
the universality class of the two-dimensional (!) classical XY-model (cf. discussion on page 469). Translated
back to the context of the spin chain, the RG flow behavior of the latter implies that, for values of the anisotropy
all the way up to the Heisenberg limit $\Delta = 1$, the system remains in a gapless phase (the interaction stays
irrelevant). For further discussion of the spin 1/2 chain we refer to A. O. Gogolin, A. A. Nersesyan, and A. M.
Tsvelik, Bosonization and Strongly Correlated Systems (Cambridge University Press, 1998).
Figure 9.6 Neutron scattering data from the S = 1 compound CsNiCl$_3$. The main panel shows
the excitation energy for wavevectors close to the antiferromagnetic nesting vector $Q_c = \pi$. In
contrast to half-integer spin systems, the spectrum is gapped. (Reprinted with permission from
M. Kenzelmann, R. A. Cowley, W. J. L. Buyers, et al., Properties of Haldane excitations and
multiparticle states in the antiferromagnetic spin-1 chain compound CsNiCl$_3$, Phys. Rev. B 66
(2002), 24407. Copyright (2002) by The American Physical Society.)


As a rule, physical phenomena depending on the parity of an integer quantum number
(presently, 2S being even or odd) tend to be of topological origin. To understand why
topology appears in the present context recall that, classically, the configuration space of
a spin is a sphere (of radius S). The spin configuration of a spin chain is thus described
by some mapping from (1 + 1)-dimensional space-time into the sphere. Compactification of
space-time then leads to the mappings $S^2 \to S^2$ discussed above.

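To substantiate this picture, let us start out from the quantum partition function of an
isolated spin derived in Section 3.3:

$$Z^{(1)} = \int D\mathbf{n}\, \exp\left[ iS \int_0^\beta d\tau\, \mathcal{L}_{\rm WZ}(\mathbf{n}, \partial_\tau \mathbf{n}) \right], \qquad \mathcal{L}_{\rm WZ}(\mathbf{n}, \partial_\tau \mathbf{n}) = (1 - \cos\theta)\, \dot{\phi},$$

where the integration extends over all paths $\mathbf{n}: \tau \mapsto \mathbf{n}(\tau)$ and $(\phi, \theta)$ are two angles
parameterizing the unit vector n.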

INFO In the light of the discussion above, the action of this integral looks rather suspicious:
it is purely imaginary and remains invariant under reparameterizations of time $\tau \to \tau'(\tau)$ -
both hallmarks of topological terms. Indeed, $\mathcal{L}_{\rm WZ}[\mathbf{n}, \partial_\tau \mathbf{n}]$ is the Lagrangian of a “Wess-Zumino
action”.16 For the topological character of these actions, and their connection to the $\theta$-terms
discussed presently, see the next section.


16 Notice that, from the point of view of a purist, the notation $\mathcal{L}_{\rm WZ}(\mathbf{n}, \partial_\tau \mathbf{n})$ is problematic. As discussed earlier,
the Wess-Zumino action does not admit a globally coordinate invariant representation in terms of n and its
derivatives. To formulate one, we have to deploy an explicit coordinate representation. For the underlying
reason, see the next section.
The generalization of the (0 + 1)-dimensional path integral to a (1 + 1)-dimensional field
integral of the spin chain is a good exercise in guessing effective actions. As a warm-up
exercise, we begin our discussion with the ferromagnetic spin chain. The interaction
between the neighboring spins is mediated by an operator

$$-J\, \hat{\mathbf{S}}_i \cdot \hat{\mathbf{S}}_{i+1} \;\to\; -J S^2\, \mathbf{n}_i \cdot \mathbf{n}_{i+1} \;\to\; \frac{J S^2}{2} (\mathbf{n}_i - \mathbf{n}_{i+1})^2, \tag{9.12}$$

where J is the positive exchange constant, i labels the sites on the chain, and the first arrow
maps to the representation $\hat{\mathbf{S}} \to S\mathbf{n}$ of the spin operators in the field integral language. In
the second step we have noted that, up to irrelevant constants ($\mathbf{n}^2 = 1$), the interaction can
be represented as a “discrete derivative.” Adding to this interaction term the Wess-Zumino
terms of the individual spins, we are led to the partition function $Z = \int D\mathbf{n}\, \exp(-S[\mathbf{n}])$,
with the effective action

$$S[\mathbf{n}] = \int_0^\beta d\tau \sum_i \left[ \frac{J S^2}{2} (\mathbf{n}_i - \mathbf{n}_{i+1})^2 + iS\, \mathcal{L}_{\rm WZ}(\mathbf{n}_i, \partial_\tau \mathbf{n}_i) \right]. \tag{9.13}$$
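The step from the scalar product to the “discrete derivative” in (9.12) uses nothing but the normalization of the unit vectors. Spelled out as a short check (no assumptions beyond $\mathbf{n}_i^2 = 1$):

```latex
\begin{align*}
(\mathbf{n}_i - \mathbf{n}_{i+1})^2
  &= \mathbf{n}_i^2 + \mathbf{n}_{i+1}^2 - 2\, \mathbf{n}_i \cdot \mathbf{n}_{i+1}
   = 2 - 2\, \mathbf{n}_i \cdot \mathbf{n}_{i+1}, \\
-J S^2\, \mathbf{n}_i \cdot \mathbf{n}_{i+1}
  &= \frac{J S^2}{2}\, (\mathbf{n}_i - \mathbf{n}_{i+1})^2 - J S^2 .
\end{align*}
```

The constant $-JS^2$ per bond shifts the energy uniformly and therefore drops out of all correlation functions.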

Anticipating that, for a ferromagnetic system, the configuration $\{\mathbf{n}_i\}$ will typically be
smooth, we may then take the continuum limit to arrive at the action

$$S_{\rm ferro}[\mathbf{n}] = a^{-1} \int d\tau\, dx \left[ \frac{J S^2 a^2}{2} (\partial_x \mathbf{n})^2 + iS\, \mathcal{L}_{\rm WZ}(\mathbf{n}, \partial_\tau \mathbf{n}) \right], \tag{9.14}$$

where a denotes the lattice spacing. The action (9.14) does not contain a $\theta$-term.

EXERCISE Derive the equations of motion of this action. Show that the mean-field dispersion
of the ferromagnetic chain, $\omega \sim q^2$, is quadratic. (Hint: Recapitulate what we know about the
variation of the Wess-Zumino term from Section 3.3.)

We now turn to the more interesting case of the antiferromagnetic spin chain. The
exchange coupling is now negative, implying that the neighboring spins prefer antiparallel
alignment. We thus start out from a configurational ansatz

$$\mathbf{n}_i = (-)^i\, \mathbf{n}'_i,$$

where n' is the antiferromagnetic order parameter field. To derive the antiferromagnetic
analog of the continuum action (9.14), we ought to substitute this ansatz into Eq. (9.13) and
perform a gradient expansion. However, rather than going through the technical details of
this expansion (see, e.g., Jackson17), we will here fix the structure of the action by qualitative
reasoning. For one thing, we know that the system supports a wave-like mode. The minimal
action consistent with the global rotational invariance of the model ($\mathbf{n} \to R\mathbf{n}$, where
$R \in {\rm O}(3)$) and the presence of a wave-like mode is given by




$$S_0[\mathbf{n}] = \frac{S}{4} \int d\tau\, dx \left( v_s (\partial_x \mathbf{n})^2 + \frac{1}{v_s} (\partial_\tau \mathbf{n})^2 \right),$$
nics (Wiley, 1975).
where we have relabeled $\mathbf{n}' \to \mathbf{n}$ for notational simplicity and $v_s = 2aJS$ is the spin-wave
velocity. Of course, the detailed structure of the coupling constants is beyond the
scope of our present plausibility argument. However, what we can say is that the overall
coupling constant must be proportional to a positive power of S. This is because, for large
S, “interactions” between the elementary spin waves ought to become weak.

EXERCISE To show this, expand n around any preferential axis (e.g. by setting $n_{1,2} = r_{1,2}$,
$n_3 = (1 - r_1^2 - r_2^2)^{1/2}$, $r_1^2 + r_2^2 \in [0, 1]$). Expand the action in powers of $r_i$ and show that, for
large S, the contribution of anharmonic terms (spin-wave “interactions”) becomes small.


Rescaling variables, $\tau \to v_s^{1/2} \tau \equiv x_0$, $x \to v_s^{-1/2} x \equiv x_1$, the action

$$S_0[\mathbf{n}] \to \frac{1}{\lambda} \int d^2x\, \partial_\mu \mathbf{n} \cdot \partial_\mu \mathbf{n}, \qquad \lambda = 4/S,$$

assumes the form of the action of the O(3) nonlinear $\sigma$-model.18
However, as argued above, we expect the full action of the problem to
contain not only a dynamical piece $S_0$ but also a $\theta$-term. How can this be made plausible
from the prototypical action (9.13)? We expect that, somehow, the $\theta$-term must appear as a
descendant of the Wess-Zumino action. In the ferromagnetic case, the Wess-Zumino actions
of the spins basically added to give a contribution that was not a $\theta$-term. However, in the
present case, we are dealing with a staggered spin configuration that leaves room for more
interesting things to happen. Let us first recall the interpretation of $\int_0^\beta d\tau\, \mathcal{L}_{\rm WZ}(\mathbf{n}, \partial_\tau \mathbf{n})$ as
the oriented area $\Gamma[\mathbf{n}]$ on the sphere swept out by the curve $\mathbf{n}(\tau)$ (see figure).

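By an elementary geometrical consideration, $\Gamma[-\mathbf{n}] = 4\pi - \Gamma[\mathbf{n}] = -\Gamma[\mathbf{n}] \bmod 4\pi$. This
implies that the continuum version of the Wess-Zumino action

$$S_{\rm top}[\mathbf{n}] \equiv iS \sum_i \int_0^\beta d\tau\, \mathcal{L}_{\rm WZ}(\mathbf{n}_i, \partial_\tau \mathbf{n}_i),$$

evaluated on a staggered configuration $(-)^i \mathbf{n}_i$, must contain a spatial derivative besides the
temporal derivative inherent in $\mathcal{L}_{\rm WZ}$ (since for $\mathbf{n}_i = {\rm const.}$, the factor $(-)^i$ would lead to
a global cancellation). Indeed, the Wess-Zumino actions of two neighboring configurations
$\mathbf{n}_{i+1}$ and $\mathbf{n}_i$ evaluate, respectively, to the areas $\Gamma[\mathbf{n}_{i+1}]$ and $\Gamma[\mathbf{n}_i]$ bounded by the
corresponding curves. For $|\mathbf{n}_{i+1} - \mathbf{n}_i|$ small, the area difference can be approximated by

$$\Gamma[\mathbf{n}_{i+1}] - \Gamma[\mathbf{n}_i] \simeq \int_0^\beta d\tau\, \mathbf{n}_i \cdot \left( (\mathbf{n}_{i+1} - \mathbf{n}_i) \times \partial_\tau \mathbf{n}_i \right).$$

Summing over i and taking the continuum limit, we obtain (see figure)

$$S_{\rm top}[\mathbf{n}] = i \frac{\theta}{4\pi} \int d\tau\, dx\, \mathbf{n} \cdot (\partial_x \mathbf{n} \times \partial_\tau \mathbf{n}), \qquad \theta = 2\pi S, \tag{9.15}$$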


18 Although the action above is commonly referred to as the action of the O(3)-model, a better terminology would
be the (O(3)/O(2))-model. Indeed, the degrees of freedom of the theory span the 2-sphere $S^2 \simeq {\rm O}(3)/{\rm O}(2)$.
Although the nonlinear $\sigma$-model on the sphere has to be distinguished from the $\sigma$-model on O(3) (cf. the
discussion of Section 8.5 and Problem 8.8.3) we will follow the widespread convention to describe the model
above as the O(3)-model.
which we identify as our earlier representation of the $\theta$-term.
(Notice that the integral $\int d\tau\, dx$ really represents an integral over
the sphere. This is because we agreed that, for $|\tau| \to \infty$ and/or
$|x| \to \infty$, all trajectories approach a common reference point on
$S^2$.)
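As a numerical illustration of the statement that (9.15) counts how often space-time is wrapped around the sphere, the sketch below evaluates $W = \frac{1}{4\pi} \int d\tau\, dx\, \mathbf{n} \cdot (\partial_x \mathbf{n} \times \partial_\tau \mathbf{n})$ on a lattice. Two ingredients are assumptions made purely for illustration and do not appear in the text: the trial configurations (inverse stereographic images of $(x + i\tau)^{|W|}$), and the discretization, which sums signed solid angles of spherical triangles rather than finite-differencing the integrand. All function names are hypothetical.

```python
import math

def dot(u, v):
    return u[0] * v[0] + u[1] * v[1] + u[2] * v[2]

def solid_angle(a, b, c):
    """Signed solid angle of the spherical triangle spanned by unit vectors a, b, c."""
    cross = (b[1] * c[2] - b[2] * c[1],
             b[2] * c[0] - b[0] * c[2],
             b[0] * c[1] - b[1] * c[0])
    numerator = dot(a, cross)
    denominator = 1.0 + dot(a, b) + dot(b, c) + dot(c, a)
    return 2.0 * math.atan2(numerator, denominator)

def trial_configuration(x, tau, winding):
    """Hypothetical test field: inverse stereographic image of (x + i tau)^|W|."""
    w = complex(x, tau) ** abs(winding)
    if winding < 0:
        w = w.conjugate()  # orientation reversal: anti-instanton
    m = abs(w) ** 2
    return (2.0 * w.real / (1.0 + m), 2.0 * w.imag / (1.0 + m), (1.0 - m) / (1.0 + m))

def winding_number(winding, half_width=20.0, points=121):
    """Lattice estimate of W on the square [-half_width, half_width]^2."""
    step = 2.0 * half_width / (points - 1)
    coords = [-half_width + i * step for i in range(points)]
    field = [[trial_configuration(x, tau, winding) for tau in coords] for x in coords]
    total = 0.0
    for i in range(points - 1):
        for j in range(points - 1):
            # Split each plaquette into two triangles with the same orientation.
            a, b = field[i][j], field[i + 1][j]
            c, d = field[i + 1][j + 1], field[i][j + 1]
            total += solid_angle(a, b, c) + solid_angle(a, c, d)
    return total / (4.0 * math.pi)

if __name__ == "__main__":
    for w in (-1, 0, 1, 2):
        print(f"trial winding {w:+d}: lattice estimate {winding_number(w):+.4f}")
```

The estimates deviate slightly from integers only because the finite square does not cover the whole plane; the deficit is the small patch of the sphere around the reference point that the boundary fails to reach.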

Summarizing, we have obtained $S_0[\mathbf{n}] + S_{\rm top}[\mathbf{n}]$ as the effective
action of the antiferromagnet. In Problem 8.8.3, we have seen that at large length scales
a two-dimensional system described by the action $S_0[\mathbf{n}]$ flows into a disordered phase. But
how will the presence of the topological term modify this behavior? To get some idea of what
might happen, let us reformulate the partition function as a sum over disjoint topological
sectors,

$$Z = \sum_{W \in \mathbb{Z}} \int D\mathbf{n}_W\, e^{2\pi i S W}\, e^{-S_0[\mathbf{n}_W]}, \tag{9.16}$$

where $\mathbf{n}_W$ denotes field configurations of winding number W. Equation (9.16) provides
a preliminary explanation for the observation of the half-integer/integer spin staggering
phenomena mentioned above. For integer spin, $\exp(2\pi i S W) = 1$ and the topological term
is not operational. By contrast, for half-integer spin, $\exp(2\pi i S W) = (-)^W$, i.e. consecutive
topological sectors are weighted by alternating signs. Notice that the topological
term is susceptible to the parity of 2S and nothing else. To understand heuristically
the consequences of this feature we emphasize that, at the mean-field level (which is stabilized
by large S), the partition function is governed by a mode with linear dispersion.
Quantum fluctuations around this configuration will alter its dispersion, potentially by
the creation of an excitation gap. Now, in the integer case, these fluctuations additively
($\exp(-S_0) \in \mathbb{R}^+$) contribute to the partition function. For small S they may (and in
fact do) totally mask the mean-field sector. By contrast, for half-integer spin, fluctuations
contribute with alternating sign, thereby partially canceling each other; the mean-field
sector has a better chance to survive. These observations form the basis of Haldane’s
conjecture,19 according to which spin chains of integer S will flow into a disordered phase
with no long-range excitations whilst, in chains of half-integer spin, they remain in a gapless
phase.
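The sign structure of the topological weights in (9.16) can be tabulated directly. The following minimal sketch (the function name is illustrative, not part of the text) uses exact rational arithmetic to evaluate $\exp(2\pi i S W) = (-1)^{2SW}$ for integer and half-integer S:

```python
from fractions import Fraction

def sector_weight(spin, winding):
    """Return exp(2*pi*i*S*W) for integer or half-integer spin S and integer W."""
    two_s = 2 * Fraction(spin)
    if two_s.denominator != 1 or two_s <= 0:
        raise ValueError("spin must be a positive integer or half-integer")
    # exp(2 pi i S W) = exp(i pi (2S) W) = (-1)^(2S * W)
    return -1 if (two_s.numerator * winding) % 2 else 1

if __name__ == "__main__":
    for spin in (Fraction(1, 2), 1, Fraction(3, 2), 2):
        weights = [sector_weight(spin, w) for w in range(-2, 3)]
        print(f"S = {spin}: weights for W = -2..2 -> {weights}")
```

Only the parity of 2S enters: S = 1/2 and S = 3/2 produce the same alternating pattern, while every integer S gives uniform weights.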

Reassuringly, the predictions born out of these rather abstract constructions are in full
agreement with neutron scattering measurements of the dispersion of various quasi-one-dimensional
magnets. But why should one, nonetheless, use the attribute “conjecture” (as
opposed to, say, “theory”)? The scenario above is based on a number of shaky suppositions.
Most seriously, it is tacitly assumed that the $\sigma$-model will seamlessly scale into the strong
coupling region without changing its form. (That the model remains form invariant under
renormalization is a perturbative prediction which can be trusted only for $\lambda \ll 1$.) Indeed,
a subsequent analysis by Affleck and Haldane20 has shown that, en route to the strong
coupling regime, much more drastic things happen. Specifically, the appropriate critical
theory describing the behavior of half-integer chains at large length scales turns out to
be not the O(3) nonlinear $\sigma$-model but rather a field theory with SU(2)-valued degrees of
freedom. We discuss this theory in Section 9.4.4 after the concept of Wess-Zumino terms
has been introduced.

19 F. D. M. Haldane, Nonlinear field theory of large-spin Heisenberg antiferromagnets: semiclassically quantized
solitons of the one-dimensional easy-axis Néel state, Phys. Rev. Lett. 50 (1983), 1153-6.

20 I. Affleck and F. D. M. Haldane, Critical theory of quantum spin chains, Phys. Rev. B 36 (1987), 5291-300.

Introducing another group of phenomena where mechanisms of topology are crucial, we 
now turn to a discussion of the quantum Hall effects (QHE). The QHE belongs here because 
(a) it is surely a compulsory part of any modern treatise on condensed matter phenom- 
ena, and (b) it is, in many respects, a topological phenomenon. On the other hand, our 
discussion will lead us somewhat astray inasmuch as it involves, necessarily, a review of 
experimental observation, the elementary quantum mechanics of electrons in the presence 
of strong magnetic fields, etc. Readers wishing to maintain a more streamlined discussion 
of topology in condensed matter field theory are therefore invited to skip the next section, 
and turn directly to the section on Wess-Zumino terms below. 


9.3.4 Integer quantum Hall effect 

In fact, it is quite misleading to talk about the quantum Hall effect: at the least, one
should speak of the quantum Hall effect“s” - a spectrum of quite different phenomena,
almost unparalleled in diversity and conceptual depth. To get a preliminary impression of
the phenomena, observe the raw experimental data displayed in Fig. 9.7. The figure shows
the Hall resistance $\rho_{xy} = R_H$ and the longitudinal resistance $\rho_{xx}$ (the ragged curve in the
bottom of the plot) of a two-dimensional electron gas as functions of a strong perpendicular
magnetic field. Instead of a dull linearly increasing curve (the classical Hall resistance $\rho_{xy}$)
or an approximate constant (the Drude resistance $\rho_{xx}$) one finds a profile that could hardly
be more structured.

On close inspection of the data, one may notice a number of characteristic sub-structures:
(1) Shubnikov-de Haas oscillations at small magnetic fields, followed by (2) the characteristic
quantum Hall plateaus $\rho_{xy} = \nu^{-1} h/e^2$ at rational “filling fractions” (see below)
$\nu \in \mathbb{Q}$ to which the effect owes its name. These are accompanied (3) by a dramatic drop in
the longitudinal resistance $\rho_{xx}$. The functional form of the increase (4) from one plateau
to the next is described by certain well-defined power laws as a function of temperature,
indicative of a second-order phase transition - the zero-temperature quantum Hall transition.
Barely visible, (5) a second generation of Shubnikov-de Haas oscillations is observed
at $\nu = 1/2$. Finally, there appears much structure in the pattern of rationals $\nu$ for which
plateaus are found. For example, for some low-lying rationals (such as $\nu = 1/4$) no plateau
is formed. The set of rationals for which the effect occurs is known as the quantum Hall
hierarchy.

These are but a few of the most striking observations gathered under the label quantum
Hall effect. Undoubtedly, to account even superficially for all of these phenomena would
present a task that is well beyond the scope of the present text. Rather, we will have
to restrict ourselves to a brief (and, alas, painfully superficial) discussion of a number of
conceptual basics. In the present section we shall focus on the discussion of the conductance
plateaus observed at integer filling factors - the so-called integer quantum Hall effect
(IQHE). The generalization to plateaus at rational filling fractions - which, curiously, hinges
on altogether different physical concepts - is discussed in Section 9.5.1 below.

Figure 9.7 Hall conductance and longitudinal conductance of a two-dimensional electron gas as
functions of a perpendicular magnetic field. For a discussion, see the main text. (Reprinted with
permission from H. L. Stormer, D. C. Tsui, and A. C. Gossard, The fractional quantum Hall effect,
Rev. Mod. Phys. 71 (1999), S298-305. Copyright (1999) by the American Physical Society.)

When specified in units of the conductance quantum $e^2/h$, the plateau conductance $\sigma_{xy} =
\rho_{xy}^{-1}$ of the IQHE is an integer. By its very nature an integer cannot vary continuously (upon
changing some physical control parameter). Indeed, more often than not, observables quantized
in integer (or rational) units are linked to some topological origin. In the following,
we will formulate two different “explanations” of the quantization phenomenon. The quotes
indicate that, far from being rigorous, both lines of argument involve some degree of “bootstrap
character”: assuming that the effect is of topological nature, we feel free to subject
the “real world” arrangement of a QHE experiment to all kinds of abuses (deformation of
the sample boundaries, etc.) to then discover that, yes, the Hall conductance emerges as a
topological invariant. The backbone of a more rigorous approach to the problem is discussed
in Section 9.3.7 below.


9.3.5 Background information on the IQHE 

Let us begin our discussion with a nutshell summary of the phenomenology of the IQHE. 
This is followed by a brief reminder of the phenomenology of Landau level quantization - 
formulated in a language that highlights the symmetries of the problem. 
The QHE is observed in two-dimensional electron gases subject to a strong magnetic field.
The prototypical setup of a QH experiment is shown in the figure. A voltage drop $V_x$ applied
across the sample induces a current $I_x$ in the x-direction plus, by virtue of the Lorentz force,
a Hall current $I_y$ in the y-direction. The relation between $I_x$ and $I_y$ determines the Hall
conductance. More precisely, the key quantities of interest are the entries $\sigma_{xx}$ and $\sigma_{xy}$
determining the conductance tensor

$$\sigma = \begin{pmatrix} \sigma_{xx} & \sigma_{xy} \\ -\sigma_{xy} & \sigma_{yy} \end{pmatrix},$$

which is defined as usual by $\mathbf{I} = \sigma \mathbf{V}$ where $\mathbf{I} = (I_x, I_y)^T$, $\mathbf{V} = (V_x, V_y)^T$ (note that,
by symmetry, $\sigma_{xx} = \sigma_{yy}$). The inverse $\rho = \sigma^{-1}$ defines the resistance tensor, $\mathbf{V} = \rho \mathbf{I}$.
For a system of linear extension L, conductance g and conductivity $\sigma$ are connected by
the relation $g = \sigma L^{d-2}$. In two dimensions, the two quantities coincide. This implies
that no device-geometry-dependent factors interfere when we pass from the basic quantity
determined by the microscopic physics of the system ($\sigma$) to experimental observables
(g); otherwise, no completely universal Hall conductance could possibly be observed. In
the context of the QHE, it is truly important to keep the tensorial structure of $\sigma$ and $\rho$
in mind. For example, somewhat paradoxically, for $\sigma_{xy} \neq 0$, a vanishing of the longitudinal
resistance $\rho_{xx}$ implies a vanishing (as opposed to a divergence) of the longitudinal
conductance $\sigma_{xx}$.
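The seemingly paradoxical simultaneous vanishing of $\rho_{xx}$ and $\sigma_{xx}$ follows from inverting the 2 x 2 tensor. The sketch below assumes the antisymmetric form with equal diagonal entries described above and works in units of $h/e^2$; the function names are illustrative. Note that the relative sign of $\sigma_{xy}$ and $\rho_{xy}$ depends on the orientation convention, so only magnitudes are compared.

```python
def invert_2x2(m):
    """Inverse of a 2x2 matrix given as ((a, b), (c, d))."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if det == 0:
        raise ZeroDivisionError("tensor is singular")
    return ((d / det, -b / det), (-c / det, a / det))

def conductance_from_resistance(rho_xx, rho_xy):
    """Invert rho = ((rho_xx, rho_xy), (-rho_xy, rho_xx)); return (sigma_xx, sigma_xy)."""
    sigma = invert_2x2(((rho_xx, rho_xy), (-rho_xy, rho_xx)))
    return sigma[0][0], sigma[0][1]

if __name__ == "__main__":
    # On a plateau with nu = 2: rho_xx = 0, |rho_xy| = 1/2 (units of h/e^2).
    print(conductance_from_resistance(0.0, 0.5))
    # Without a Hall component, vanishing rho_xx would instead make sigma_xx diverge.
    print(conductance_from_resistance(1e-6, 0.0))
```

Explicitly, $\sigma_{xx} = \rho_{xx}/(\rho_{xx}^2 + \rho_{xy}^2)$, which vanishes with $\rho_{xx}$ as long as $\rho_{xy}$ stays finite.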

Within the context of the QHE, the natural unit for the strength of the applied magnetic
field is the “filling fraction”. This is defined as the ratio

$$\nu \equiv \frac{2\pi N l_0^2}{A}, \tag{9.17}$$

where N denotes the number of electrons in the system, A is the sample area and $l_0 =
(\Phi_0/2\pi|B|)^{1/2}$ the magnetic length, i.e. $2\pi$ times the external magnetic flux through the
area $l_0^2$ is equal to one flux quantum.21
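To get a feeling for the magnitudes involved, definition (9.17) can be evaluated in SI units, where the flux quantum is $\Phi_0 = h/e$. The sketch below is illustrative only: the electron density is a hypothetical value of the order typical for semiconductor heterostructures, and the function names are not taken from the text.

```python
import math

PLANCK_H = 6.62607015e-34            # J s (exact SI value)
ELEMENTARY_CHARGE = 1.602176634e-19  # C (exact SI value)

def magnetic_length(b_field):
    """l_0 = sqrt(Phi_0 / (2 pi |B|)) with flux quantum Phi_0 = h/e, in metres."""
    flux_quantum = PLANCK_H / ELEMENTARY_CHARGE
    return math.sqrt(flux_quantum / (2.0 * math.pi * abs(b_field)))

def filling_fraction(density, b_field):
    """nu = 2 pi l_0^2 N / A for an areal electron density N/A (in m^-2)."""
    return 2.0 * math.pi * magnetic_length(b_field) ** 2 * density

def field_for_filling(density, nu):
    """Magnetic field (tesla) at which the given density reaches filling fraction nu."""
    return density * PLANCK_H / (ELEMENTARY_CHARGE * nu)

if __name__ == "__main__":
    density = 3.0e15  # hypothetical areal density in m^-2
    for nu in (1, 2, 4):
        b = field_for_filling(density, nu)
        print(f"nu = {nu}: B = {b:.2f} T, l_0 = {magnetic_length(b) * 1e9:.1f} nm")
```

Equivalently, $\nu$ is the number of electrons per flux quantum threading the sample, which is why increasing the field lowers the filling fraction.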

Experimentally, one finds that (see the figure overleaf, courtesy of D. Leadley), for
field strengths close to an integer filling fraction $\nu \in \mathbb{N}$, the Hall resistance is quantized,
$\rho_{xy} = \nu^{-1} h/e^2$, to an accuracy of $O(10^{-10})$.22 At the same time, the longitudinal
resistivity/conductivity drops by as much as 13 orders of magnitude. In passing we note that
the rapid oscillations visible in the figure at small field strengths represent the familiar
Shubnikov-de Haas oscillations.

21 One may recall that the flux quantum is defined through the relation $\Phi_0 = h/e$. In our standard units $\hbar = e =
c = 1$, $\Phi_0 = 2\pi$ and $l_0 = |B|^{-1/2}$.

22 Due to the striking precision of the experimental data, the unit of electrical resistance is nowadays maintained
as $h/e^2 = 25\,812.80\ \Omega$ through quantum Hall measurements.
As a (presumed) topological phenomenon, we expect the quantization of the Hall conductance
to be robust against sample imperfections and/or the presence of static disorder. What
is more surprising is that the phenomenon, in fact, critically relies on the presence of disorder.
To see this, let us for a moment assume that the opposite is true: we have a homogeneous electron
gas accommodated by a translationally invariant device. We further assume the absence of
external voltage gradients, $\mathbf{E} = 0$, so that no current is flowing in the system, $\mathbf{I} = 0$. Now
suppose that we observe the system from a frame moving with velocity v in, say, the 1-direction.
An experimentalist working in that frame would observe a current density $\mathbf{j} = -v\rho\, \mathbf{e}_1$, where
$\rho$ is the density of the electron gas (not to be confused with the resistance tensor). Further,
the Lorentz covariance of electrodynamics implies that one would measure a finite electric
field $\mathbf{E} = v\, \mathbf{e}_1 \times \mathbf{B} = -vB\, \mathbf{e}_2$, where $\mathbf{B} = B\mathbf{e}_3$ is the applied magnetic field. With
$j_1 = \sigma_{12} E_2$, we obtain $\sigma_{12} = \rho B^{-1}$ for the Hall conductivity in the
moving frame. Being independent of the boost velocity, v, this result holds in all moving
frames, including the static frame, $v \to 0$. We conclude that, in any translationally invariant
environment, the Hall resistivity $\sigma_{12}^{-1} = B/\rho$ is linearly related to the magnetic field,
leaving no room for plateaus.

Reciprocating the argument above, we see that the presence of (translational invariance 
breaking) disorder must be necessary for the observability of the QHE. In fact, we shall see 
in a moment that the effect is born out of a conspiracy of disorder induced localization and 
the phenomenon of Landau level quantization. However, before turning to the discussion 
of the combined effect of these two mechanisms, let us briefly recapitulate the formation of 
Landau levels in a clean two-dimensional electron gas subject to a magnetic field. 

Let us temporarily consider a geometry where the electron gas assumes the form of a
perfect disk.23 To explore the quantum mechanics of this problem, we represent the in-plane
vector potential in the so-called symmetric gauge, $A_i = (B/2)\epsilon_{ij} x_j$, i = 1, 2 (where
the coordinates are measured with respect to the center of the disk), whereupon the free
electron Hamiltonian, $H = (\mathbf{p} - \mathbf{A})^2/(2m^*)$, assumes the form

$$H = \frac{1}{2m^*} \left[ \left( -i\partial_1 - \frac{x_2}{2 l_0^2} \right)^2 + \left( -i\partial_2 + \frac{x_1}{2 l_0^2} \right)^2 \right]. \tag{9.18}$$

(To avoid confusion with the quantum number to appear shortly below, we designate the
electron mass by $m^*$.) We wish to solve the Schrödinger equation $H\psi_n = \epsilon_n \psi_n$. This task
is greatly simplified by subjecting the eigenvalue problem to the similarity transformation
$H \to H' = S H S^{-1}$, $\psi_n \to \psi'_n = S\psi_n$, where $S = \exp[(x_1^2 + x_2^2)/(4 l_0^2)]$. The reason is that
the transformed problem $H'\psi'_n = \epsilon_n \psi'_n$ is governed by the effective Hamiltonian

$$H' = \frac{1}{2m^*} \left[ \left( -i\partial_1 + \frac{i}{2 l_0^2}(x_1 + i x_2) \right)^2 + \left( -i\partial_2 + \frac{1}{2 l_0^2}(x_1 + i x_2) \right)^2 \right],$$

i.e. an operator whose (vector) potential depends only on the linear combination $x_1 +
i x_2$ rather than on two linearly independent coordinates $x_1$ and $x_2$. To benefit from this
simplification, we may switch to complex coordinates, $z = x_1 + i x_2$, $\bar z = x_1 - i x_2$, in which
the Hamiltonian assumes the form (exercise)

$$H' = \frac{1}{2m^*} \left[ -4\partial_z \partial_{\bar z} + \frac{2}{l_0^2}\, z\partial_z + \frac{1}{l_0^2} \right].$$

This Hamiltonian possesses a family of eigenstates $\psi'_n = z^n$ with eigenvalues $\epsilon_n =
\frac{1}{m^* l_0^2}(n + 1/2) = \frac{B}{m^*}(n + 1/2)$. Undoing the similarity transform, one can conclude that the original
Hamiltonian is diagonal on the states

$$\psi_n = z^n \exp\left( -\frac{z \bar z}{4 l_0^2} \right), \qquad \epsilon_n = \omega_c (n + 1/2),$$

whose eigenvalues $\epsilon_n$ are the celebrated Landau levels. The Landau levels differ by integer
multiples of the cyclotron frequency $\omega_c = B/m^*$. On the other hand, we know that, for a
system of linear extension L, there are $O((k_F L)^2)$ states below the Fermi energy $k_F^2/2m^*$.
This implies a typical level spacing $\sim 1/(m^* L^2)$ which is by a factor $\sim B L^2$ smaller than
the spacing between Landau levels. Anticipating that the clean problem does not support
energies other than $\epsilon_n$, one can conclude that the Landau levels must be hugely degenerate:
each of them hosting $B L^2 = (L/l_0)^2$ states.

23 Remember our fundamental working hypothesis whereby the quantization phenomena forming the QHE will
not depend on details of the geometry.
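The passage to complex coordinates left as an exercise above can be checked in a few lines. The following is a sketch that uses only the definitions $z = x_1 + i x_2$, $\bar z = x_1 - i x_2$ and the form of $H'$ before the change of variables; the abbreviations $c$ and $X$ are introduced here for convenience.

```latex
\begin{align*}
\partial_1 &= \partial_z + \partial_{\bar z}, \qquad
\partial_2 = i(\partial_z - \partial_{\bar z}), \qquad
c \equiv \frac{1}{2 l_0^2}, \qquad X \equiv \partial_{\bar z} - c z, \\
2m^* H' &= \left( -i\partial_1 + i c z \right)^2 + \left( -i\partial_2 + c z \right)^2
         = -\left( \partial_z + X \right)^2 + \left( \partial_z - X \right)^2 \\
        &= -2 \left( \partial_z X + X \partial_z \right)
         = -4 \partial_z \partial_{\bar z} + 4 c\, z \partial_z + 2 c, \\
2m^* H' z^n &= (4 c n + 2 c)\, z^n
  \quad \Longrightarrow \quad
  \epsilon_n = \frac{1}{m^* l_0^2} \left( n + \tfrac{1}{2} \right).
\end{align*}
```

The third line uses $\partial_z (z f) = f + z \partial_z f$, which is the origin of the constant term and hence of the zero-point energy $\omega_c/2$.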

To reveal the origin of the massive degeneracy of the Landau levels, we have to identify
a symmetry of the Hamiltonian (9.18) or, equivalently, a set of linearly independent operators
commuting with H. In the present context, these are the “magnetic translation
operators”

$$k \equiv \partial_z - \frac{\bar z}{4 l_0^2}, \qquad \bar k \equiv \partial_{\bar z} + \frac{z}{4 l_0^2}, \qquad \bar k \psi_n = 0.$$

(In the absence of a magnetic field, $l_0 \to \infty$, these would be ordinary translation operators
$\sim -i\partial_i$ transformed to complex coordinates; hence the name magnetic “translation”
operators.) It is straightforward (exercise) to verify the following properties:

$$[H, k] = [H, \bar k] = 0, \qquad [k, \bar k] = \frac{1}{2 l_0^2}.$$

We next use these operators as generators for the creation of states degenerate with the
reference states $\psi_n$. To this end, let us define

T = exp 



U = exp 






Using the commutation relations between $k$ and $\bar k$, one may verify that these operators obey
the relation

$$T U = U T \exp\left[ 4\pi^2 i \left( \frac{l_0}{L} \right)^2 \right],$$

sometimes referred to as the magnetic algebra. We now have all elements to construct
the entire set of states: the fact that $\bar k$ annihilates $\psi_n$ implies that $U\psi_n = \psi_n$. Consider now
the family of states

$$\psi_{n,m} \equiv \mathcal{N}_{n,m}\, T^m \psi_n, \tag{9.19}$$

where $\mathcal{N}_{n,m}$ is a normalization factor. Since H commutes with k (and therefore with T),
$\psi_{n,m}$ form eigenstates with eigenenergy $\epsilon_n$ - they all populate the nth Landau level. However,
we do not yet know whether $\psi_{n,m}$ actually form a set of linearly independent states.
To prove their independence, one can employ Eq. (9.19) and $U\psi_n = \psi_n$ to show that
$U\psi_{n,m} = \exp(-4\pi^2 i\, m (l_0/L)^2)\, \psi_{n,m}$. Therefore, as with Bloch states in a periodic potential,
$\psi_{n,m}$ are eigenstates of a generalized translation operator. For $m = 1, \ldots, 2\pi(L/l_0)^2$, the
corresponding eigenvalues are different, which proves that

$$\{\psi_{n,m} \,|\, 0 < m < 2\pi (L/l_0)^2\},$$

is a linearly independent set of eigenstates in the nth Landau level. The magnitude of this
set $2\pi(L/l_0)^2$ coincides with our estimate of the degeneracy of the Landau levels above, i.e.
we have succeeded in constructing a complete eigenbasis of the magnetic Hamiltonian.



9.3.6 IQHE as a topological phenomenon 



Shortly after the experimental discovery of the IQHE, 24 Laughlin pre- 
sented an ingenious argument whereby the quantization of the Hall con- 
ductance could be explained under fairly general conditions. 25 Slightly 
later it became clear 26 that Laughlin’s argument in fact implied a number 
of curious features of electrons subject to a magnetic field and static dis- 
order. However, before turning to a more detailed discussion of Laughlin’s 
ideas, and their subsequent refinement by Halperin, let us first outline 
the basic skeleton of the argument. 

Given its complete universality, the quantization phenomenon must be 
insensitive to (continuous) deformations of the sample geometry. Using 
this freedom, Laughlin proposed to subject the basic quantum Hall geom- 
etry to the sequence of transformations indicated in the figure. From a 
“Hall bar” geometry we pass to an annular geometry of higher symmetry. 


24 K. von Klitzing, G. Dorda, and M. Pepper, New method for high-accuracy determination of the fine-structure 
constant based on quantized Hall resistance, Phys. Rev. Lett. 45 ( 1990 ), 494 — 7 . 

25 R. B. Laughlin, Quantized Hall conductivity in two dimensions, Phys. Rev. B 23 ( 1981 ), 5632 - 3 . 

26 See B. I. Halperin, Quantized Hall conductance, current-carrying edge states, and the existence of extended 
states in a two-dimensional disordered potential, Phys. Rev. B 25 ( 1982 ), 2185 - 90 . 
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In the last step of the construction, the external bias voltage is replaced by the electromo- 
tive force generated by a weakly time-dependent flux through the annulus. Laughlin then 
suggested monitoring the fate of the system upon an adiabatic (i.e. infinitely slow) variation 
of the flux threading the annulus. Before discussing the response of the system in more 
detail, let us try to motivate this idea. 

We know that, for specific values of the annular flux, namely integer multiples of a flux 
quantum, $\phi = 2\pi n$, the Hamiltonian of the system is gauge equivalent to the Hamiltonian in 
the absence of flux. This is because an integer multiple of the flux quantum can be removed 
by the gauge transformation $\psi \to e^{in\theta}\psi$ acting on the wavefunctions of the system. (For 
non-integer fluxes, this transformation alters the boundary condition $\psi(r, 2\pi) = \psi(r, 0)$ and, 
therefore, the Hilbert space of the problem.) 

Now, let us see what happens as we gradually increase the flux from $\phi = 0$ to $\phi = 2\pi$. The situation 
is visualized in the figure where each line symbolically represents a basis of eigenstates of the 
Hamiltonian for a given value of the flux. Assuming that a gauge transformation has been applied to 
move the flux dependence of the problem to a change in the azimuthal boundary conditions (see our 
discussion in Section 9.1), the eigenstates for different values of the flux are truly distinct, i.e. as one 
increases the flux from $\phi = 0$, each state moves along a path “perpendicular” to the collective set of 
eigenstates, as shown schematically in the figure. Eventually, for $\phi = 2\pi$, we arrive back at the 
original $\phi = 0$ basis. (This follows from the fact that $H|_{\phi=2\pi}$ can be mapped onto $H|_{\phi=0}$ by a 
gauge transformation that does not alter the boundary conditions.) That, however, does not necessarily 
imply that individual basis states map onto themselves upon the completion of the path 
$\phi = 0 \to \phi = 2\pi$. That is, while the set of eigenstates as a whole gets reproduced, permutations 
of individual states are consistent with the gauge invariance of the problem. 

EXERCISE If this statement does not make much sense to you consider, as an example, a clean 
one-dimensional ring subject to a magnetic flux. Explore what happens to the eigenstates of 
$H = \frac{1}{2m}(p - A)^2$ as the flux is increased from 0 to $2\pi$. Show that the non-invariance of individual 
states is compatible with the invariance of the global spectrum. 
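The ring problem lends itself to a quick numerical illustration. The following is a minimal sketch, assuming units $\hbar = e = 1$ (flux quantum $2\pi$), plane-wave eigenstates $e^{in\theta}$ and energies $E_n(\phi) = (n - \phi/2\pi)^2/2mR^2$; the function names and the cutoff are illustrative choices, not part of the text.

```python
import math

def ring_energy(n, phi, mass=1.0, radius=1.0):
    """Energy of the plane-wave state exp(i n theta) on a ring threaded by flux phi.

    Assumed units: hbar = e = 1, so that the flux quantum equals 2*pi.
    """
    return (n - phi / (2 * math.pi)) ** 2 / (2 * mass * radius ** 2)

def low_lying(phi, cutoff, n_max=20):
    """Sorted energies below `cutoff`, collected over angular momenta |n| <= n_max."""
    levels = [ring_energy(n, phi) for n in range(-n_max, n_max + 1)]
    return sorted(round(e, 12) for e in levels if e <= cutoff)

# The spectrum as a whole is reproduced after insertion of one flux quantum ...
same_spectrum = low_lying(0.0, cutoff=8.0) == low_lying(2 * math.pi, cutoff=8.0)

# ... although every individual state has moved: state n now carries the
# energy that state n - 1 had at zero flux (spectral flow).
shifted = all(
    math.isclose(ring_energy(n, 2 * math.pi), ring_energy(n - 1, 0.0))
    for n in range(-5, 6)
)
```

Both flags come out true in this toy model: the set of levels is invariant, while the labels of individual levels are permuted by one unit.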

The non-invariance of individual states upon completion of a round trip back to a gauge- 
equivalent Hamiltonian is a phenomenon called spectral flow. The spectral flow of the 
eigenstates of our magnetic environment lies at the heart of Laughlin’s argument. Specif- 
ically, we shall see that, upon the sending of a flux quantum through the ring, n states 
radially centered at the inner perimeter are pushed above the Fermi energy (n is the num- 
ber of Landau levels below the Fermi energy). At the same time, n states at the outer 
perimeter sink below the Fermi energy. To regain thermal equilibrium, the system responds 
by transferring n electrons from the inner to the outer perimeter. This process takes place 
during the time $t_0$ it takes to adiabatically send the flux quantum through the system, 
i.e. the transverse current is $I_2 = n/t_0$ (remember, $e = 1$). The electromotive force driving 
the process is $V_1 = \dot\phi = 2\pi/t_0$. The corresponding Hall conductance is therefore given by 
$\sigma_{12} = I_2/V_1 = n/2\pi$. Expressed in physical units, this can be cast in the form $\sigma_{12} = ne^2/h$ - 
the quantum Hall effect! 

Figure 9.8 Energy levels of a quantum Hall annulus as functions of the radial coordinate. For a 
discussion, see the main text. 
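The conversion between the natural-unit result $n/2\pi$ and the physical value $ne^2/h$ can be made concrete numerically. The sketch below assumes the exact SI values of $e$ and $h$; the function names are illustrative.

```python
import math

E_CHARGE = 1.602176634e-19  # elementary charge in C (exact SI value)
PLANCK_H = 6.62607015e-34   # Planck constant in J s (exact SI value)

def hall_conductance_natural(n):
    """sigma_12 = n / (2 pi) in units e = hbar = 1."""
    return n / (2 * math.pi)

def hall_conductance_si(n):
    """sigma_12 = n e^2 / h in siemens."""
    return n * E_CHARGE ** 2 / PLANCK_H

# In natural units e^2/h = 1/(2 pi), so dividing by the conductance quantum
# returns the plateau index in either unit system.
ratio = hall_conductance_natural(3) / (1 / (2 * math.pi))

# Resistance of the n = 1 plateau in ohms.
plateau_resistance = 1 / hall_conductance_si(1)
```

The value of `plateau_resistance` is close to 25.8 kΩ, the resistance scale of the first Hall plateau.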

To substantiate this picture, let us consider a situation in which the Fermi level $E_F$ is 
placed somewhere between the first and the second Landau level. For simplicity we shall 
also make the (artificial but physically immaterial, cf. Halperin’s paper 26 ) assumption that 
the disorder is confined to the inner regions of the sample. We require the disorder strength, 
as set by the inverse elastic scattering time, $\tau^{-1}$, to be smaller than the separation between 
Landau levels, $\omega_c\tau \gg 1$. This condition is, in fact, necessary to prevent the Hall plateaus 
from becoming “washed out.” As a result we obtain the level diagram shown in Fig. 9.8. The 
figure schematically shows the energies of the single-particle states as functions of the radial 
coordinate of the annular region. In the bulk of the annulus, disorder leads to a broadening 
of the Landau levels to energy bands of width $\tau^{-1}$. At the outer/inner perimeter of the 
annulus, the confining potential pushes the levels energetically up. Crucially, this implies the 
presence of as many Fermi energy states - “edge states” - as there are occupied Landau 
levels (in our case, just one). These states, and not so much the bulk states buried deep 
below the Fermi energy, are likely to be the carriers of longitudinal currents in the system. 
As to the bulk states, let us presume that they are localized by disorder on a length scale $\xi$ 
much smaller than the circumference $L_\parallel$ of the system. (Intriguingly, we shall soon see that 
this assumption leads to a contradiction.) 

To explore the phenomenon of spectral flow in this environment, we need to turn to a 
refined description where individual levels are resolved. This is the subject of the following. 

EXERCISE We wish to explore the radial structure of the states occupying the lowest 
Landau level. This task is most conveniently accomplished in a basis different from the 
set $\{\psi_{0,m}\}$ discussed above: revisit the construction on page 520 to show that all states 
$\phi_{0,m} = N_{0,m}\, z^m \exp[-\frac{1}{4l_0^2} z\bar z]$ lie in the lowest Landau level. Switch to polar coordinates $(r, \theta)$ 
and verify that these states can be approximated as 

$$\phi_{0,m}(r, \theta) \approx N_{0,m}\, e^{im\theta}\, e^{-(r - r_m)^2/2l_0^2}, \qquad (9.20)$$

where $r_m$ defines the area through which $m$ flux quanta pass; $\pi r_m^2 B = mh/e$. Show that, for any 
reference angle $\theta_0$, the azimuthal current carried by these states is given by 

$$I_\parallel = \int_0^\infty dr\, \langle j_\theta(r, \theta_0)\rangle \propto \int_0^\infty dr\, |\phi_{0,m}(r, \theta_0)|^2\, (r - r_m). \qquad (9.21)$$


The wavefunctions $\phi_{0,m}(r, \theta)$ describe the system far away from its boundaries. They are 
symmetrically centered around $r_m$ and carry angular momentum $L = m$. From Eq. (9.21) 
and the symmetry around the center coordinate $r_m$, we further conclude that the azimuthal 
current carried by these states vanishes, $I_\parallel = 0$. However, in the vicinity of the boundaries, 
this picture becomes perturbed. Firstly, the confining potential will push the states up 
in energy (see Fig. 9.8 where the circles represent the states centered at coordinates $r_m$ close 
to the inner boundary of the annulus). Secondly, the boundary potential will render the 
states radially asymmetric, implying that the integral (9.21) no longer vanishes: azimuthal 
currents flow at the boundaries. The surface currents flowing at the inner/outer perimeter 
are opposite to each other. 
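The bulk statements above (Gaussian shape around $r_m$, vanishing azimuthal current) can be checked numerically. The sketch below assumes $\hbar = e = 1$, $l_0 = B^{-1/2}$, lowest Landau level states $\propto z^m e^{-|z|^2/4l_0^2}$ and the symmetric gauge $A_\theta = Br/2$; all function names and the integration grid are illustrative choices.

```python
import math

def bulk_profile(r, m, l0=1.0):
    """|phi_{0,m}(r)|^2 relative to its peak value, for phi ~ z^m exp(-|z|^2 / 4 l0^2)."""
    r_m = math.sqrt(2 * m) * l0
    return math.exp(2 * m * math.log(r / r_m) - (r * r - r_m * r_m) / (2 * l0 * l0))

def gaussian_profile(r, m, l0=1.0):
    """Gaussian approximation |phi|^2 ~ exp(-(r - r_m)^2 / l0^2) around r_m."""
    r_m = math.sqrt(2 * m) * l0
    return math.exp(-((r - r_m) / l0) ** 2)

def azimuthal_current(m, l0=1.0, steps=20000):
    """Midpoint-rule estimate of int dr |phi|^2 (m/r - r / 2 l0^2) / int dr |phi|^2.

    The bracket is the kinetic angular velocity (p - A)_theta for the assumed
    symmetric gauge A_theta = B r / 2 with B = 1 / l0^2.
    """
    r_max = math.sqrt(2 * m) * l0 + 12 * l0
    dr = r_max / steps
    current = norm = 0.0
    for k in range(steps):
        r = (k + 0.5) * dr
        w = bulk_profile(r, m, l0)
        current += w * (m / r - r / (2 * l0 * l0)) * dr
        norm += w * dr
    return current / norm

m = 50                                   # r_m = sqrt(2 m) l0 = 10 l0
shape_error = abs(bulk_profile(11.0, m) - gaussian_profile(11.0, m))
bulk_current = azimuthal_current(m)
```

In this model the Gaussian form reproduces the exact profile to within about a percent one magnetic length away from $r_m$, and the current of a bulk state vanishes to numerical accuracy. A nonvanishing current requires the radial asymmetry induced by the boundary potential, which this sketch does not model.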

We next discuss what happens as some weakly time-dependent flux $\phi$ is sent through the 
annulus to generate an electromotive force $E_\parallel = \partial_t\phi/r$. The vector potential generalized to 
the presence of the field $E_\parallel$ takes the form $\mathbf{A} = \frac{\phi(r) + \phi}{r}\,\mathbf{e}_\theta$, where $\phi(r) = Br^2$. Referring for a 
more detailed discussion to Halperin’s paper, we note that the flux $\phi$ adds to the background 
flux $\phi(r)$. As a consequence, the center coordinates $r_m(\phi)$ “contract.” Once a full flux 
quantum has been added to the system, $\phi = 2\pi$, the center coordinates have contracted by 
one unit, $r_m(\phi + 2\pi) = r_{m-1}(\phi)$, and the original set of levels (and therefore the spectrum 
of the system) is restored. However, the individual levels have changed - spectral flow: at 
the inner edge of the system, one occupied level has been pushed above the Fermi energy, 
at the outer edge one empty level dived below the Fermi energy (see Fig. 9.8). To repair 
this energy imbalance, the system will want to transfer one electron from the inner edge to 
the outer edge. 
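The contraction of the center coordinates by one unit can be made explicit in a short worked equation. This is a sketch under stated assumptions: units $\hbar = e = 1$ (flux quantum $2\pi$), $l_0 = B^{-1/2}$, and the requirement that the state labelled $m$ always encloses a total flux of $m$ flux quanta.

```latex
% Assumptions: \hbar = e = 1 (flux quantum 2\pi), l_0 = B^{-1/2}, and the state
% labelled m encloses background flux plus driving flux equal to m flux quanta.
\begin{align*}
  \pi r_m^2(\phi)\,B + \phi &= 2\pi m
  \quad\Longrightarrow\quad
  r_m(\phi) = l_0\sqrt{2\left(m - \frac{\phi}{2\pi}\right)}, \\
  r_m(2\pi) &= l_0\sqrt{2(m-1)} = r_{m-1}(0).
\end{align*}
```

The radii decrease monotonically with $\phi$, and after one flux quantum the set $\{r_m\}$ is mapped onto itself with all labels shifted by one.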

But how will it do that? Before the advent of the QHE, it had been common wisdom that 
the states of a two-dimensional electron gas in the presence of disorder (such as the states in 
the bulk of our annulus) are localized on a certain scale $\xi$. For $\xi \ll L_\parallel$, these states would 
be completely oblivious to the presence of our driving flux $\phi$. This would imply that, as far 
as the action of $\phi$ is concerned, the inner and the outer edge of the system are decoupled. 
How, then, would the system know that it ought to transfer one electron between the edges? 
(After all, this transfer has to be mediated through the bulk.) The only way out of this 
dilemma is to courageously postulate (as Halperin did) that, notwithstanding the presence 
of disorder, there must be at least one delocalized bulk state below the Fermi energy. 
Subsequently, it was indeed found that the localization length diverges upon approaching 
the center of the Landau band. The delocalized states in the center of the band establish 
the contact between the edges and may act as conduits of electronic charge. 

Laughlin’s gauge argument hints at the topological nature of the QHE: independent of 
system-specific details, the addition of a flux quantum through the annulus transfers an 
integer charge across the system. This suggests that the effect should be understandable in 
terms of some kind of topological index lurking behind the mapping of a parameter space 
(presently, the amount of flux through the ring) into the Hilbert space of the problem. 
Subsequently, Avron and Seiler, 27 and Thouless, Kohmoto, Nightingale, and den Nijs 28 
indeed succeeded in rigorously identifying the quantum Hall conductance as a topological 
invariant - the first Chern class of the U(1) principal bundle over the two-dimensional torus. 
However, a satisfactory discussion of these ideas, for which we would need to introduce 
much more background in differential topology, would lead us too far astray. Instead, we 
shall head back to our prime subject, the discussion of topological concepts in low-energy 
field theories of condensed matter systems. 

EXERCISE As preparation for the next section, refamiliarize yourself with the field theory 
approach to the disordered electron gas introduced in Section 6.5. 


9.3.7 Field theory of the integer quantum Hall effect 

Laughlin’s gauge argument, and its subsequent refinement by others, helped to unravel 
many of the mysteries posed by the experimental discovery of the IQHE. Specifically it shed 
some light on the conspiracy of dissipationless edge currents, disorder induced localization 
of bulk states, and an exotic family of delocalized states in the formation of the effect. 
On the other hand, even with Laughlin’s picture in store, we are still a long way from a 
more-than-schematic, quantitative understanding of the effect. A huge step towards a full 
microscopic theory of the IQHE - whose final structure still remains partly unknown - was 
taken by Pruisken when he adjusted the nonlinear $\sigma$-model of disordered fermion systems so 
as to account for the presence of a strong magnetic field. In the next two subsections, we will 
reconstruct Pruisken’s field theory, and employ it to gain more insight into the long-range 
behavior of the quantum Hall system. 


Pruisken’s field theory: construction 

As we have seen in earlier chapters of this book, much of the information about a non- 
interacting disordered electron system is contained in correlation functions involving the 
product of a retarded and an advanced single-particle Green function, 


$$\langle x_1|\frac{1}{E_F + \omega + i0 - H}|x_2\rangle\,\langle x_2'|\frac{1}{E_F - \omega - i0 - H}|x_1'\rangle.$$

Depending on the choice of coordinates and the energy argument $\omega$, these functions describe 
the conductance of the system, the statistics of its spectrum and many other characteristics. 


27 J. E. Avron and R. Seiler, Quantization of the Hall conductance for general, multiparticle Schrödinger 
Hamiltonians, Phys. Rev. Lett. 54 (1985), 259-62. 

28 D. J. Thouless, M. Kohmoto, M. P. Nightingale, and M. den Nijs, Quantized Hall conductance in a 
two-dimensional periodic potential, Phys. Rev. Lett. 49 (1982), 405-8. 

We have also seen that the long-range behavior (scales $L \gg \ell$, much larger than the elastic 
scattering mean free path, $\ell$) of this correlation function can be extracted from a field theory 
whose action in $d = 2$ is of nonlinear $\sigma$-model type, 29 

$$S[Q] = \frac{\pi\nu}{4}\int d^2x\,\left[D\,\mathrm{tr}(\partial_\mu Q\,\partial_\mu Q) - 2i\omega\,\mathrm{tr}(Q\sigma_3^{\rm ar})\right]. \qquad (9.22)$$

For later convenience, let us recapitulate a few key features of the field theory defined by 
Eq. (9.22). 

▷ The matrix fields $Q_{aa',ss'}(x) \sim \psi_{as}(x)\bar\psi_{a's'}(x)$ describe the behavior of a product 
of two fermion field amplitudes $\psi_{as}$. The dynamics of the latter is controlled by the 
advanced/retarded single-particle Green function, 
$\langle\psi_{as}(x)\bar\psi_{a's'}(x')\rangle \sim (E_F + s(\omega + i0) - H)^{-1}(x,x')\,\delta_{ss'}\delta_{aa'}$. 
Here, the two-component index $s = +/-$, and $a = 1, \ldots, R$ refers to the replica index. 

▷ The fields $Q$ take values in the coset space $\mathrm{U}(2R)/(\mathrm{U}(R) \times \mathrm{U}(R))$. 30 A concrete 
representation is given by $Q = T\sigma_3^{\rm ar}T^{-1}$, where $T \in \mathrm{U}(2R)$ and $\sigma_3^{\rm ar}$ is a Pauli matrix in 
two-dimensional advanced/retarded (ar) space. 

▷ For $\omega \to 0$ (the limit we will concentrate on throughout) the action above is invariant 
under two distinct symmetry transformations: global transformations $T \to gT$, where 
$g \in G = \mathrm{U}(2R)$ is constant, and local transformations $T \to Th(x)$, where $h(x) \in H = 
\mathrm{U}(R) \times \mathrm{U}(R)$, i.e. the group of all matrices fulfilling the condition $[h(x), \sigma_3^{\rm ar}] = 0$. 

▷ The microscopic parent action from which Eq. (9.22) was derived was, in fact, rotationally 
invariant under the full group $\mathrm{U}(2R)$. That our fields $Q$ live in a smaller coset space 
signals the fact that, in a metallic system, this symmetry is spontaneously broken: the $Q$s 
are the Goldstone modes associated with the breakdown of the symmetry from $\mathrm{U}(2R)$ to 
$\mathrm{U}(R) \times \mathrm{U}(R)$. 

▷ Under a generalized 31 gauge transformation, $\psi \to e^{i\phi}\psi$, the $Q$s transform as 
$Q \to e^{-i\phi}Qe^{i\phi}$. Gauge invariance then implies that the operators $\partial_\mu$ appearing in the effective 
action must be interpreted as covariant derivatives, 

$$\partial_\mu \to \partial_\mu - i[A_\mu,\ \cdot\ ],$$

where $A_\mu$ transforms as a non-abelian vector potential, $A_\mu \to A_\mu - ie^{-i\phi}\partial_\mu e^{i\phi}$. 

(Notice, however, that the “physical vector potential” generating the perpendicular magnetic 
field, $A_{\rm phys} \sim \delta^{ab}\delta^{ss'}$, is diagonal in replica and ar spaces and, therefore, does not 
enter the covariant derivative.) 
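Some of the structural statements in this list can be verified by hand for a single replica, $R = 1$. The following minimal sketch builds $Q = T\sigma_3 T^{-1}$ from an arbitrary unitary $2\times 2$ matrix and checks $Q^2 = 1$, $\mathrm{tr}\,Q = 0$ and the invariance under $T \to Th$ with diagonal $h \in \mathrm{U}(1)\times\mathrm{U}(1)$. The parametrization of $T$ and the helper names are illustrative assumptions.

```python
import cmath
import math

SIGMA3 = [[1, 0], [0, -1]]
IDENTITY = [[1, 0], [0, 1]]

def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(a):
    """Hermitian conjugate of a 2x2 matrix."""
    return [[complex(a[j][i]).conjugate() for j in range(2)] for i in range(2)]

def unitary(alpha, beta, gamma):
    """A unitary 2x2 matrix parametrized by three (arbitrary) angles."""
    c, s = math.cos(beta / 2), math.sin(beta / 2)
    return [[cmath.exp(1j * (alpha + gamma) / 2) * c, -cmath.exp(1j * (alpha - gamma) / 2) * s],
            [cmath.exp(-1j * (alpha - gamma) / 2) * s, cmath.exp(-1j * (alpha + gamma) / 2) * c]]

def q_matrix(t):
    """Q = T sigma_3 T^{-1}, using T^{-1} = T^dagger for unitary T."""
    return matmul(matmul(t, SIGMA3), dagger(t))

def max_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))

T = unitary(0.3, 1.1, -0.7)
h = [[cmath.exp(0.4j), 0], [0, cmath.exp(-1.3j)]]  # commutes with sigma_3

Q = q_matrix(T)
Q_after_local_rotation = q_matrix(matmul(T, h))
```

Here `Q` and `Q_after_local_rotation` coincide, which is the $R = 1$ version of the statement that the field only depends on the coset $T\,H$.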

In a series of famous papers, Pruisken 32 extended the formalism above so as to account for 
the effect of a strong magnetic field. It turned out that a key player in the action of that 


29 Here we are using the reduced variant of the model (see Problem 8.8.4) suitable to compute the product of a 
retarded and an advanced Green function. 

30 Due to the presumed presence of a massive magnetic field, time reversal symmetry is broken and the introduction 
of a “time-reversal space” unnecessary. 

31 By “generalized” we mean that $\phi$ can be a matrix in ar space as well as in replica space. 

32 For a review, see A. M. M. Pruisken, Field theory, scaling and the localization problem, in The Quantum Hall 
Effect, ed. R. E. Prange and S. M. Girvin (Springer-Verlag, 1987). 

generalized field theory was a certain variant of a $\theta$-term. As with many other examples 
before, there are two ways to obtain this action: one may generalize the derivation of the 
$\sigma$-model discussed in Section 6.5 for the presence of a strong magnetic field and derive the 
$\theta$-term from first principles. For the details of this not entirely straightforward program we 
refer to the original papers. Alternatively, one may guess the structure of the generalized 
action on the basis of symmetry arguments and fix the coupling constants by running some 
consistency checks. It is this second strategy that we shall pursue shortly. 

As discussed in Chapter 7, elements of the conductivity tensor $\sigma_{\mu\nu}$ can be obtained by 
a two-fold differentiation, $\delta^2_{A_\mu A_\nu} Z[A]$, of the partition or generating function with respect 
to some generalized vector potential. Referring below for a more detailed discussion, all we 
shall aim for presently is to understand what kind of action is needed to produce a finite Hall 
conductivity $\sigma_{12} \sim \delta^2_{A_1 A_2} Z[A]$. By symmetry, the mixed derivative computed on the field 
theory defined by Eq. (9.22) vanishes (a point that warrants some consideration). Rather, 
we have to look out for an operator comprising the two long derivatives $\partial_1 Q$ and $\partial_2 Q$ in a 
single local expression. For example, we might contemplate a term like $\int \mathrm{tr}(\partial_1 Q\,\partial_2 Q)$. This 
expression, however, is again not permitted by symmetry. The reason is that (unlike the 
rotationally invariant two-derivative operator in Eq. (9.22)) it is not form invariant under 
rotation of the coordinate axes (again something to think about). However, the next obvious 
choice, 

$$S_{\rm top}[Q] = \theta\,\epsilon_{\mu\nu}\int d^2x\,\mathrm{tr}(Q\,\partial_\mu Q\,\partial_\nu Q), \qquad (9.23)$$

does the job. For one thing, this term is rotationally invariant. Secondly, the definition of 
the long derivatives $\partial_\mu$ implies that it contains terms linear in the combination $A_1 A_2$, from 
where we conclude that the (as yet undetermined) coupling constant $\theta$ must have something 
to do with the Hall conductivity. 

To be somewhat more specific, let us draw on Problem 7.6.4 where it has been shown 
that a source-vector potential suitable for the calculation of the conductance takes the form 
$A_\mu = U^{-1}\partial_\mu U$ with $U = \exp\big(i(x_1\kappa_1\sigma_1^{\rm ar} + x_2\kappa_2\sigma_2^{\rm ar}) \otimes E^{11}\big)$ (here, $E^{11}$ is a projector onto the 
first replica channel and $\kappa_\mu$ are numbers). With this choice, 

on = lim —^—7 d 2 K I n Z, 012 = lim — I n Z. 

-R ->0 47T.L 2 KiKi 'k=o R^oAmL 2 KiK 2 'k=o 

We also know that the conventional Drude theory of a weakly disordered metal is obtained 
by setting $Q = \sigma_3^{\rm ar}$. (Remember that fluctuations around the origin of the field space, $\sigma_3^{\rm ar}$, 
describe mechanisms of localization, i.e. quantum effects beyond the Drude picture.) We 
next use this information to determine the coupling constants of the theory. To this end, 
we go to the Drude level (set $Q = \sigma_3^{\rm ar}$), substitute the source $A_\mu$ into our long derivatives, 
and evaluate the action. This leads to (exercise) 

$$S[Q = \sigma_3^{\rm ar}] = 2\pi L^2 D\nu\,(\kappa_1^2 + \kappa_2^2) + 16iL^2\theta\,\kappa_1\kappa_2.$$

Evaluation of the derivatives above on $Z[A]|_{\rm Drude} \sim \exp(-S[\sigma_3^{\rm ar}])$ then readily leads to the 
result $\sigma_{11} = 2\pi D\nu$ for the longitudinal Drude conductance $\sigma_{11}$ (which we knew anyway), 
and to the suspected identification $\theta = -\sigma_{12}/8$ of the coupling constant $\theta$ with the (Drude) 
Hall conductance. Summarizing, we obtain Pruisken’s action of the IQHE, 

$$S[Q] = \frac{1}{8}\int d^2x\,\left[\sigma_{11}\,\mathrm{tr}(\partial_\mu Q\,\partial_\mu Q) - \sigma_{12}\,\epsilon_{\mu\nu}\,\mathrm{tr}(Q\,\partial_\mu Q\,\partial_\nu Q)\right]. \qquad (9.24)$$
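The numerical factors of the Drude-level action can be checked with plain Pauli-matrix algebra. The sketch below rests on two assumptions that are not spelled out in the text: that, to lowest order in $\kappa$, the source reduces to the constant matrices $A_\mu = i\kappa_\mu\sigma_\mu$ in the first replica channel, and that on the constant configuration $Q = \sigma_3$ the long derivative acts as $\partial_\mu Q \to [A_\mu, Q]$. Helper names are illustrative.

```python
SIGMA1 = [[0, 1], [1, 0]]
SIGMA2 = [[0, -1j], [1j, 0]]
SIGMA3 = [[1, 0], [0, -1]]

def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(2)] for i in range(2)]

def trace(a):
    return a[0][0] + a[1][1]

def long_derivative(kappa, sigma):
    """[A_mu, Q] at Q = sigma_3 for the assumed constant source A_mu = i kappa sigma."""
    source = [[1j * kappa * x for x in row] for row in sigma]
    return commutator(source, SIGMA3)

k1, k2 = 0.3, 0.7
d1 = long_derivative(k1, SIGMA1)
d2 = long_derivative(k2, SIGMA2)

# tr(d_mu Q d_mu Q): multiplies pi nu D L^2 / 4 in the gradient term.
gradient_trace = trace(matmul(d1, d1)) + trace(matmul(d2, d2))

# eps_{mu nu} tr(Q d_mu Q d_nu Q): multiplies theta L^2 in the topological term.
topological_trace = (trace(matmul(SIGMA3, matmul(d1, d2)))
                     - trace(matmul(SIGMA3, matmul(d2, d1))))
```

Under these assumptions `gradient_trace` equals $8(\kappa_1^2 + \kappa_2^2)$ and `topological_trace` equals $16i\kappa_1\kappa_2$, which reproduces the coefficients $2\pi L^2 D\nu$ and $16iL^2\theta$ quoted above.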

What makes the identification $\theta \sim \sigma_{12}$ more interesting is that $\theta$ (and therefore the Hall 
conductance) actually plays the role of a topological angle; the second term in Eq. (9.24) is 
a topological term. That such a term might be present in our field theory follows from the 
fact that 

$$\pi_2\big(\mathrm{U}(2R)/(\mathrm{U}(R) \times \mathrm{U}(R))\big) = \mathbb{Z}.$$

Unfortunately, we do not have the mathematical background to prove this result. 33 We can, 
however, make it plausible; and we can demonstrate that $S_{\rm top}$ above is a representation of 
the corresponding $\theta$-term. 

To this end, let us temporarily focus on a single replica channel ($a = 1$, say) and consider 
the field configuration 

$$Q^{11}(x) = U(x)\,\sigma_3^{\rm ar}\,U^{-1}(x) = \mathbf{n}(x)\cdot\boldsymbol{\sigma}^{\rm ar},$$
$$Q^{ab}(x) = \delta^{ab}\,\sigma_3^{\rm ar}, \qquad (a,b) \neq (1,1),$$

where $U(x) \in \mathrm{U}(2)$. The second equality in the first line defines a unit vector $\mathbf{n}(x)$. It 
expresses the fact that the projection of the field space onto a single replica channel is 
isomorphic to $S^2$. 33 Evaluating $S_{\rm top}$ on this particular field configuration, it is 
straightforward to verify that $S_{\rm top}[Q] = 4i\theta\int d^2x\,\mathbf{n}\cdot(\partial_1\mathbf{n}\times\partial_2\mathbf{n})$, an expression we 
identified earlier (see Eq. (9.8)) as the topological term of a two-dimensional field theory 
on the sphere. We can now generalize from our particular $Q$ to field configurations 
$T(x)Q(x)T^{-1}(x)$ where $T(x) \in \mathrm{U}(2R)$. However, using arguments similar to those employed 
in Section 9.3.1, one may convince oneself that small variations $T$ will not change the value of 
the action $S_{\rm top}$. Indeed, $S_{\rm top}$ is the general $\theta$-term on the coset space $\mathrm{U}(2R)/(\mathrm{U}(R) \times \mathrm{U}(R))$. 
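The integer hidden in the expression $\int d^2x\,\mathbf{n}\cdot(\partial_1\mathbf{n}\times\partial_2\mathbf{n})$ can be made visible numerically. The sketch below evaluates $(1/4\pi)\int d^2x\,\mathbf{n}\cdot(\partial_1\mathbf{n}\times\partial_2\mathbf{n})$ on a finite grid for one example configuration; the skyrmion profile (inverse stereographic projection), the grid parameters, and the sign convention for the winding number are illustrative assumptions.

```python
import math

def skyrmion(x, y):
    """Unit vector n(x, y) from inverse stereographic projection:
    n points 'up' at the origin and 'down' at infinity."""
    d = 1.0 + x * x + y * y
    return (2 * x / d, 2 * y / d, (1.0 - x * x - y * y) / d)

def uniform(x, y):
    """Topologically trivial reference configuration."""
    return (0.0, 0.0, 1.0)

def winding_number(field, half_width=15.0, cells=200, eps=1e-4):
    """(1/4 pi) int d^2x n . (d_1 n x d_2 n) over a square of side 2*half_width,
    using the midpoint rule and central finite differences of step eps."""
    h = 2 * half_width / cells
    total = 0.0
    for i in range(cells):
        x = -half_width + (i + 0.5) * h
        for j in range(cells):
            y = -half_width + (j + 0.5) * h
            n = field(x, y)
            d1 = [(p - q) / (2 * eps)
                  for p, q in zip(field(x + eps, y), field(x - eps, y))]
            d2 = [(p - q) / (2 * eps)
                  for p, q in zip(field(x, y + eps), field(x, y - eps))]
            cross = (d1[1] * d2[2] - d1[2] * d2[1],
                     d1[2] * d2[0] - d1[0] * d2[2],
                     d1[0] * d2[1] - d1[1] * d2[0])
            total += sum(a * b for a, b in zip(n, cross)) * h * h
    return total / (4 * math.pi)

W_skyrmion = winding_number(skyrmion)
W_trivial = winding_number(uniform, cells=20)
```

Up to the small deficit caused by the finite integration region, `W_skyrmion` is equal to one, while the uniform configuration gives zero. Smooth deformations of the profile change the integrand but not the integer.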

Pruisken’s field theory: long-range physics 

Now that we have “derived” the Pruisken action, the next question to ask is what to do with it. To 
begin with the bad news, the long-distance behavior of the model is still pretty much unknown 
and, in fact, a subject of ongoing research. (That this is the state of affairs some 20 years after 
its derivation signals the fact that we are dealing with a very rich field theory.) However, by 
investing one’s physical insight in the quantum Hall problem (and with a little bit of good will) 
quite a few things about the model can, nonetheless, be said. 

Suppose we were dealing with a system of annular geometry, similar to that discussed in 
the last section. Let us further assume that the Fermi energy lies in between the center of the 
$n$th and $(n+1)$th Landau bands, so that there are no delocalized states at $E_F$. As discussed 
in the previous section, the bulk of the sample is then pretty much impervious to external 

33 Notice, however, that in the special case $R = 1$, $\mathrm{U}(2)/(\mathrm{U}(1) \times \mathrm{U}(1)) \cong \mathrm{SU}(2)/\mathrm{U}(1) \cong S^2$ is the 2-sphere. What 
is non-trivial here is the generalization of $\pi_2(\mathrm{U}(2)/(\mathrm{U}(1) \times \mathrm{U}(1))) = \pi_2(S^2) = \mathbb{Z}$ to general $R$. 

perturbations, and the interesting Fermi-energy physics takes place at the boundaries (the 
inner and outer perimeters). 

Indeed, it is possible to rewrite the topological action as a pure boundary operator. 
A straightforward application of Stokes’ theorem 34 shows that 

$$S_{\rm top}[T] = \frac{\sigma_{12}}{2}\oint_{\partial M} d\mathbf{s}\cdot\mathrm{tr}\big(T\sigma_3^{\rm ar}\nabla T^{-1}\big), \qquad (9.25)$$

where the integral runs over the boundary $\partial M$ of the annulus. Notice, however, that 
Eq. (9.25) is not represented in terms of the fundamental degrees of freedom of the theory 
(the $Q$s) but rather relies on a particular “coordinate” representation ($Q = T\sigma_3^{\rm ar}T^{-1}$). As 
discussed in Section 9.4, this is not accidental but rather reflects a fundamental property of 
topological terms. For future reference, we also anticipate that the boundary descendant of 
the bulk $\theta$-term Eq. (9.23) is an example of a Wess-Zumino term. 

We next show that, under the conditions stated above, the boundary representation of 
the theory raises a consistency problem unless $\sigma_{12} = \mathrm{integer}$. To appreciate the problem, 
recall that the bulk representation of the theory is invariant under local transformations 
$T(x) \to T(x)h(x)$, where $h \in H = \mathrm{U}(R) \times \mathrm{U}(R)$. This follows trivially from the fact that 
its degrees of freedom $Q = T\sigma_3^{\rm ar}T^{-1} \to Th\sigma_3^{\rm ar}h^{-1}T^{-1} = Q$ are invariant. Now, under 
our present working conditions - Fermi energy in a mobility gap - there are no $Q$-field 
excitations in the bulk of the system (the $Q$s describe mobile Fermi energy excitations), and 
the theory reduces to two decoupled boundary theories which must be separately invariant. 
In this respect, the only contribution that may potentially cause trouble is the topological 
boundary contribution Eq. (9.25) (since it is not a functional of the invariant degree of 
freedom, $Q$). Indeed, let us consider the specific choice $h = \exp(i\sigma_3^{\rm ar}\theta)$, where $\theta$ is the 
azimuthal coordinate of the system. Focusing on the effect of this transformation at the 
inner perimeter, say, we obtain 

$$S_{\rm top}[T] = \frac{\sigma_{12}}{2}\int_0^{2\pi} d\theta\,\mathrm{tr}\big(T\sigma_3^{\rm ar}\partial_\theta T^{-1}\big) \;\to\; \frac{\sigma_{12}}{2}\int_0^{2\pi} d\theta\,\mathrm{tr}\big(Th\sigma_3^{\rm ar}\partial_\theta(h^{-1}T^{-1})\big)$$
$$= S_{\rm top}[T] + \frac{\sigma_{12}}{2}\int_0^{2\pi} d\theta\,\mathrm{tr}\big(\sigma_3^{\rm ar}h\,\partial_\theta(h^{-1})\big) = S_{\rm top}[T] + \frac{\sigma_{12}}{2}\int_0^{2\pi} d\theta\,\partial_\theta\,\mathrm{tr}\big(\sigma_3^{\rm ar}\ln h^{-1}\big)$$
$$= S_{\rm top}[T] + \frac{\sigma_{12}}{2}\,\mathrm{tr}\big(\sigma_3^{\rm ar}\ln h^{-1}\big)\Big|_0^{2\pi} = S_{\rm top}[T] - 2\pi i\,\sigma_{12}R. \qquad (9.26)$$

The invariance of the exponentiated (!) boundary action requires that $\exp(2\pi i R\,\sigma_{12}) = 1$, 
from which we conclude that $\sigma_{12} = \mathrm{integer}$. Summarizing, we have seen that, for Fermi 
energies in between two Landau band centers (that is, conditions where QH plateaus are 
experimentally observed), the intrinsic consistency of the theory requires quantization of 
the Hall conductance. A refined variant of such arguments (see Pruisken’s aforementioned 
article) shows that the value of the conductance indeed coincides with the number of Landau 
levels below the Fermi energy. Notice also that the reasoning above is again “topological.” 
This time, the key players are the winding numbers of the mappings $S^1 \to \mathrm{U}(1) \times \mathrm{U}(1)$ from 
the boundary manifold (topologically, a circle) into the projection of the local transformation 
group to a single replica channel ($\mathrm{U}(1) \times \mathrm{U}(1)$). 

34 For a review of the general form of Stokes’ theorem, see Section 9.4.2 below. 

EXERCISE As an exercise in group manipulations, extend the argument given above to more general 
transformations at the boundary. To this end, consider a general (but single-valued!) boundary 
transformation $h(\theta) \in H$ and show that we will not obtain information beyond the quantization 
criterion discussed above. Hint: Use the facts that $\mathrm{U}(r) = \mathrm{U}(1) \times \mathrm{SU}(r)$, and that 
$\forall R \in \mathrm{SU}(r)$: $0 = \ln\det R = \mathrm{tr}\ln R$. 

INFO The argument above focuses on the boundaries of the system. As a brief digression, 
let us show how the same information can be obtained from a bulk picture. In the bulk, all 
Fermi energy states are localized. Technically, this implies that the longitudinal conductance 
renormalizes to zero, $\sigma_{11} \xrightarrow{L \gg \xi} 0$, where the notation is meant to indicate that, in a 
renormalization group sense, the conductance scales to zero on length scales $L \gg \xi$ much larger than the 
two-dimensional localization length. An alternative formulation of the same fact is to say that 
transformations $\psi(x) \to T(x)\psi(x)$, where $T(x) \in \mathrm{U}(2R)$ fluctuates on scales $\gg \xi$, must leave 
the theory invariant. 35 For $\sigma_{11} \to 0$, and frequency differences $\omega \to 0$, the non-topological sector 
of the action vanishes and is therefore trivially oblivious to fluctuations of the $Q$s. However, 
this is not the case with the topological action. No matter how slowly it fluctuates in space, a 
topologically non-trivial configuration $Q(x)$ will have topological action $S[Q] = 2\pi i\,\sigma_{12}n$, where 
$n$ is the corresponding winding number. The only way to make the theory generally impervious 
to $Q$-field fluctuations is to require $\sigma_{12} = \mathrm{integer}$ - the bulk variant of the quantization criterion. 

Quantum Hall transition 

Having understood the basic quantization phenomenon, we might now ask what other 
features of the quantum Hall system can be extracted from the field theory approach. Given 
that the nonlinear $\sigma$-model arguably represents the most powerful approach to disordered 
electron systems in general, we might, for example, be ambitious enough to seek more 
information about the nature of the delocalized states expected to reside at the centers of the 
Landau bands. As we saw above, these states do play a vital role in the formation of the 
QHE. However, as long as they are deeply buried below the Fermi energy, they influence the 
system rather indirectly. We should expect them to become much more vivid as the Fermi 
energy sweeps through the center of a Landau band. Indeed, as $E_F \to (n + 1/2)\omega_c$ approaches a 
band center, the system begins to build up long-ranged correlations. This is because the 
physics at the Fermi energy is now controlled by ever more extended, or delocalized, states. 
Formally, $E_F \to (n + 1/2)\omega_c$ goes along with a diverging correlation length $\xi$, a phenomenon 
indicative of a second-order phase transition. Indeed, $E^* = (n + 1/2)\omega_c$ marks the position 
of a very peculiar (and still not fully understood) quantum phase transition, the quantum 
Hall transition. 


35 Indeed, we had argued earlier that metallic behavior goes along with a spontaneous breakdown of replica rotation 
symmetry. Our $Q$s are the Goldstone modes of this phenomenon. Conversely, localization is accompanied by 
a restoration of this symmetry; the Goldstone modes disappear and the theory will no longer respond to 
fluctuations of the $Q$s (or the $T$s for that matter). Further, notice that we are now concentrating on the “large” 
transformation group of the model, $\mathrm{U}(2R)$, and not the “small” invariance group $\mathrm{U}(R) \times \mathrm{U}(R)$. 

Figure 9.9 On the behavior of the longitudinal conductance, Hall conductance, and density of 
states, $\rho$, at the quantum Hall transition. 


In Fig. 9.9, the behavior of the three most relevant players in the system, longitudinal 
conductance, Hall conductance, and density of states, is shown as functions of the Fermi 
energy. In the vicinity of the Landau band centers the system becomes critical. Right at 
a transition point, $E_F = E^*$, (a) the longitudinal conductance $\sigma_{11} = 1/2$, (b) the Hall 
conductance $\sigma_{12} = n + 1/2$ is half integer, and (c) the correlation length $\xi$ characterizing the 
spatial profile of wavefunctions at the Fermi energy has diverged. The latter implies that, 
for any system of finite size $L$, there is a whole range of energies around the critical value 
for which $\xi > L$. Within this energy range, the wavefunctions are effectively delocalized. 
Naturally, the width of the band of delocalized states shrinks upon increase of the system 
size and, in the thermodynamic limit, approaches zero. More precisely, upon approaching 
the band center, the correlation length diverges as 

$$\xi \sim |E - E^*|^{-\nu},$$

where $\nu$ defines the correlation length exponent and we have introduced $\Delta E = |E - E^*|$ as 
a relevant scaling variable. For a given system size, the width of the energy band, $\Delta E$, of 
delocalized states is determined by the condition $L \sim \xi(\Delta E)$, or $\Delta E \sim L^{-1/\nu}$. The number 
of states within that window scales as $N \sim \Delta E/\delta \sim L^{2 - 1/\nu}$, where $\delta \sim L^{-2}$ is the 
two-dimensional level spacing. Unfortunately, there is still no reliable analytical prediction for $\nu$. 
(We shall see in a moment why this is so.) However, high-precision numerical analyses 36 have 
shown that $\nu = 2.35 \pm 0.08$. This implies that the number of states within the delocalized 
region diverges in the thermodynamic limit: even though the width of the delocalized energy 
window approaches zero, it hosts a continuum of extended states. 
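The competition between the shrinking window and the growing density of levels is easy to tabulate. The following minimal sketch uses the scaling relations quoted above with all non-universal prefactors set to one (an assumption made purely for illustration) and the numerical estimate of $\nu$ cited in the text.

```python
import math

NU = 2.35  # numerical estimate of the correlation length exponent quoted above

def window_width(L, nu=NU):
    """Width Delta E ~ L^(-1/nu) of the band of effectively delocalized states."""
    return L ** (-1.0 / nu)

def states_in_window(L, nu=NU):
    """N ~ Delta E / delta with level spacing delta ~ L^(-2), i.e. N ~ L^(2 - 1/nu)."""
    return window_width(L, nu) / L ** (-2.0)

# Effective growth exponent of N, extracted from two system sizes.
exponent = (math.log(states_in_window(1000.0)) - math.log(states_in_window(10.0))) \
    / (math.log(1000.0) - math.log(10.0))

for L in (10.0, 100.0, 1000.0):
    print(f"L = {L:7.0f}   Delta E ~ {window_width(L):.4f}   N ~ {states_in_window(L):.1f}")
```

The extracted `exponent` equals $2 - 1/\nu$, which is positive for any $\nu > 1/2$: the window closes, yet the number of states inside it grows with system size.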

We shall next explore the extent to which these features can be understood from the field 
theory approach. To this end, let us imagine the partition function formally expanded in 


36 See, e.g., B. Huckestein, Scaling theory of the integer quantum Hall effect, Rev. Mod. Phys. 67 (1995), 357-96. 

terms of the topological index of field configurations: 

$$Z = \sum_W e^{2\pi i W\sigma_{12}}\,Z_W,$$

where $Z_W$ is the partition function reduced to the sector of fields of winding number $W$, 
and $S_{\rm top} = 2\pi i W\sigma_{12}$ enters as a topological phase. In general, there is not much we can say 
about $Z_W$, other than that it will be small: $|Z_{W \neq 0}| \ll 1$. Indeed, an estimate similar to the 
one employed on page 509 yields (exercise) 

$$0 \le \frac{1}{2}\int d^2x\,\mathrm{tr}\big[(\partial_\mu Q + i\epsilon_{\mu\nu}Q\,\partial_\nu Q)(\partial_\mu Q + i\epsilon_{\mu\lambda}Q\,\partial_\lambda Q)\big]$$
$$= \int d^2x\,\mathrm{tr}\big[\partial_\mu Q\,\partial_\mu Q - i\epsilon_{\mu\nu}Q\,\partial_\mu Q\,\partial_\nu Q\big] = \int d^2x\,\mathrm{tr}(\partial_\mu Q\,\partial_\mu Q) - 16\pi W.$$

From here, we conclude that the non-topological contribution to the action obeys the 
inequality 

$$S^0[Q] = \frac{\sigma_{11}}{8}\int d^2x\,\mathrm{tr}(\partial_\mu Q\,\partial_\mu Q) \ge 2\pi W\sigma_{11},$$

and that $Z_W \sim \exp(-2\pi W\sigma_{11})$ is weighted by a small “energetic” factor. We have had 
ample opportunity to see that such factors can, in principle, be compensated for by large 
“entropic” counterweights. However, Pruisken has shown that, in the present context, this 
does not happen. 

Similarly, the functional expectation values for longitudinal and Hall conductance can be 
organized in an instanton series: 

w = 0 

The coefficients appearing in this series depend - in the exponentially sensitive
way discussed above - on the longitudinal Drude conductance. Importantly, the right-
hand side of the series depends on the Drude values $\sigma_{ij}^0$, while the left-hand side sets
the true renormalized conductance (i.e. the second-order derivative of the functional with
respect to a generalized source-vector potential). There is not much more we can say
about the structure of these series except for one important symmetry criterion: under
a change of orientation of the coordinate system, $\sigma_{12}$ changes sign while $\sigma_{11}$ does not.
Specifically, 

$$\sigma_{12}(-B) = -\sigma_{12}(B), \qquad \sigma_{11}(B) = +\sigma_{11}(-B),$$

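which is an example of an Onsager relation. Consistency with these relations requires
that the topological series be of the form

$$\sigma_{11} = \sigma_{11}^0 + \delta\sigma_{11} + \sum_{W=1}^{\infty} \cos(2\pi W \sigma_{12}^0)\, a^{(W)}(\sigma_{11}^0),$$
$$\sigma_{12} = \sigma_{12}^0 + \sum_{W=1}^{\infty} \sin(2\pi W \sigma_{12}^0)\, b^{(W)}(\sigma_{11}^0), \qquad (9.27)$$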
where $a^{(W)}$ and $b^{(W)}$ are expansion coefficients. The notation emphasizes that, in the
topologically trivial sector, $W = 0$, the longitudinal conductance may be subject to
renormalization while $\sigma_{12}$ remains unrenormalized. 37

In view of these structures, let us now speculate a little on the renormalization charac- 
teristics of the model. Here, renormalization means, as usual, that we consider the outcome 
of the theory on ever increasing length scales. We have to keep in mind, however, that 
this program must be carried out for each topological sector separately: renormalization, 
i.e. the successive elimination of fast fluctuations, cannot change the topological index of a 
field configuration. On the same footing a topological angle (presently, the coefficient $\sigma_{12}^0$
of the topological term) should not renormalize, at least not in a conventional sense. (For
the cautious formulation, see below.) 

Consider, then, the flow $(\sigma_{12}(\lambda), \sigma_{11}(\lambda))$ in the two-dimensional parameter plane defined by
longitudinal and Hall conductance as we increase a reference length scale $\lambda$. From Eq. (9.27) it
follows that there are two families of lines on which $\sigma_{12}$ does not renormalize:
$\sigma_{12} = \sigma_{12}^0 \in \mathbb{N}$ and $\sigma_{12} = \sigma_{12}^0 \in \mathbb{N} + 1/2$. As to the first line, we do not expect any critical
behavior in the vicinity of integer $\sigma_{12}^0$. (Remember that the quantum Hall transition is observed
at $(\sigma_{12}, \sigma_{11}) = (n + 1/2, 1/2)$, $n \in \mathbb{N}$.) Rather we expect that, in the vicinity of integer Hall
conductances, the system behaves pretty much like an ordinary two-dimensional electron gas.
This implies (cf. our discussion above) that the Goldstone modes become gapped, $\sigma_{11}$ scales to 0
and $\sigma_{12}$ scales to an integer value: for $(\sigma_{12}, \sigma_{11})$ close to a line $\sigma_{12} = n \in \mathbb{N}$, the system will
flow towards the fixed point $(n, 0)$. A special situation arises for $\sigma_{12} = \sigma_{12}^0 \in \mathbb{N} + 1/2$. Again, the Hall
conductance does not renormalize, but now we are sitting on a critical surface (namely,
$E = E^*$, corresponding to half-integer Hall conductance). At the transition point, the
correlation length has diverged and the system has become metallic. Accordingly we should
expect the longitudinal conductance to scale towards a finite fixed point value $\sigma_{11}^*$. Indeed,
experimentally, one observes $\sigma_{11}^* = O(1)$ at the transition points. The two-parameter flow
diagram discussed above was proposed in a seminal paper by Khmelnitskii. 38

To theoretically understand the transition behavior, Pruisken and collaborators have 
derived renormalization group equations that take the lowest two topological sectors, W = 



Renormalization cannot change the prefactor of the topological term, $\sigma_{12}^0$. The contribution of the topologically
trivial sector to the conductance is just this coefficient.

D. E. Khmelnitskii, Quantization of Hall conductivity, JETP Lett. 38 (1983), 552-6. 
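0, 1, into account:

$$\beta_{11} \equiv \frac{\partial \sigma_{11}}{\partial \ln L} = -\frac{1}{2\pi^2 \sigma_{11}} - c\, \sigma_{11} e^{-2\pi\sigma_{11}} \cos(2\pi\sigma_{12}),$$
$$\beta_{12} \equiv \frac{\partial \sigma_{12}}{\partial \ln L} = -c\, \sigma_{11} e^{-2\pi\sigma_{11}} \sin(2\pi\sigma_{12}), \qquad (9.28)$$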


where $c > 0$ is a numerical constant. 39 These equations indeed have a family of fixed points
$(\sigma_{12}^*, \sigma_{11}^*)$, where $\sigma_{11}^* = O(1)$ and $\sigma_{12}^* \in \mathbb{N} + 1/2$. In the vicinity of these points, $\sigma_{12}$ ($\sigma_{11}$) is
a relevant (irrelevant) scaling variable.
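
To get a feeling for the flow described above, one may integrate a pair of two-parameter flow equations numerically. The following is only a minimal sketch: it assumes the dilute-instanton form $\beta_{11} = -1/(2\pi^2\sigma_{11}) - c\,\sigma_{11} e^{-2\pi\sigma_{11}}\cos(2\pi\sigma_{12})$, $\beta_{12} = -c\,\sigma_{11} e^{-2\pi\sigma_{11}}\sin(2\pi\sigma_{12})$, picks the arbitrary illustrative value c = 5 (chosen by us so that a fixed point exists on the half-integer line), and uses a plain Euler step. The function names and the cutoff `s11_min` are hypothetical, and the equations themselves are not trustworthy at small $\sigma_{11}$.

```python
import math

def beta(s11, s12, c):
    """Assumed beta functions (beta_11, beta_12) in the dilute-instanton form."""
    instanton = c * s11 * math.exp(-2.0 * math.pi * s11)
    b11 = -1.0 / (2.0 * math.pi ** 2 * s11) - instanton * math.cos(2.0 * math.pi * s12)
    b12 = -instanton * math.sin(2.0 * math.pi * s12)
    return b11, b12

def flow(s11, s12, c=5.0, ln_L_max=60.0, d_ln_L=1e-3, s11_min=0.05):
    """Euler-integrate d(sigma)/d(ln L); stop early once sigma_11 < s11_min."""
    steps = int(ln_L_max / d_ln_L)
    for _ in range(steps):
        if s11 < s11_min:
            break  # the flow equations are not meaningful this deep in the insulator
        b11, b12 = beta(s11, s12, c)
        s11 += d_ln_L * b11
        s12 += d_ln_L * b12
    return s11, s12

if __name__ == "__main__":
    for start in (0.3, 0.5, 0.7):
        s11, s12 = flow(1.0, start)
        print(f"sigma_12(0) = {start}: sigma_11 -> {s11:.3f}, sigma_12 -> {s12:.3f}")
```

Within this sketch, initial values of $\sigma_{12}$ below (above) 1/2 drift towards 0 (1) while $\sigma_{11}$ decreases, whereas a start exactly at $\sigma_{12} = 1/2$ stays on the critical line and $\sigma_{11}$ settles at a value of order one that depends on the assumed constant c.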

Summarizing, Pruisken’s approach appears to predict a parameter flow as shown in the
two-parameter phase diagram above. This diagram nicely conforms with experimental
observations but, alas, there are some problems. For one thing, the interesting physics takes
place in the vicinity of a fixed point value $\sigma_{11}^* = O(1)$, well outside the regime of applicability
of the $\sigma$-model as such. (Remember that the derivation of this model hinges on $\sigma_{11} \gg 1$.) One may
trust in the principle that “good” models (and Pruisken’s model of the QHE certainly is
good!) usually produce meaningful results even in parameter regimes where they no longer
stand on safe ground. However, with the current problem, the chances are that this principle
does not apply. Indeed, we have come across a very similar situation before: for large spin
$S \gg 1$, an antiferromagnetic spin chain is described by an O(3) nonlinear $\sigma$-model. For
topological angles $\theta = \pi$ (corresponding to half-integer spin) the model is critical and flows
towards some strong coupling fixed point. The important observation now is that, in the
vicinity of this point, the system is described no longer by an O(3) nonlinear $\sigma$-model
but by an altogether different model: a field theory on the group manifold SU(2) with a
topological term of Wess-Zumino-Witten type. Intriguingly, on its journey towards the
strong coupling fixed point, the model manages to enlarge its field manifold from the coset
space $\mathrm{O}(3)/\mathrm{O}(2) \simeq S^2$ of the O(3) nonlinear $\sigma$-model to the larger space SU(2) (locally isomorphic to O(3))
of the fixed point theory.

Now, when projected onto a single replica channel, Pruisken’s $\sigma_{12} = 1/2$ theory indeed
reduces to an O(3) nonlinear $\sigma$-model with topological angle $\theta = \pi$. It is, therefore, quite
conceivable that, at strong coupling, the general model also flows towards a target model 
with a larger field manifold. Although nothing rigorous is known, a scenario to this effect 
has been outlined in the literature. At any rate, the applicability of Pruisken’s RG equations 
in the vicinity of their fixed points remains questionable. 


INFO There is one other problem which should not be swept under the carpet: as discussed 
above, the absence of Goldstone modes in the localized phase requires the coupling constant 
of the topological term to be integer. This coupling constant is set by the Drude conductance
$\sigma_{12}^0$ and should not renormalize. In a way, we have to require that, for Fermi energies which lie
between Landau levels, the Drude Hall conductance must already be integer, lest the theory run
into a consistency problem. On the other hand, topological criteria do not require the physical
conductance $\sigma_{12}$ to be integer. This is exactly the opposite of our physical picture. We should


One might have expected the topological angle $\sigma_{12}^0$, and not the physical conductance $\sigma_{12}$, as an argument of
the transcendental functions (sine and cosine). For a discussion of why the physical conductance appears there,
we refer to Pruisken’s article (Pruisken, in The Quantum Hall Effect).
expect that for any value of the Drude conductance (away from $\mathbb{N} + 1/2$) the system will flow
towards an integer value of the physical conductance. (This is the flow illustrated in the
two-parameter flow diagram above.) Essentially, the problem boils down to the fact that the coupling
constant of the topological term does not renormalize. (If it did, it should flow towards an integer
value which could then justly be interpreted as the physical conductance.) To deal with this
difficulty, Pruisken subjects the fields to a transformation $Q \to U Q U^{-1}$, where $U$ are the source
fields introduced above, and then discusses the structure of the theory after the fields $Q$ have
been integrated out. This produces consistent results, but the procedure is highly implicit and, 
therefore, remains somewhat mysterious. 

Summarizing, Pruisken’s field theory microscopically explains many of the intriguing aspects 
of the quantum Hall effect. Notably, it elucidates the interplay of disorder scattering and 
topological mechanisms (spectral flow!) in the formation of the effect. Contrary to early 
expectations, though, it may well be that the actual quantum Hall transition lies outside 
the scope of the model. At any rate, a rigorous identification of the universality class of this 
transition, not to mention a quantitative calculation of its critical exponents, remains an 
open problem. 

This concludes our discussion of $\theta$-terms in condensed matter field theory. In our brief
survey, we were unable to discuss one of the most exciting applications of $\theta$-terms in field
theory in general: ’t Hooft’s concept of $\theta$-vacua, and its relevance to understanding some of
the most intriguing observed features of matter - CP- and T-violation. (However, the
energetic reader is strongly encouraged to turn to a textbook on theoretical particle physics to
learn more of this subject.) Rather, we proceed now to discuss another large and important
family of topological field theories.


9.4 Wess-Zumino terms

Almost every time that we met with a $\theta$-term in the previous chapter, a field theory with a
Wess-Zumino (WZ) term 40 was just around the corner. Yet most condensed matter
physicists appear to be only vaguely familiar with the ideas behind WZ field theory (in contrast
to, say, the much more widely appreciated concept of $\theta$-terms). Perhaps the most important
reason for this lack of appreciation is that the general meaning of a WZ term is difficult to 
grasp in the “traditional” languages familiar to condensed matter physicists. On the other 
hand, the relevant concepts become quite transparent once we venture to reformulate a few 
elements of field theory in the language of modern differential geometry. This reformulation 
will be the subject of the first half of this section. Not assuming any background knowledge, 
we begin with a crash course in differentiation on manifolds and exterior calculus. (Readers 
familiar with differential forms are invited to skip this introduction.) In Section 9.4.2 we 


As to the terminology of Wess-Zumino terms, there is no generally accepted convention. Historically, these
topological terms first appeared in the work of Wess and Zumino. However, owing to Witten’s seminal analysis of
SU(N)-invariant chiral fermion systems (see below), they are often referred to as Wess-Zumino-Witten (WZW)
terms. Yet another designation (especially popular in the Russian community) is Wess-Zumino-Novikov-Witten
(WZNW) terms. Except for the discussion of SU(N)-symmetric systems below, we stick here to the short variant
of WZ terms.
Figure 9.10 On the construction of coordinate representations of field manifolds. For a discussion,
see the main text. 


then explain the general ideas behind WZ field theory before we turn to the discussion of a 
number of interesting applications. 


9.4.1 A crash course in differential geometry

Figure 9.10 reiterates the mathematical backbone underlying most of our field theories: a
field $\phi : M \to T$, $x \mapsto \phi(x)$ is a mapping from a base-space $M$ to a field space $T$. In practice,
we mostly tend to nonchalantly identify field values $\phi \in T$ and their coordinate
representations $z(\phi) \in \mathbb{R}^n$ ($n$ is the dimension of $T$). However, especially when it comes to the
discussion of topological aspects, we must be very careful with such premature
identifications. The point is that topologically non-trivial field spaces usually cannot be represented
in terms of one globally defined system of coordinates. Take the sphere $S^2$ as an example.
You may choose the standard representation in terms of two angles $(\theta, \phi)$, a stereographic
projection onto a single complex variable $z$, or any other parameterization. Inevitably, there
will be regions on $S^2$ where the mapping “$S^2 \to$ coordinates” becomes ill-defined. One may
object that this ambiguity, manifesting itself only at a set of “measure zero,” 41 cannot be
of much practical significance. Yet, with regard to topology this is not the case.


Coordinate representations 

Given the importance of these singularities in the context of topological field theory, it is 
pertinent this time to discuss the construction of proper coordinate representations with 
mathematical rigor: the basis of each coordinate system is formed by a system of open 


41 For example, in a system of polar coordinates, the problematic region is a line connecting the north and the 
south pole of the sphere. 
subsets $U_i \subset T$ chosen so that the union $\bigcup_i U_i = T$ covers $T$. 42 One next defines coordinate
mappings $z_i : U_i \to V_i \subset \mathbb{R}^n$ from the patches $U_i$ onto some open subsets $V_i$ of $\mathbb{R}^n$. The value
$z_i(p) \in \mathbb{R}^n$ is a coordinate representation of the point $p \in T$. To make this representation
unique, we require the $z_i$s to have an inverse.

The central point is that, on our manifold, there will be non-vanishing overlaps $U_i \cap U_j \neq \{\}$,
i.e. points $p \in T$ that have more than one coordinate representation (see Fig. 9.10).
Ambiguities between the different systems are excluded by requiring that the maps
$z_i \circ z_j^{-1} : z_j(U_i \cap U_j) \to z_i(U_i \cap U_j)$ be diffeomorphisms (invertible and differentiable). Conceptually,
the functions $z_i \circ z_j^{-1}$ mediate the change between different coordinates. This being so, they
must be as benign (differentiable) as possible. Notice, however, that it would be senseless
to require the $z_i$s themselves to be differentiable, simply because in general $T \not\subset \mathbb{R}^n$ and the
notion of differentiability need not even exist on $T$.

In the jargon of differential geometry, the coordinate maps $z_i$ are called charts of $T$ while
a fully covering collection $\{z_i\}$ is called an atlas. The existence of a proper atlas is, by
definition, equivalent to the statement that $T$ is a differentiable manifold.

Example Consider $T = S^1$, the unit circle. The one-dimensional manifold $S^1$ has a natural
embedding into $\mathbb{R}^2$: $S^1 = \{\mathbf{x} \in \mathbb{R}^2 \,|\, \mathbf{x}^2 = 1\}$. We need a minimum of two charts to cover $S^1$.
For example, one may use

$$z^{-1} : (0, 2\pi) \to S^1, \qquad z^{-1}(\phi) = (\cos\phi, \sin\phi),$$
$$z'^{-1} : (0, 2\pi) \to S^1, \qquad z'^{-1}(\phi) = (\cos(\phi + \pi), \sin(\phi + \pi));$$

$z$ and $z'$ cover all of $S^1$ except for the points $(1, 0)$ and $(-1, 0)$, respectively.
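
The two-chart atlas of the circle can be made concrete in a few lines. The sketch below is ours, not part of the original example: the function names `z`, `z_prime`, `z_inv`, `z_prime_inv` and `transition` are hypothetical, and the charts are implemented with `atan2` under the assumption that points are given as pairs (x, y) on the unit circle. It illustrates that each chart misses exactly one point and that the transition function on the overlap is a shift by $\pm\pi$.

```python
import math

def z_inv(phi):
    """Inverse chart z^{-1}: (0, 2*pi) -> S^1; its image misses the point (1, 0)."""
    return (math.cos(phi), math.sin(phi))

def z_prime_inv(phi):
    """Inverse chart z'^{-1}: (0, 2*pi) -> S^1; its image misses the point (-1, 0)."""
    return (math.cos(phi + math.pi), math.sin(phi + math.pi))

def z(point):
    """Chart z: returns the angle in (0, 2*pi); undefined at (1, 0)."""
    x, y = point
    phi = math.atan2(y, x) % (2.0 * math.pi)
    if phi == 0.0:
        raise ValueError("(1, 0) is not covered by the chart z")
    return phi

def z_prime(point):
    """Chart z': returns the shifted angle in (0, 2*pi); undefined at (-1, 0)."""
    x, y = point
    phi = (math.atan2(y, x) - math.pi) % (2.0 * math.pi)
    if phi == 0.0:
        raise ValueError("(-1, 0) is not covered by the chart z'")
    return phi

def transition(phi):
    """Transition function z' o z^{-1} on the overlap of the two charts."""
    return z_prime(z_inv(phi))

if __name__ == "__main__":
    print(transition(0.5) - 0.5)  # close to +pi on the upper half of the overlap
    print(transition(4.0) - 4.0)  # close to -pi on the lower half of the overlap
```

On each of the two connected components of the overlap the transition function is a smooth (indeed affine) map, as required of an atlas.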

Tangent space 

Having discussed the coordinate representations of (field) manifolds, we are now in a
position to lift elements of standard calculus (differentiation, integration, etc.)
from $\mathbb{R}^n$ to manifolds. (Later, we identify topological
terms as integrals over certain differentials on $T$.) We
begin by introducing the tangent space $T_pT$ as a locally
flat approximation to the manifold $T$ at a point $p \in T$ (see the figure). We then use this
planar approximation to describe how functions defined on $T$ vary in the neighborhood of
$p$. To construct the tangent space, consider a curve $\gamma : \mathbb{R} \to T$ with $\gamma(0) = p$. It is tempting
(see the figure) to define a vector $\mathbf{v}_p$ tangent to $T$ at $p$ by setting $\mathbf{v}_p = d_s \gamma(s)|_{s=0}$. However,
this “definition” is problematic because, in general, $T \not\subset \mathbb{R}^m$, so that $\mathbf{v}_p$ is not a decent
vector. Nonetheless, the idea above is not far from the truth. To make it suitable, let us
consider some function $f \in C(U)$ defined on an open neighborhood $U \ni p$. (Here, $C(X)$



Of course, one can exercise much freedom in the choice of the $U_i$s. For example, for the sphere, one might define
$U_{1,2}$ to be any two overlapping “caps” whose union covers $T$. Notice that overlaps are, in fact, unavoidable as
we want to cover a compact set $T$ by open subsets $U_i$.
denotes the space of smooth, real-valued functions defined on $X$.) We may then employ
our curve $\gamma$ to compute the directional derivative

$$\mathbf{v}_\gamma(f) \equiv d_s\big|_{s=0} f(\gamma(s)).$$

The notation on the left-hand side indicates that we have constructed a mapping that
takes functions as arguments and produces a number. 43 We next define this mapping to
be a tangent vector at point $p$. (Notice that the assignment “curve $\mapsto$ tangent vector”
constructed in this way is not unique: two curves $\gamma_1$ and $\gamma_2$ tangent 44 to each other at $p$ will
produce the same directional derivative, $\mathbf{v}_{\gamma_1} = \mathbf{v}_{\gamma_2}$.) The set of all directional derivatives
formed in this way defines the tangent (vector) space $T_pT$ at point $p$. 45

The definition of “vectors” given above may seem strange. (If you wait a while, its utility
will become apparent!) However, given a coordinate function $z$ we can meet the standard
identification “vector $\leftrightarrow$ $n$-component object” familiar from linear algebra. This is achieved
by writing

$$\mathbf{v}_\gamma(f) := d_s\big|_{s=0} f\big(z^{-1} \circ z(\gamma(s))\big) = \sum_{i=1}^{n} \partial_i f\, v_\gamma^i,$$

where $\partial_i f \equiv \partial_{z^i}(f \circ z^{-1})$ is the ordinary partial derivative of the function $f \circ z^{-1} : V \subset \mathbb{R}^n \to \mathbb{R}$
and $v_\gamma^i = d_s|_{s=0}\, z^i(\gamma(s))$. We define $v_\gamma^i \in \mathbb{R}$ to be the $i$th component of $\mathbf{v}_\gamma$ (in
the coordinate representation defined by $z$). Notice that the components of a tangent
vector $v^i$ can alternatively be obtained as

$$v^i = \mathbf{v}(z^i),$$

i.e. as the directional derivative of the $i$th component of the coordinate function. (Here, we
have simplified the notation by omitting the superscript reference to the curve $\gamma$.) Relatedly,
a coordinate system induces a natural basis $\{e_i\}$ of the tangent space $T_pT$. This basis is defined
by

$$\forall f \in C(U) : \quad e_i(f) = \partial_i f.$$

Suppose that the reference point $p$ is represented by two coordinate functions $z$ and $z'$.
It is then straightforward to verify (exercise!) that the components of the tangent vector
transform as $v'^i = \frac{\partial z'^i}{\partial z^j}\, v^j$.
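
The coordinate expressions above are easy to check numerically. The following sketch is an illustration under our own assumptions: the manifold is the punctured plane, the two charts are Cartesian coordinates (`z`) and polar coordinates (`z_prime`), the curve `gamma` is an arbitrary choice, and all derivatives are approximated by central differences. It computes the components $v^i = d_s|_{s=0} z^i(\gamma(s))$ in both charts and compares them with the transformation law $v'^i = (\partial z'^i/\partial z^j) v^j$.

```python
import math

def gamma(s):
    """An arbitrary curve through p = gamma(0) = (1, 1), in Cartesian coordinates."""
    return (1.0 + s, 1.0 + 2.0 * s + s * s)

def z(point):
    """Cartesian chart (the identity map on the plane)."""
    return point

def z_prime(point):
    """Polar chart (r, theta), valid away from the origin and the negative x-axis."""
    x, y = point
    return (math.hypot(x, y), math.atan2(y, x))

def components(chart, curve, h=1e-6):
    """v^i = d/ds chart^i(curve(s)) at s = 0, by central differences."""
    plus, minus = chart(curve(h)), chart(curve(-h))
    return [(a - b) / (2.0 * h) for a, b in zip(plus, minus)]

def jacobian(f, point, h=1e-6):
    """J[i][j] = d f^i / d x^j at `point`, by central differences."""
    n = len(point)
    cols = []
    for j in range(n):
        up = list(point); up[j] += h
        dn = list(point); dn[j] -= h
        fu, fd = f(tuple(up)), f(tuple(dn))
        cols.append([(a - b) / (2.0 * h) for a, b in zip(fu, fd)])
    return [[cols[j][i] for j in range(n)] for i in range(len(cols[0]))]

v = components(z, gamma)               # components in the chart z
v_prime = components(z_prime, gamma)   # components in the chart z'
J = jacobian(z_prime, gamma(0.0))      # dz'^i / dz^j at p
v_transformed = [sum(J[i][j] * v[j] for j in range(2)) for i in range(2)]

if __name__ == "__main__":
    print(v_prime, v_transformed)  # the two lists agree up to discretization error
```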

EXERCISE Compute the basis vectors $e$ and $e'$ corresponding to the two charts forming the
atlas of $S^1$ discussed on page 538. Show that $e = e' = e_\phi$, where $e_\phi$ is the azimuthal vector of a
two-dimensional polar coordinate system, and the natural embedding of $S^1$ into $\mathbb{R}^2$ is understood.


A purist might object that $\mathbf{v}_\gamma(f)$ is just what we had earlier defined to be a functional, so that we should use
square brackets $[f]$ for the argument. However, following standard convention in differential geometry, we shall
here stick to $(f)$.

In differential geometry, one uses this criterion to define the notion of tangency: two curves $\gamma_{1,2}$ through $p$ are
tangent to each other (with the same tangent velocity) if, for all functions $f$, $\mathbf{v}_{\gamma_1}(f) = \mathbf{v}_{\gamma_2}(f)$.
Mathematically, $T_pT$ is isomorphic to the space of all equivalence classes of curves through $p$, where two curves
are called “equivalent” if they are tangent to each other.
The union $TT = \bigcup_{p \in T} T_pT$ of all local tangent spaces is called the tangent bundle of the
manifold $T$. 46 In fact, the tangent bundle is a differentiable manifold by itself. Its elements
are given by $(p, \mathbf{v}_p) \in TT$, where $p \in T$ and $\mathbf{v}_p \in T_pT$. (The dimension of $TT$ is given by
$2n$, twice the dimension of $T$.) A mapping

$$\mathbf{v} : T \to TT, \qquad p \mapsto (p, \mathbf{v}_p),$$

smoothly assigning to each point of $T$ a tangent vector is called a vector field on $T$.

Now, suppose we are given two manifolds $M$ and $T$. (Later on, the role of $M$ will be
played by the base space of field theory, a differentiable manifold by itself.) Further, suppose
that there is a mapping (the field!) $\phi : M \to T$. The definition of tangent spaces above then
implies the existence of an induced mapping, the so-called tangent mapping,

$$\phi_* : TM \to TT, \qquad \mathbf{v} \mapsto \phi_*(\mathbf{v}).$$

The image vector $(\phi_*(\mathbf{v}))_p$ is defined by setting $(\phi_*(\mathbf{v}))_p(f) = \mathbf{v}_p(f \circ \phi)$, for any
function $f$ defined in a neighborhood of $\phi(p) \in T$ (see Fig. 9.11). Suppose we are given a system
of coordinates $w$ around $p \in M$ and $z$ around $\phi(p) \in T$. It is then straightforward to show
that the coordinate representation of the vector $(\phi_*(\mathbf{v}))_p$ is given by

$$(\phi_*(\mathbf{v}))_p^i = \frac{\partial \phi^i}{\partial w^j}\bigg|_p v_p^j, \qquad (9.29)$$

where $\phi^i = z^i(\phi)$ is a shorthand for the coordinate representation of $\phi$. The formula above
explains why the mapping $\phi_*$ is sometimes referred to as the differential of the mapping $\phi$. It
illustrates the general rule that everything taking place on the tangent spaces is a measure
of local (or “infinitesimal”) variations.
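
As a hedged illustration of the tangent mapping, the sketch below checks numerically that the Jacobian rule for the components of the image vector reproduces the defining property "image vector acting on f equals the original vector acting on f composed with the field". Everything specific in it is an assumption made for the example: the map `phi` from a two-dimensional base to a three-dimensional target, the test function `f`, the point `p` and the vector `v` are arbitrary, and derivatives are taken by central differences.

```python
import math

def phi(w):
    """Coordinate representation phi^i(w) of a hypothetical field M -> T."""
    w1, w2 = w
    return (w1 * w1 - w2, math.sin(w1) + w2 * w2, w1 * w2)

def f(zpt):
    """An arbitrary test function on the target, written in the coordinates z."""
    a, b, c = zpt
    return a * b + math.cos(c)

def partial(func, x, j, h=1e-6):
    """Central-difference approximation of d func / d x^j at the point x."""
    up = list(x); up[j] += h
    dn = list(x); dn[j] -= h
    return (func(tuple(up)) - func(tuple(dn))) / (2.0 * h)

def pushforward(v, w):
    """Components (phi_* v)^i = (d phi^i / d w^j) v^j at the point w."""
    m = len(phi(w))
    return [
        sum(partial(lambda x, i=i: phi(x)[i], w, j) * v[j] for j in range(len(w)))
        for i in range(m)
    ]

p = (0.3, -0.7)   # a point of the base manifold, in coordinates w
v = (1.5, -0.4)   # components of a tangent vector at p

image = pushforward(v, p)
# (phi_* v)(f): the image vector acting on f at phi(p)
lhs = sum(image[i] * partial(f, phi(p), i) for i in range(3))
# v(f o phi): the original vector acting on the composed function at p
rhs = sum(v[j] * partial(lambda x: f(phi(x)), p, j) for j in range(2))

if __name__ == "__main__":
    print(lhs, rhs)  # agree up to discretization error
```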


The authors hate the double-T notation TT, too. However, the prefix T for tangent is a ubiquitous standard, and 
we want to keep emphasizing that the apparatus introduced here will be later applied to the target manifolds 
of field theory, T. 
Differential forms

A 1-form $\omega_p$ is a linear mapping $\omega_p : T_pT \to \mathbb{R}$ (i.e. an element of the dual space of the
vector space $T_pT$). Smoothly extending $\omega_p$ to a map $\omega$ globally defined on $TT$, we obtain a
so-called differential 1-form (or, for brevity, just the 1-form). We will denote the space of
1-forms on $T$ by $\Lambda^1(T)$. A number of important remarks on these definitions are in order:

▷ Most 1-forms that we shall encounter in practice are realized as differentials of
functions: for $f \in C(T)$ we define the differential $df$ by

$$df_p(\mathbf{v}_p) = \mathbf{v}_p(f).$$

(Exercise: Convince yourself that, for manifolds $T \subset \mathbb{R}^n$, this reduces to the standard
definition of the differential with which you are familiar.)

▷ However, not every 1-form is a differential of a function. Consider, for example, the tangent
basis of $T = S^1$ constructed in the exercise on page 539. Let us define a 1-form by setting
$\omega(e) = \omega(e') = 1$, where the equality holds in the domain of overlap of the two charts.
This 1-form cannot be represented as the differential of a single-valued function on $S^1$
(exercise: why?).

▷ Finally, let us make a formal remark: for reasons that will become clear in a moment,
functions $f \in C(T) \equiv \Lambda^0(T)$ are sometimes referred to as 0-forms. Technically, a 1-form
is a mapping

$$\omega : TT \to \Lambda^0(T), \qquad \mathbf{v} \mapsto \omega(\mathbf{v}),$$

that maps vector fields onto 0-forms. The value of the function $\omega(\mathbf{v})$ at a point $p$ is given
by $\omega(\mathbf{v})(p) = \omega_p(\mathbf{v}_p)$. Alternatively, we can say that the insertion of a vector field into a
1-form lowers the degree of the form from 1 down to 0.

Given a system of local coordinates, $z$, each 1-form can be represented as

$$\omega = f_i\, dz^i, \qquad (9.30)$$

where the coefficient-functions are given by $f_i = \omega(e_i)$ (a result that is instructive to check).

A 1-form maps a single vector field onto a function. However, to describe the geometric
structure of a manifold (distances, surface and volume elements, etc.) we need mappings
that take more than one vector field as input. Mappings of this type are called tensors and
defined as follows: a covariant tensor of rank $r$ is a multi-linear mapping

$$\omega : TT \times \dots \times TT \to \Lambda^0(T), \qquad (\mathbf{v}_1, \dots, \mathbf{v}_r) \mapsto \omega(\mathbf{v}_1, \dots, \mathbf{v}_r).$$

For example, some manifolds admit the definition of a metric. A metric $g$ is a tensor of rank
2 that is positive ($\forall p \in T : g_p(\mathbf{v}_p, \mathbf{v}_p) \ge 0$) and non-degenerate ($\forall \mathbf{w}_p : g_p(\mathbf{v}_p, \mathbf{w}_p) = 0 \Rightarrow \mathbf{v}_p = 0$).
A manifold with a metric is called a Riemannian manifold. We call $\sqrt{g_p(\mathbf{v}_p, \mathbf{v}_p)}$
the length of the tangent vector $\mathbf{v}_p$. For later reference, we note that, in a system of local
coordinates, $g$ has the representation $g = g_{ij}\, dz^i \otimes dz^j$. Here, $g_{ij} = g(e_i, e_j)$ is the metric
tensor of the manifold.

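EXERCISE Show that, in polar coordinates $(r, \theta, \phi)$, the standard metric of $\mathbb{R}^3$,
$g = dx^1 \otimes dx^1 + dx^2 \otimes dx^2 + dx^3 \otimes dx^3$, assumes the form

$$g = dr \otimes dr + r^2\, d\theta \otimes d\theta + r^2 \sin^2\theta\, d\phi \otimes d\phi. \qquad (9.31)$$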
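
The result of this exercise can be cross-checked numerically. The sketch below is not a substitute for the analytical derivation; it assumes the standard embedding $x = r\sin\theta\cos\phi$, $y = r\sin\theta\sin\phi$, $z = r\cos\theta$, approximates the Jacobian by central differences, and evaluates the metric coefficients in polar coordinates as sums over products of Jacobian entries. The function names are ours.

```python
import math

def cartesian(q):
    """Standard embedding of polar coordinates q = (r, theta, phi) into R^3."""
    r, theta, phi = q
    return (
        r * math.sin(theta) * math.cos(phi),
        r * math.sin(theta) * math.sin(phi),
        r * math.cos(theta),
    )

def induced_metric(q, h=1e-6):
    """g_ab = sum_i (dx^i/dq^a)(dx^i/dq^b), with derivatives by central differences."""
    n = 3
    jac = [[0.0] * n for _ in range(n)]  # jac[i][a] = dx^i / dq^a
    for a in range(n):
        up = list(q); up[a] += h
        dn = list(q); dn[a] -= h
        xu, xd = cartesian(tuple(up)), cartesian(tuple(dn))
        for i in range(n):
            jac[i][a] = (xu[i] - xd[i]) / (2.0 * h)
    return [
        [sum(jac[i][a] * jac[i][b] for i in range(n)) for b in range(n)]
        for a in range(n)
    ]

if __name__ == "__main__":
    r, theta, phi = 2.0, 0.7, 1.1
    for row in induced_metric((r, theta, phi)):
        print([round(entry, 6) for entry in row])
    # compare with the diagonal entries 1, r^2, r^2 sin^2(theta)
    print(1.0, r * r, (r * math.sin(theta)) ** 2)
```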

However, by far most important in practical applications are tensors that are fully
antisymmetric in their arguments: a $p$-form $\omega$ is a tensor that changes sign under odd permutations
of its arguments: $\omega(\mathbf{v}_{S(1)}, \dots, \mathbf{v}_{S(p)}) = \mathrm{sgn}(S)\, \omega(\mathbf{v}_1, \dots, \mathbf{v}_p)$, where $S$ is an element of the
permutation group. The space of all $p$-forms on $T$ is denoted by $\Lambda^p(T)$. Given a $p$-form $\omega$
and a $q$-form $\xi$ we can produce a $(p + q)$-form $\omega \wedge \xi$ by the following rule:

$$(\omega \wedge \xi)(\mathbf{v}_1, \dots, \mathbf{v}_{p+q}) = \frac{1}{p!\, q!} \sum_S \mathrm{sgn}(S)\, \omega(\mathbf{v}_{S(1)}, \dots, \mathbf{v}_{S(p)})\, \xi(\mathbf{v}_{S(p+1)}, \dots, \mathbf{v}_{S(p+q)}).$$

The operation $\wedge : \Lambda^p(T) \times \Lambda^q(T) \to \Lambda^{p+q}(T)$ is called the exterior product of forms. (In
passing we note that $\wedge$ defines the product of the Grassmann algebra $\Lambda(T) = \bigoplus_{p=0}^{n} \Lambda^p(T)$
which we met already briefly in Section 4.1.)

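EXERCISE Verify the following features: (a) $\omega \wedge \omega = 0$ if $\omega \in \Lambda^p$ and $p$ is odd, (b) $\omega \wedge \xi = (-)^{pq}\, \xi \wedge \omega$,
and (c) $\omega \wedge (\xi \wedge \eta) = (\omega \wedge \xi) \wedge \eta$, permitting us to write just $\omega \wedge \xi \wedge \eta$ without brackets.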
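
The properties listed in this exercise can be tested on examples before proving them. The sketch below implements the exterior product at a single point directly from the permutation-sum definition, with the normalization $1/(p!\,q!)$ assumed; forms are represented as Python functions of tangent vectors, and the helper names `sign`, `wedge` and `covector` are ours. It is meant as a numerical sanity check, not as an efficient implementation.

```python
import math
from itertools import permutations

def sign(perm):
    """Sign of a permutation of 0..n-1, computed by counting inversions."""
    s = 1
    for i in range(len(perm)):
        for j in range(i + 1, len(perm)):
            if perm[i] > perm[j]:
                s = -s
    return s

def wedge(omega, p, xi, q):
    """Exterior product of a p-form omega and a q-form xi (functions of vectors)."""
    norm = math.factorial(p) * math.factorial(q)

    def product(*vs):
        total = 0.0
        for perm in permutations(range(p + q)):
            first = [vs[k] for k in perm[:p]]
            second = [vs[k] for k in perm[p:]]
            total += sign(perm) * omega(*first) * xi(*second)
        return total / norm

    return product

def covector(coeffs):
    """The 1-form v -> sum_i coeffs_i v^i."""
    return lambda v: sum(c * x for c, x in zip(coeffs, v))

# three 1-forms and three vectors in a four-dimensional tangent space (arbitrary choices)
a, b, c = covector((1, 2, 0, -1)), covector((0, 1, 3, 2)), covector((2, 0, 1, 1))
v1, v2, v3 = (1, 0, 2, -1), (0, 3, 1, 1), (2, 1, 0, 1)

ab = wedge(a, 1, b, 1)
bc = wedge(b, 1, c, 1)

if __name__ == "__main__":
    print(wedge(a, 1, a, 1)(v1, v2))                          # (a): vanishes for odd p
    print(ab(v1, v2), wedge(b, 1, a, 1)(v1, v2))              # (b): opposite signs
    print(wedge(ab, 2, c, 1)(v1, v2, v3),
          wedge(a, 1, bc, 2)(v1, v2, v3))                     # (c): associativity
```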

Given a coordinate function $z$, each $p$-form has the unique coordinate representation

$$\omega = \frac{1}{p!}\, \omega_{i_1, \dots, i_p}\, dz^{i_1} \wedge \dots \wedge dz^{i_p}, \qquad (9.32)$$

where $\omega_{i_1, \dots, i_p} = \omega(e_{i_1}, \dots, e_{i_p})$. To see this, notice that our definition of the exterior product
above implies that $(df^1 \wedge \dots \wedge df^p)(\mathbf{v}_1, \dots, \mathbf{v}_p) = \sum_S \mathrm{sgn}(S)\, df^1(\mathbf{v}_{S(1)}) \cdots df^p(\mathbf{v}_{S(p)})$. Also
notice that the coefficients $\omega_{i_1, \dots, i_p}$ are antisymmetric under odd exchange of the indices $i_j$.

This being so, Eq. (9.32) can alternatively be written as

$$\omega = \omega_{I_1, \dots, I_p}\, dz^{I_1} \wedge \dots \wedge dz^{I_p},$$

where, by convention, summations over capitalized indices are ordered, $I_1 < I_2 < \dots < I_p$.
Above we have seen that, computing the differential of a 0-form $f \in \Lambda^0(T)$, we are led to
a 1-form $df \in \Lambda^1(T)$. This principle can be generalized to forms of arbitrary degree: let us
define the exterior derivative $d$ by setting

$$d\omega \equiv \frac{1}{p!}\, \frac{\partial \omega_{i_1, \dots, i_p}}{\partial z^j}\, dz^j \wedge dz^{i_1} \wedge \dots \wedge dz^{i_p}, \qquad (9.33)$$

where the coordinate representation Eq. (9.32) is understood. Obviously, $d\omega \in \Lambda^{p+1}(T)$, i.e.
$d$ can be interpreted as an operator on $\Lambda(T)$ that raises the degree of forms by one. However,
mathematically inclined people will object that it is unclear whether Eq. (9.33), an equation
based on a specific coordinate representation, really represents a proper definition (i.e., is
it clear that the same rule, applied to a different coordinate representation of $\omega$, leads to
the same form $d\omega$?). These worries are addressed in the following exercise.

EXERCISE Show that (a) the definition Eq. (9.33) does not depend on the choice of coordinates,
(b) $d^2 = 0$, and (c) $d(\omega \wedge \xi) = d\omega \wedge \xi + (-)^p\, \omega \wedge d\xi$, where $p$ is the degree of $\omega$.

In the following, two subspecies of differential forms will be of particular interest for us: we
call a differential form $\omega \in \Lambda^p(T)$ closed if $d\omega = 0$. Conversely, $\omega$ is called exact if $\omega = d\xi$,
i.e. if $\omega$ is obtained as the exterior derivative of some $(p - 1)$-form $\xi$. Now, owing to the
identity $d^2 = 0$, every exact form is closed. Yet not every closed form is exact.

EXERCISE Consider the 1-form $\omega$ on $S^1$ discussed on page 541. Since $\omega(e) = \omega(e') = 1$, $\omega$ has
the local (i.e. restricted to individual charts) representation $\omega = d\phi$ or $\omega = d\phi'$. (The coordinates
are those introduced on page 538.) Obviously, $\omega$ is closed. On the other hand, we have seen that
there is no 0-form (function) $f$ such that $\omega = df$; $\omega$ is not exact.
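
A quick numerical experiment makes the failure of exactness tangible: the integral of an exact 1-form around a closed loop vanishes, whereas the angular form integrates to $2\pi$. The sketch below works in the embedding of the circle into the plane and assumes the representation $(x\, dy - y\, dx)/(x^2 + y^2)$ for the angular form; the comparison function $f(x, y) = x^2 y$ and all function names are our own illustrative choices.

```python
import math

def omega(point):
    """Components (omega_x, omega_y) of the angular form (x dy - y dx)/(x^2 + y^2)."""
    x, y = point
    r2 = x * x + y * y
    return (-y / r2, x / r2)

def d_f(point):
    """Components of df for the single-valued function f(x, y) = x^2 y."""
    x, y = point
    return (2.0 * x * y, x * x)

def loop_integral(form, n=20000):
    """Integrate a 1-form once around the unit circle (midpoint rule on n chords)."""
    total = 0.0
    for k in range(n):
        t0 = 2.0 * math.pi * k / n
        t1 = 2.0 * math.pi * (k + 1) / n
        tm = 0.5 * (t0 + t1)
        fx, fy = form((math.cos(tm), math.sin(tm)))
        total += fx * (math.cos(t1) - math.cos(t0)) + fy * (math.sin(t1) - math.sin(t0))
    return total

if __name__ == "__main__":
    print(loop_integral(omega))  # close to 2*pi: no single-valued potential exists
    print(loop_integral(d_f))    # close to 0, as for every exact form
```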


The classification of forms that are closed but not exact is a deep mathematical problem
(the subject of cohomology theory). We shall return to this issue below when we discuss
the geometry of topological terms. However, before doing so, we need to introduce one last
concept of basic differential geometry. Previously, we have seen that a mapping $\phi : M \to T$
between two manifolds induced a mapping from the vector fields on $M$ to those on $T$. In
a very similar manner, $\phi$ gives rise to a mapping $\phi^*$ between forms on the two manifolds.
This so-called pullback is defined by

$$\phi^* : \Lambda(T) \to \Lambda(M), \qquad \omega \mapsto \phi^*(\omega) = \omega \circ \phi_*.$$

For example, for a 1-form $\omega \in \Lambda^1(T)$ and $\mathbf{v} \in TM$, we have $[\phi^*(\omega)](\mathbf{v}) = \omega(\phi_*(\mathbf{v}))$, etc.
Notice that $\phi^*$ maps in a direction opposite to that of $\phi$, hence the name “pullback.”

In a system of local coordinates $w$ and $z$ on $M$ and $T$, respectively, the components of
the 1-form $\phi^*(\omega)$ are given by

$$(\phi^*\omega)_i = [\phi^*(\omega)](e_i) = \omega(\phi_*(e_i)) \overset{(9.29)}{=} \omega_j\, \frac{\partial \phi^j}{\partial w^i},$$

where $\omega_j = \omega(e'_j)$ and $\{e'_j\}$ is the coordinate basis of the tangent space of $T$.


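EXERCISE Check that the coordinate representation of the pullback of a $p$-form is given
by

$$(\phi^*\omega)_{I_1, \dots, I_p} = \omega_{J_1, \dots, J_p}\, \det\left(\frac{\partial(\phi^{J_1}, \dots, \phi^{J_p})}{\partial(w^{I_1}, \dots, w^{I_p})}\right).$$

Also verify the useful formula $\phi^*(\omega \wedge \xi) = \phi^*\omega \wedge \phi^*\xi$, and the commutativity $\phi^* \circ d = d \circ \phi^*$ of a
pullback on an exterior derivative. (If you feel exhausted, just verify that $\phi^* d\omega = d(\phi^*\omega)$ holds
when applied to forms $\omega \in \Lambda^{0,1}(T)$. The general proof proceeds along similar lines but is a bit
more cumbersome.)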
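
For a concrete instance of the determinant formula, consider the map from polar to Cartesian coordinates in the plane and a 2-form $h\, dx \wedge dy$. The sketch below evaluates the pullback on the coordinate basis vectors via the tangent mapping and compares the result with $h$ times the Jacobian determinant, which equals $r$ for this map. The coefficient function `h` and the evaluation point are arbitrary choices, and the Jacobian columns are obtained by central differences.

```python
import math

def phi(w):
    """Map from polar coordinates w = (r, theta) to Cartesian coordinates (x, y)."""
    r, theta = w
    return (r * math.cos(theta), r * math.sin(theta))

def h(point):
    """Coefficient function of the 2-form omega = h dx ^ dy (an arbitrary choice)."""
    x, y = point
    return 1.0 + x * y

def omega(point, u, v):
    """omega = h(x, y) dx ^ dy evaluated on two tangent vectors u, v at `point`."""
    return h(point) * (u[0] * v[1] - u[1] * v[0])

def pushforward_basis(w, j, step=1e-6):
    """phi_*(e_j): the j-th column of the Jacobian, by central differences."""
    up = list(w); up[j] += step
    dn = list(w); dn[j] -= step
    fu, fd = phi(tuple(up)), phi(tuple(dn))
    return tuple((a - b) / (2.0 * step) for a, b in zip(fu, fd))

def pullback_coefficient(w):
    """[phi^* omega](e_1, e_2) = omega(phi_* e_1, phi_* e_2)."""
    return omega(phi(w), pushforward_basis(w, 0), pushforward_basis(w, 1))

w = (2.0, 0.6)
expected = h(phi(w)) * w[0]  # h(phi(w)) times det d(x, y)/d(r, theta) = r

if __name__ == "__main__":
    print(pullback_coefficient(w), expected)  # agree up to discretization error
```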


Indeed, the pullback of forms is a very important operation since it enables us to define ... 
Integration on manifolds

Consider a top-dimensional form $\omega \in \Lambda^n(T)$, i.e. a form whose rank is equal to the
dimensionality of $T$. To begin with, let us assume that $\mathrm{supp}(\omega) \subset U$ (i.e. $\omega_{p \notin U} = 0$), where
$U \subset T$ is the domain of definition of a chart $z$. We then define the integral of $\omega$ over $U$ as

$$\int_U \omega = \int_V (z^{-1})^* \omega. \qquad (9.34)$$

To make this definition somewhat less abstract, notice that $\omega$ can be represented as

$$\omega = f\, dz^1 \wedge \dots \wedge dz^n,$$

where $f$ is a function on $T$ (exercise: why?). The definition above then assumes the form

$$\int_U f(x)\, dz^1 \wedge \dots \wedge dz^n = \int_V f(z)\, dz^1 \wedge \dots \wedge dz^n = \int_V f(z)\, dz^1 \cdots dz^n.$$

Here, we have used the fact that $(z^{-1})^*(f(x)\, dz^1 \wedge \dots \wedge dz^n) = f(x(z))\, dz^1 \wedge \dots \wedge dz^n$. 47 In the
(crucial) second equality, we declare the integral over the product of forms $dz^1 \wedge \dots \wedge dz^n$
in $V \subset \mathbb{R}^n$ to be the ordinary Riemann integral. This identification is meaningful because
$(dz^1 \wedge \dots \wedge dz^n)(\mathbf{v}_1, \dots, \mathbf{v}_n)$ measures the volume of the parallelepiped spanned by the
vectors $\mathbf{v}_1, \dots, \mathbf{v}_n$ (in the standard metric of $\mathbb{R}^n$), i.e. it represents a sensible “volume
element.” 48

To make the definition of the integral complete, we would have to discuss its extension 
to a global integral over the manifold. However, for practical reasons we shall not do 
so: for one thing, we are often enough dealing with manifolds T that admit a 1-chart atlas, 
in which case the definition above is sufficient. Only slightly worse is the situation where T 
can be covered by a single chart except for isolated singular points. (This is the case with, 
e.g., $T = S^1, S^2$ - exercise: why?) Since the integral is oblivious to “sets of measure 0,”
an integral over the domain of integration of such “nearly complete” charts is as good as 
an integral over all of T ; again, the definition above does the job. In the rare cases where 
one is dealing with an unpleasant manifold which does not belong to the two species above, 
one has to work harder and split up the support of the integrand by means of a so-called 
“partition of unity.” However, in view of the relative rarity of such cases, and the fact that 
they are dealt with in every textbook on differential geometry, we limit attention to the 
“local” definition above. 
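
As an illustration of the "nearly complete chart" strategy, the sketch below integrates over the unit sphere using only the chart $(\theta, \phi) \in (0, \pi) \times (0, 2\pi)$, which misses a set of measure zero. It assumes, without deriving it here, that the area form of the unit sphere reads $\sin\theta\, d\theta \wedge d\phi$ in these coordinates, and it replaces the Riemann integral over the chart domain by a midpoint sum. The function name and grid sizes are our own choices.

```python
import math

def integrate_on_sphere(f, n_theta=400, n_phi=400):
    """Midpoint sum of f(theta, phi) sin(theta) dtheta dphi over the chart domain."""
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    total = 0.0
    for a in range(n_theta):
        theta = (a + 0.5) * d_theta
        for b in range(n_phi):
            phi = (b + 0.5) * d_phi
            total += f(theta, phi) * math.sin(theta)
    return total * d_theta * d_phi

# total area and the average of z^2 = cos^2(theta) over the sphere
area = integrate_on_sphere(lambda theta, phi: 1.0)
mean_z2 = integrate_on_sphere(lambda theta, phi: math.cos(theta) ** 2) / area

if __name__ == "__main__":
    print(area, 4.0 * math.pi)   # the single chart reproduces the full area
    print(mean_z2, 1.0 / 3.0)
```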


Do not be confused by the appearance of the same symbols $dz^1 \wedge \cdots$ on both sides of the equations. On the
left-hand side, $dz^i$ is referring to the differential of the $i$th coordinate function $z^i : T \to \mathbb{R}$. On the right-hand
side, $z^i : \mathbb{R}^n \to \mathbb{R}$ simply projects a vector onto its $i$th component.

However, the discussion above sweeps one subtlety under the carpet: under a change of coordinates, $z \to w$,
$\omega = f(z)\, dz^1 \wedge \dots \wedge dz^n \to f(z(w)) \det(\partial z/\partial w)\, dw^1 \wedge \dots \wedge dw^n$. Compatibility with the transition behavior of
integration volume elements, $dz^1 \cdots dz^n \to |\det(\partial z/\partial w)|\, dw^1 \cdots dw^n$, requires $\det(\partial z/\partial w) > 0$. Coordinate
systems with this property are said to have the same orientation. A manifold with an atlas of identically
oriented charts is said to be orientable. (A prominent counterexample is presented by the Möbius strip.) The
definition of the integral above implies that we have chosen a definite orientation.
For future reference, we remark that, if $\phi : T_1 \supset U_1 \to U_2 \subset T_2$ is a diffeomorphism
between open subsets $U_1$ and $U_2$ of two manifolds $T_1$ and $T_2$, then

$$\int_{U_1} \phi^* \omega = \int_{U_2} \omega. \qquad (9.35)$$

This is the generalization of the transformation law familiar from calculus to the integration
on manifolds. (To understand this equation, notice that, if $z : U_2 \to V$ is a chart of $U_2$,
then $z \circ \phi : U_1 \to V$ is one of $U_1$. The statement made by Eq. (9.35) then follows from
Eq. (9.34).)

With the phalanx of definitions above, we are now - at last! - in a position to discuss the 
utility of differential forms in topological field theory. 


9.4.2 From θ- to Wess-Zumino terms 

In Section 9.3, we have seen that, more often than not, the topological phase associated 
with non-vanishing “winding numbers” could be given a representation in terms of an action 
$S_{\rm top}$. However, no guidelines as to existence or non-existence of such representations have 
been given. We begin by discussing some principles behind the formulation of topological 
Lagrangian densities. This will bring us to a position where the connection to Wess-Zumino 
terms can be established. 


The geometry of θ-terms 

Let us first observe that most target manifolds $T$ of topologically non-trivial field theories 
are Riemannian, i.e. come with a natural metric $g$. This may be the case because $T \subset \mathbb{R}^m$ 
is embedded into some $\mathbb{R}^m$ and inherits the natural metric of the latter (e.g. $S^2 \subset \mathbb{R}^3$, etc.); 
or because $T = G/H$ is a coset space of Lie groups 49 (a situation characteristic of problems 
with spontaneous symmetry breaking); or, indeed, for some other reason. 

Most important is the fact that a metric $g$ entails the existence of a canonical top-dimensional 
form $\omega$ on $T$. Here, the attribute “canonical” means the following: locally, each 
top-dimensional form can be represented as $\omega = f(z)\,dz^1 \wedge \cdots \wedge dz^n$, where $z^i$, $i = 1, \dots, n$, 
are coordinate functions, $n$ is the dimension of $T$, and $f$ is some function. In general, the 
form of $f$ depends on the choice of coordinates and cannot be globally specified. However, 
on a Riemannian manifold, a canonical $n$-form with coordinate-invariant definition exists. 
To see this, let $g_{ij}\,dz^i \otimes dz^j$ be the metric. Further, let us define 

$$\omega = \sqrt{g}\,dz^1 \wedge \cdots \wedge dz^n, \tag{9.36}$$

where $g = \det\{g_{ij}\}$ is the determinant of the metric tensor. What makes $\omega$ special is that 
it has the same representation Eq. (9.36) in every coordinate system. 50 


49 Remember that a Lie group is a manifold with the additional structure of a group. As shown in textbooks of 
group theory, the group structure induces a metric. 

50 To see this, recall that, under a change of coordinates $z \to w$, $dz^1 \wedge \cdots \wedge dz^n \to \det A^{-1}\,dw^1 \wedge \cdots \wedge dw^n$, 
where $A = (\partial w/\partial z)$. At the same time, $g_{ij} \to (A^T g A)_{ij}$, i.e. $g^{1/2} \to \det A\; g^{1/2}$. The two determinants cancel 
each other, so that $\omega$ remains form invariant. 

The $n$-form $\omega$ is called the volume element of the manifold 51 and $V_T = \int_T \omega$ is the 
volume of the (compact) Riemannian manifold $T$. Without loss of generality, we can set our 
unit of length so that $V_T = 1$. 


EXERCISE For $d = 3$ and the metric Eq. (9.31), show that $\omega$ assumes the familiar form $\omega = r^2 \sin\theta\, dr \wedge d\theta \wedge d\phi$, 
proportional to the three-dimensional volume element in polar coordinates. 
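
The exercise can be cross-checked numerically. The sketch below builds the metric $g = J^T J$ from the Jacobian $J$ of the embedding $(r, \theta, \phi) \mapsto (x, y, z)$ and evaluates $\sqrt{g}$ as in Eq. (9.36). It assumes that Eq. (9.31), which is not reproduced here, is the flat Euclidean metric written in polar coordinates; the function name `sqrt_det_metric` and the finite-difference step `h` are illustrative choices, not notation from the text.

```python
import numpy as np

def sqrt_det_metric(r, theta, phi, h=1e-6):
    """Return sqrt(det g) for flat 3D space in polar coordinates (r, theta, phi).

    Assumption: the metric is the one induced by the Euclidean embedding,
    g = J^T J, with J the Jacobian of (r, theta, phi) -> (x, y, z).
    """
    def embed(q):
        r_, th, ph = q
        return np.array([
            r_ * np.sin(th) * np.cos(ph),
            r_ * np.sin(th) * np.sin(ph),
            r_ * np.cos(th),
        ])

    q = np.array([r, theta, phi], dtype=float)
    jac = np.empty((3, 3))
    for j in range(3):
        dq = np.zeros(3)
        dq[j] = h
        # Central difference for the j-th column of the Jacobian.
        jac[:, j] = (embed(q + dq) - embed(q - dq)) / (2 * h)

    g = jac.T @ jac  # metric tensor g_ij in the coordinate basis
    return np.sqrt(np.linalg.det(g))
```

For any point away from the coordinate singularities, `sqrt_det_metric(r, theta, phi)` should agree with `r**2 * np.sin(theta)` up to finite-difference error.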


We now claim that, for base and field manifolds of equal dimensionality $\dim M = \dim T = d$, 

$$S_{\rm top}[\phi] = i\theta \int_M \phi^*\omega \tag{9.37}$$

defines a coordinate invariant representation of the θ-term. To get warmed up to 
this abstract representation, we first note that $\phi^*\omega$ is a top-dimensional form on $M$, i.e. 
we really have something to integrate and the notation makes sense. Now, let us consider 
a trivial field configuration $\phi_0(x \in M) = {\rm const}$. In this case, $\phi_0^*\omega = 0$ (exercise: why?) and 
$S_{\rm top}[\phi_0] = 0$, as one should expect. Next, let us assume that $\phi = \phi_1$ is a diffeomorphic (1-1 
and differentiable) covering of $T$. (The presumed existence of such a mapping amounts to 
the statement that we are dealing with a topologically non-trivial field theory.) In this case, 
the transformation law Eq. (9.35) holds and we get 

$$S_{\rm top}[\phi_1] = i\theta \int_M \phi_1^*\omega \overset{(9.35)}{=} i\theta \int_T \omega = 1 \times i\theta.$$

Now consider a mapping $\phi_W$ that covers $T$ $W$ times (i.e. every point $p \in T$ is the image 
of $W$ points $x_i \in M$). One can show that $S_{\rm top}[\phi_W] = iW\theta$, i.e. $S_{\rm top}$ indeed counts 
the winding number of fields $\phi$. Rather than giving the proof of this statement for general 
$W$, let us consider a simple example. Let $M = T = S^2$ and $\phi_W(\phi, \theta) = (W\phi, \theta)$. 52 Then, 
$\omega = \sin\theta\, d\theta \wedge d\phi$, and (check!) $\phi_W^*\omega = W \sin\theta\, d\theta \wedge d\phi$. We thus obtain $V_{S^2} = 4\pi$ and, 
after dividing by $V_{S^2}$ to restore the unit-volume normalization, 
$\frac{i\theta}{V_{S^2}} \int_M \phi_W^*\omega = iW\theta$, in agreement with the general rule. 
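
The winding-number count can be checked numerically. The following sketch represents the map $\phi_W$ through the unit vector $\mathbf{n}(\theta, W\phi)$ and integrates the pulled-back area form $\mathbf{n} \cdot (\partial_\theta \mathbf{n} \times \partial_\phi \mathbf{n})\, d\theta\, d\phi$ over the base sphere, normalized by $V_{S^2} = 4\pi$. The grid sizes, the finite-difference step, and the function name `winding_number` are assumptions made for illustration only.

```python
import numpy as np

def winding_number(W, n_theta=400, n_phi=400, h=1e-5):
    """Approximate (1/4pi) * integral of n.(d_theta n x d_phi n) for phi_W(phi, theta) = (W*phi, theta)."""
    # Midpoint grid on the base manifold M = S^2.
    d_theta = np.pi / n_theta
    d_phi = 2 * np.pi / n_phi
    theta = (np.arange(n_theta) + 0.5) * d_theta
    phi = (np.arange(n_phi) + 0.5) * d_phi
    th, ph = np.meshgrid(theta, phi, indexing="ij")

    def n(th_, ph_):
        # Image point on the target sphere: polar angle th_, azimuth W * ph_.
        return np.stack([
            np.sin(th_) * np.cos(W * ph_),
            np.sin(th_) * np.sin(W * ph_),
            np.cos(th_),
        ], axis=-1)

    # Central differences for the two partial derivatives of n.
    dn_dth = (n(th + h, ph) - n(th - h, ph)) / (2 * h)
    dn_dph = (n(th, ph + h) - n(th, ph - h)) / (2 * h)

    # Pulled-back area density n . (d_theta n x d_phi n); analytically W * sin(theta).
    density = np.einsum("...i,...i->...", n(th, ph), np.cross(dn_dth, dn_dph))
    return density.sum() * d_theta * d_phi / (4 * np.pi)
```

Calling `winding_number(W)` for a few integers `W` should return values close to `W`; a negative `W` reverses the orientation and therefore the sign.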

EXERCISE Let $n_i : S^2 \to \mathbb{R}$, $i = 1, 2, 3$, be the $i$th component of the unit vector defining a 
point $p \in S^2$. We consider the 2-form $\omega = \mathbf{n} \cdot (d\mathbf{n} \wedge d\mathbf{n}) \equiv \epsilon_{ijk}\, n_i\, dn_j \wedge dn_k$. Using the standard 
polar coordinate representation $\mathbf{n}(\theta, \phi)$, show that $\omega$ can alternatively be represented 
as $\omega = \sin\theta\, d\theta \wedge d\phi$, which is the familiar volume element on the sphere (upon choosing an 
orientation so that $z^{-1*}(d\theta \wedge d\phi) = d\theta\, d\phi$). Considering a field $\mathbf{n} : (x, y) \mapsto \mathbf{n}(x, y)$, show that 

$$\int_M \mathbf{n}^*\omega = \int_M \mathbf{n} \cdot (\partial_x \mathbf{n} \times \partial_y \mathbf{n})\, dx\, dy, \tag{9.38}$$

which we identify (again, for a definite orientation) with our earlier representation of the θ-term 
of a field theory with $S^2$-valued fields. 


51 To motivate this terminology, consider a basis in which the metric tensor $g_{ij} = g_i \delta_{ij}$ is diagonal. The volume 
spanned by the (mutually orthogonal) tangent vectors $e_1, \dots, e_n$ is then simply given by the product of their 
lengths, i.e. by $\prod_{i=1}^n \sqrt{g_i} = \sqrt{g}$. This is precisely what we get when we evaluate $\omega(e_1, \dots, e_n)$, i.e. the form $\omega$ 
measures the volume of the domain spanned by its arguments. 

52 Here we have adopted the usual abuse of notation; in principle we should write $(z' \circ \phi_W \circ z^{-1})(\phi, \theta) = (W\phi, \theta)$, 
where $z$ and $z'$ are polar coordinate charts on $M$ and $T$ respectively. 

Figure 9.12 On the insensitivity of the integral representation Eq. (9.37) under field variations. 
For a discussion, see the main text. 


The discussion above shows that, when evaluated on certain reference configurations $\phi_W$, 
the integral Eq. (9.37) yields the winding number $W$. To complete the identification with 
our earlier representations of the θ-term, we have to show that Eq. (9.37) does not change 
under continuous distortions of $\phi_W$ (i.e. that it responds to the topological sector, and 
nothing else). Readers not content with our assertion that this is the case may wish to 
navigate through the following argument (which, as a byproduct, nicely illustrates the power 
of geometric methods in topological field theory). 

INFO To show the invariance of Eq. (9.37) under continuous field deformations, let us consider 
two field configurations $\phi$ and $\phi'$ which can be continuously deformed into each other. More 
specifically, we set $\phi' = \psi \circ \phi$, where $\psi : T \to T$ is different from unity only inside the domain 
$U \subset T$ of some chart, see Fig. 9.12. (This is no serious restriction as, by iterative deformations 
of this type, any field configuration continuously deformable into $\phi$ can be reached.) 

We then have 

$$S_{\rm top}[\phi] - S_{\rm top}[\phi'] = i\theta \int_M \left(\phi^*\omega - (\psi \circ \phi)^*\omega\right) = i\theta \int_M \phi^*(\omega - \psi^*\omega).$$

Now $\omega - \psi^*\omega \in \Lambda^d(T)$ is a $d$-form on $T$ that is non-vanishing only locally (inside $U$). This implies 
its exactness, i.e. the existence of a representation $\omega - \psi^*\omega = d\kappa$, where $\kappa \in \Lambda^{d-1}(T)$. 53 We thus 
find 

$$S_{\rm top}[\phi] - S_{\rm top}[\phi'] = i\theta \int_M \phi^* d\kappa = i\theta \int_M d(\phi^*\kappa) = i\theta \int_{\partial M} \phi^*\kappa = 0,$$

where, in the second equality, we have used the general commutativity of the exterior derivative 
and pullback and, in the third equality, Stokes’ theorem (for a reminder, see below). In the fourth 
equality we have assumed that, on the boundary $\partial M$ of the base manifold (physically, infinity), 
the fields $\phi|_{\partial M}$ approach a constant, so that $\phi^*\kappa = 0$. 

The representation Eq. (9.37) sheds some light on the structure of θ-terms on Riemannian 54 
field manifolds. However, this was, of course, not the only reason for maneuvering 
through the geometric constructions above. Our prime reason for introducing the invariant 
formulation is that it contains the key to understanding the connection to Wess-Zumino 
terms. 


53 The reason is that, locally, $\omega - \psi^*\omega$ has a coordinate representation $\omega - \psi^*\omega = f\, dz_1 \wedge \cdots \wedge dz_n$. With the 
ansatz $\kappa = g\, dz_2 \wedge \cdots \wedge dz_n$, the equation $d\kappa = \omega - \psi^*\omega$ reduces to the differential equation $\partial_1 g = f$ (if the 
special role ascribed to the coordinate direction 1 is irritating, notice that $\kappa$ is not uniquely defined; every 
$\kappa' = \kappa + \eta$ with a closed ($d\eta = 0$) form $\eta$ will do the job just as well), defined on some open interval of $z_1$ (for 
fixed $z_2, \dots, z_n$). This is an ordinary differential equation which can be solved. 

54 In fact, all we need to formulate the construction above is a canonical top-dimensional form, i.e. none of 
our arguments above actually relied on the fact that, in Riemannian geometry, this form happens to be the 


The geometry of Wess-Zumino terms 


In this section, we discuss the general geometric principle common 
to all Wess-Zumino (WZ) terms. We then explore how our previous 
sporadic encounters with WZ field theories fit into this scheme 
and discuss a few more applications. Note that θ-terms appear in 
theories whose field manifold and base manifold are of equal dimensionality: 
$\dim M = \dim T$. In contrast, WZ terms are at home in 
field theories with $\dim M = \dim T - 1$. 55 How can a topological term be constructed that 
relies on this dimensional relation? As we shall see in a moment, the key to the answer lies 
in Stokes’ theorem: 

$$\int_C d\omega = \int_{\partial C} \omega. \tag{9.39}$$

Here, $C \subset M$ is a benign (smooth, orientable, etc. 56) subset of a differentiable manifold $M$, 
$\partial C$ is its boundary, and $d\omega \in \Lambda^n(M)$ is top-dimensional on $M$ (see figure). 


INFO The proof of Eq. (9.39) is beyond the scope of our present discussion and we have to refer to 
textbooks on differential geometry. Nonetheless, a few remarks for readers not familiar with 
Stokes’ theorem may be helpful. First note that $\partial C$ is a manifold by itself, with $\dim \partial C = n - 1$. 
Thus, $\omega$ is top-dimensional on $\partial C$ and can be integrated. To gain some insight into the substance 
of Eq. (9.39) let us consider a few special cases. For example, let $M$ be a three-dimensional 
manifold, $n = 3$. Consider the two-form $\omega = v_1\, dx_2 \wedge dx_3 + v_2\, dx_3 \wedge dx_1 + v_3\, dx_1 \wedge dx_2$ and interpret 
the coefficients $v_i = v_i(x)$ as the components of a three-dimensional vector field $\mathbf{v} = (v_1, v_2, v_3)$. 
With $d\omega = {\rm div}\,\mathbf{v}\; dx_1 \wedge dx_2 \wedge dx_3 = {\rm div}\,\mathbf{v}\; dV$ and the identification $\omega \equiv \mathbf{v} \cdot d\mathbf{S}$, where $d\mathbf{S}$ is the 
two-dimensional “surface element,” we obtain the standard representation of Gauss’ law 

$$\int_M dV\, {\rm div}\,\mathbf{v} = \int_{\partial M} d\mathbf{S} \cdot \mathbf{v}.$$

Similarly, let $M$ be some two-dimensional surface, $n = 2$, and $\omega = v_1\, dx_1 + v_2\, dx_2 + v_3\, dx_3$. 
Identifying the surface element as above, we then have $d\omega = {\rm curl}\,\mathbf{v} \cdot d\mathbf{S}$ and 

$$\int_M d\mathbf{S} \cdot {\rm curl}\,\mathbf{v} = \int_{\partial M} d\mathbf{s} \cdot \mathbf{v},$$

where $\mathbf{v} \cdot d\mathbf{s} \equiv \omega$, $d\mathbf{s}$ is commonly known as a “line element,” and $\int_{\partial M}$ is a line integral: this is 
called Ampère’s law. 


volume element. For example, some field theories with dim(T) = 2 live on manifolds with a so-called symplectic 
structure (which is nothing but a skew-symmetric, non-degenerate 2-form). This form may then take over the 
role of our uj above and, in perfect analogy to our discussion above, gives rise to a topological Lagrangian 
density. 

55 More generally, WZ terms can be constructed if, in dimensions $\dim M + 1$, the manifold $T$ possesses a closed 
differential form. (In the special case $\dim M + 1 = \dim T$, this form will be the volume form of $T$.) 

56 More precisely, $C$ must be a so-called chain on $M$. However, for brevity, we shall keep the sloppy characterization 
above. 

So suppose that we are dealing with a field theory defined on a $d$-dimensional base manifold 
$M$ with $(d + 1)$-dimensional target manifold $T$. Without (too much) loss of generality, 
we may think of $T = S^{d+1}$ and $M = S^d$ as unit spheres. 

INFO Although we have seen that spheres frequently appear as target spaces in field theory, 
the statement above may seem to be too restrictive - for example, what about the many 
field theories with group-valued target manifolds? The rationale behind emphasizing spherical 
target manifolds lies in an important statement of homotopy theory: the integration spaces we 
encounter in field theories with continuous symmetries are usually embedded in some sufficiently 
large ${\rm U}(N)$. However, it can be shown that the topological structure of the group ${\rm U}(N \ge 2)$ 
essentially reduces to that of its subgroup ${\rm SU}(2) \subset {\rm U}(N)$, i.e. mappings into ${\rm U}(N)$ can be 
continuously deformed into mappings into ${\rm SU}(2)$. (If you find this difficult to visualize, consider 
a two-dimensional plane with a circular hole as an example. Any curve in the plane can be 
continuously deformed to a curve on the circle surrounding the hole. In this sense, the circle 
preempts the topological content of the plane. Similar reduction mechanisms exist for higher-dimensional 
manifolds.) However, ${\rm SU}(2) \simeq S^3$ is isomorphic to the 3-sphere. (Representing 
${\rm SU}(2)$ as the group of matrices $g = x_0 \sigma_0 + i \sum_{j=1}^{3} x_j \sigma_j$, $x_\mu \in \mathbb{R}$, the condition $\det(g) = 1$ boils 
down to the relation $\sum_{\mu=0}^{3} x_\mu^2 = 1$.) This argument shows that, as far as topology is concerned, 
the spaces $S^{1,2,3}$ basically exhaust the list of relevant target spaces. 
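
The identification of ${\rm SU}(2)$ with the 3-sphere can be made concrete with a few lines of code. The sketch below builds $g = x_0 \sigma_0 + i \sum_j x_j \sigma_j$ from a real four-vector and lets one verify that $\det g = \sum_\mu x_\mu^2$, so that unit vectors in $\mathbb{R}^4$ correspond to unitary matrices of unit determinant. The names `SIGMA` and `su2_from_point` are illustrative and not part of the text.

```python
import numpy as np

# sigma_0 (identity) and the three Pauli matrices.
SIGMA = [
    np.eye(2, dtype=complex),
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]

def su2_from_point(x):
    """Map a real 4-vector x = (x0, x1, x2, x3) to g = x0*sigma_0 + i*sum_j x_j*sigma_j."""
    x = np.asarray(x, dtype=float)
    return x[0] * SIGMA[0] + 1j * sum(x[j] * SIGMA[j] for j in (1, 2, 3))
```

For a vector `x` normalized to unit length, `su2_from_point(x)` is unitary with determinant 1; for a general `x`, the determinant equals the squared Euclidean norm of `x`.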

As to the base manifold, the identification M = S d is motivated by the compactification 
scheme discussed in Section 9.3 above. 

Notice that the target manifold $T = S^{d+1}$ is Riemannian and, therefore, comes with a 
canonical $(d + 1)$-form $\omega$. 

Given these prerequisites, our strategy will be to utilize, as much as possible, our previous 
understanding of the geometry of the θ-term. To this end, let us interpret $\phi(M)$ as a $d$-dimensional 
submanifold in $T$. Further, let $\Gamma_+ \supset \phi(M)$ be a subset of $T$ on which the 
volume form $\omega = d\kappa_+$ can be represented as the exterior derivative of some $d$-form $\kappa_+$. 

EXAMPLE In fact, we may even assume that $\Gamma_+ = S^{d+1} - \{p\}$ covers all of our sphere except 
for one point $p$. For example, on the 2-sphere $S^2$, the volume form $\omega = \sin\theta\, d\theta \wedge d\phi$ is obtained 
from $\kappa_+ = (1 - \cos\theta)\, d\phi$ as $\omega = d\kappa_+$ everywhere except for the south pole. Similarly, with 
$\kappa_- = -(1 + \cos\theta)\, d\phi$, $\omega = d\kappa_-$ everywhere except for the north pole. Of course, the choice of the 
exclusion point is quite arbitrary. Notice, however, that there is no global representation $\omega = d\kappa$. 
If such a representation existed, ${\rm Vol}(T) = \int_T \omega = \int_T d\kappa = \int_{\partial T} \kappa = 0$, because $T$ is boundaryless, 
i.e. the volume of the sphere would vanish. 
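
The role of the two local potentials $\kappa_\pm$ can be illustrated numerically. In the sketch below, the line integral of $\kappa_+$ around a latitude $\theta = \theta_0$ reproduces the area of the northern cap, the line integral of $\kappa_-$ gives minus the area of the southern cap, and their difference is the full area $4\pi$, which is why no single $\kappa$ can work globally. The function names and grid sizes are assumptions chosen for illustration.

```python
import numpy as np

def loop_integral(theta0, sign, n_phi=1000):
    """Integrate kappa_pm = (sign - cos(theta)) d(phi) once around the latitude theta = theta0.

    sign = +1 selects kappa_+ (regular at the north pole),
    sign = -1 selects kappa_- (regular at the south pole).
    """
    d_phi = 2 * np.pi / n_phi
    coefficient = sign - np.cos(theta0)  # constant along the latitude
    return np.sum(np.full(n_phi, coefficient)) * d_phi

def cap_area(theta_lo, theta_hi, n_theta=2000):
    """Area of the band theta_lo < theta < theta_hi on the unit sphere (midpoint rule)."""
    d_theta = (theta_hi - theta_lo) / n_theta
    theta = theta_lo + (np.arange(n_theta) + 0.5) * d_theta
    return 2 * np.pi * np.sum(np.sin(theta)) * d_theta
```

Comparing `loop_integral(theta0, +1)` with `cap_area(0, theta0)` and `loop_integral(theta0, -1)` with `-cap_area(theta0, np.pi)` is a direct check of Stokes’ theorem on the two patches.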

We now dimensionally extend the field $\phi : M = S^d \to \Gamma_N \subset S^{d+1} = T$ to a mapping 
$\tilde\phi : S_N^{d+1} \to S^{d+1}$ defined on the entire northern hemisphere, $S_N^{d+1}$, of $S^{d+1}$. This is achieved 
in a series of steps: firstly, identify $M = S^d$ as the equator of $S^{d+1}$ (see Fig. 9.13). We then 
introduce a $(d + 1)$-dimensional coordinate representation $(s, x)$, $s \in [0, 1]$, of $S_N^{d+1}$. These 
coordinates are defined in such a way that $(s = 1, x)$ parameterizes the equator $\simeq S^d = M$ 
while $\lim_{s \to 0}(s, x) = P_N$ is the north pole; otherwise, their choice is arbitrary. We finally 
extend our field to a mapping $\tilde\phi : S_N^{d+1} \to \Gamma_N$, $(s, x) \mapsto \tilde\phi(s, x)$. Apart from the obvious 
consistency condition $\tilde\phi(s = 1, x) = \phi(x)$, the choice of this extension is, again, arbitrary. 

Given this setup, let us define 

$$S_{\rm WZ}[\phi] = iC \int_{S_N^{d+1}} \tilde\phi^*\omega \tag{9.40}$$

as a trial candidate of a topological action ($C = {\rm const.}$). Constructed in manifest analogy to 
the θ-term Eq. (9.37), this expression is “topological” in nature (it is scale and reparameterization 
invariant, etc.). Nonetheless, the definition does not look quite trustworthy. Notably, 
we have written “$S_{\rm WZ}[\phi]$” while the right-hand side of the definition involves the extension 
$\tilde\phi$. To see that the integral is, in fact, independent of the particular choice of the extension, 
we use the fact that, on $\tilde\phi(S_N^{d+1}) \subset \Gamma_N$, the volume form can be written as $\omega = d\kappa_N$. This 
implies 

$$S_{\rm WZ}[\phi] = iC \int_{S_N^{d+1}} \tilde\phi^* d\kappa_N = iC \int_{S_N^{d+1}} d(\tilde\phi^*\kappa_N) = iC \int_{M = S^d} \tilde\phi^*\kappa_N = iC \int_M \phi^*\kappa_N.$$

Here we have made use of (i), in the second equality, the commutativity of the pullback 
and exterior derivative, (ii) in the third equality, Stokes’ theorem, and, (iii) in the crucial 
fourth equality, that the integral over $\partial S_N^{d+1} = S^d = M$ depends only on the value of the 
boundary field $\tilde\phi(s = 1, x) = \phi(x)$. This proves the independence of the action $S_{\rm WZ}$ on the 
extension scheme. At the same time, we have obtained the alternative representation 

$$S_{\rm WZ}[\phi] = iC \int_M \phi^*\kappa_N \tag{9.41}$$

of the topological action. This form makes the independence of the action $S_{\rm WZ}$ of the field 
extension manifest. For this, however, a price has had to be paid: Eq. (9.41) involves the 
$d$-form $\kappa_N$ which we saw is tied to a certain coordinate representation (owing to the absence 
of a global representation $\omega = d\kappa$ on the field manifold). In other words, the extension-independent 
representation Eq. (9.41) necessarily involves the choice of a specific coordinate 
system of the field. 


Figure 9.13 On the extension of a field theory defined on a $d$-dimensional base manifold $M \simeq S^d$ 
to a $(d + 1)$-dimensional theory. 

INFO As an example, consider $M = S^1$ and $T = S^2$. Think of the latter as the space of unit 
vectors $\mathbf{n}$. Our field is a mapping $\mathbf{n} : S^1 \to S^2$, $t \mapsto \mathbf{n}(t)$, where $t \in [0, 1]$ and periodic boundary 
conditions $\mathbf{n}(0) = \mathbf{n}(1)$ are understood. Extension to a field $\mathbf{n}(s, t)$ then allows us to represent 
the action $S_{\rm WZ}[\mathbf{n}]$ as (see Eq. (9.38)) 

$$S_{\rm WZ}[\mathbf{n}] = \frac{iC}{4\pi} \int_0^1 ds \int_0^1 dt\; \mathbf{n} \cdot (\partial_s \mathbf{n} \times \partial_t \mathbf{n}).$$

Now, except for the south pole, we can write the volume (or better to say area) form on $S^2$ 
as $\omega = d[(1 - \cos\theta)\, d\phi]$. Then, using the fact that $\mathbf{n}^*[(1 - \cos\theta)\, d\phi] = (1 - \cos\theta(t))\, d\phi(t) = 
(1 - \cos\theta(t))\, \partial_t \phi(t)\, dt$, we obtain the alternative representation 

$$S_{\rm WZ}[\mathbf{n}] = \frac{iC}{4\pi} \int_0^1 dt\, (1 - \cos\theta(t))\, \partial_t \phi(t). \tag{9.42}$$

Notice that this form explicitly uses the coordinate representation $\mathbf{n}(\phi, \theta)$. 
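
That the extended (surface) representation and the coordinate representation Eq. (9.42) agree can be tested numerically. The sketch below picks an arbitrary closed curve $(\theta(t), \phi(t))$ on the sphere, extends it to the north pole by the hypothetical choice $\theta(s, t) = s\,\theta(t)$, $\phi(s, t) = \phi(t)$, and compares the two integrals with the common prefactor $iC$ stripped off. The curve, the extension, and all function names are assumptions made for illustration; any other smooth extension should give the same result.

```python
import numpy as np

def unit_vector(theta, phi):
    """Unit vector n(theta, phi) in the standard polar representation."""
    return np.stack([
        np.sin(theta) * np.cos(phi),
        np.sin(theta) * np.sin(phi),
        np.cos(theta),
    ], axis=-1)

def curve_angles(t):
    """An arbitrary closed curve on S^2, parameterized by t in [0, 1]."""
    theta = 1.0 + 0.3 * np.cos(2 * np.pi * t)
    phi = 2 * np.pi * t
    return theta, phi

def extended_field(s, t):
    """Extension n(s, t): s = 1 is the curve itself, s -> 0 contracts it to the north pole."""
    theta, phi = curve_angles(t)
    return unit_vector(s * theta, phi)

def surface_representation(n_s=300, n_t=300, h=1e-5):
    """(1/4pi) * double integral of n . (d_s n x d_t n) over the unit square."""
    s = (np.arange(n_s) + 0.5) / n_s
    t = (np.arange(n_t) + 0.5) / n_t
    S, T = np.meshgrid(s, t, indexing="ij")
    dn_ds = (extended_field(S + h, T) - extended_field(S - h, T)) / (2 * h)
    dn_dt = (extended_field(S, T + h) - extended_field(S, T - h)) / (2 * h)
    integrand = np.einsum("...i,...i->...", extended_field(S, T), np.cross(dn_ds, dn_dt))
    # The mean over a midpoint grid on [0,1]^2 approximates the double integral.
    return integrand.mean() / (4 * np.pi)

def line_representation(n_t=300, h=1e-5):
    """(1/4pi) * integral of (1 - cos(theta)) * d(phi)/dt along the curve, cf. Eq. (9.42)."""
    t = (np.arange(n_t) + 0.5) / n_t
    theta, _ = curve_angles(t)
    dphi_dt = (curve_angles(t + h)[1] - curve_angles(t - h)[1]) / (2 * h)
    return np.mean((1 - np.cos(theta)) * dphi_dt) / (4 * np.pi)
```

Both functions return the fraction of the sphere's area enclosed between the north pole and the curve, so `surface_representation()` and `line_representation()` should coincide up to discretization error.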

Equations (9.40) and (9.41) are two different representations of the Wess-Zumino action. 
As discussed above, both have their advantages and disadvantages - the need to artifi- 
cially extend the field vs. lack of representation invariance - which is why they are used 
interchangeably in the literature. 

Finally, we need to discuss one further important point: the value of the coupling constant 
$C$. As discussed above, the coupling constant of the θ-term, the topological angle, was quite 
arbitrary. This is not the case with the WZ term. In fact, we shall see that the constant $C$ 
is subject to quantization conditions. To understand why, recall that above we have chosen 
the northern hemisphere $S_N^{d+1}$ as the domain of integration of our extended field theory. Of 
course, the southern hemisphere $S_S^{d+1}$ would have provided just as good a choice. In this 
case, we would have defined 

$$S'_{\rm WZ}[\phi] = -iC \int_{S_S^{d+1}} \tilde\phi'^*\omega,$$

where $\tilde\phi'$ is a field extension to the southern hemisphere, and the extra minus sign takes care 
of the fact that the equator is the boundary of the oppositely oriented southern domain. 
Of course, our definition of the WZ action would be senseless if this ambiguity mattered. 
Therefore, consider the difference 

$$S_{\rm WZ}[\phi] - S'_{\rm WZ}[\phi] = iC \int_{S_N^{d+1}} \tilde\phi^*\omega + iC \int_{S_S^{d+1}} \tilde\phi'^*\omega = iC \int_{S^{d+1}} \tilde\phi^*\omega = iCW,$$

where, in the third expression, the two extensions are combined into a single map of the full 
sphere $S^{d+1}$ into $T$, and $W$ is integer. At first sight, the non-vanishing of this expression looks worrisome. 
However, since the action appears in the exponent no harm is done as long as the difference 
is equal to ($i$ times) an integer multiple of $2\pi$, i.e. the coupling constant $C$ obeys the 
quantization condition 57 

$$C = 2\pi k, \qquad k \in \mathbb{Z}. \tag{9.43}$$
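
The quantization condition can be illustrated for $d = 1$ with a curve running along the latitude $\theta = \theta_0$ of $T = S^2$. With $\omega$ normalized to unit total volume, the northern extension contributes $iC$ times the area fraction of the northern cap, the southern one $-iC$ times the fraction of the southern cap. The sketch below, with illustrative function names, compares the resulting exponentials; which sign convention is used in the exponent does not affect the conclusion.

```python
import numpy as np

def action_north(C, theta0):
    """WZ action from the northern cap: i*C times its share (1 - cos(theta0))/2 of the sphere."""
    return 1j * C * (1 - np.cos(theta0)) / 2

def action_south(C, theta0):
    """WZ action from the southern cap, including the orientation-induced minus sign."""
    return -1j * C * (1 + np.cos(theta0)) / 2

def weights_agree(C, theta0, tol=1e-9):
    """True if exp(S_north) and exp(S_south) coincide, i.e. if the ambiguity is invisible."""
    return abs(np.exp(action_north(C, theta0)) - np.exp(action_south(C, theta0))) < tol
```

The two actions always differ by $iC$, so `weights_agree` returns `True` for $C = 2\pi k$ with integer $k$ and `False` for generic values of $C$.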

The number k is called the level of the WZ theory. To summarize the main results of this 
section, we have found that: 

57 Recall that the volume form $\omega$ underlying the construction has been normalized in such a way that $\int_{S^{d+1}} \omega = 1$. 
For different choices of the normalization, the value of $C$ will change accordingly. 

> The WZ action is closely allied to the θ-term. It affords two different representations 
where: 

> Equation (9.40) involves a dimensional extension of the field, while 

> Equation (9.41) relies on an explicit coordinate representation of the field. 

> The coupling constant of the WZ term is quantized according to Eq. (9.43). 

Having discussed the structure and geometry of WZ theories, we now return to physics. 
Using a simple prototype system as an example, we begin by exploring how WZ terms enter 
low-energy theories of many-body quantum systems. 


9.4.3 Example: magnetic moment coupled to fermions 

Consider a single energy level $\epsilon$ of a spinful fermion system. (One may think, for example, 
of a discrete level of an atom.) Let us assume that fermions inhabiting the level are coupled 
to a classical magnetic moment $\mathbf{n}$. The coherent state action of this system is given by 

$$S[\psi, \mathbf{n}] = \int_0^\beta d\tau\; \bar\psi\, (\partial_\tau + \xi + \gamma\, \mathbf{n} \cdot \boldsymbol{\sigma})\, \psi,$$

where $\gamma$ is a coupling constant and, as usual, $\xi = \epsilon - \mu$. A complete specification of the 
problem would have to include a term $S[\mathbf{n}]$ controlling the dynamics of the uncoupled 
magnetic moment. However, for the purposes of the present discussion, it is sufficient to 
consider the moment-fermion coupling in isolation. 

INFO Actions of this type appear as building blocks of larger systems. For example, once 
generalized to a set of levels $\epsilon_a$, our model might describe a system of atomic shell electrons 
subject to Hund’s rule coupling to a spin or orbital magnetic moment. Alternatively, the 
magnetic moment $\mathbf{n}$ might describe the Hubbard-Stratonovich decoupling of some electron-electron 
interaction in a spinful channel. 

Integration over the fermion degrees of freedom brings us to the reduced action 
$S[\mathbf{n}] = -{\rm tr} \ln(\partial_\tau + \xi + \gamma\, \mathbf{n} \cdot \boldsymbol{\sigma})$. 

To proceed, let us write $\mathbf{n} = R\, \mathbf{e}_3$, where $R \in {\rm SO}(3)$ is a rotation matrix, and then use the 
fact that a matrix $U \in {\rm SU}(2)$ can be found 58 such that 

$$\mathbf{n} \cdot \boldsymbol{\sigma} = (R\, \mathbf{e}_3) \cdot \boldsymbol{\sigma} = U \sigma_3 U^{-1}. \tag{9.44}$$

For example, it is straightforward to verify that, with the standard polar representation, 
$\mathbf{n} = (\sin\theta \cos\phi, \sin\theta \sin\phi, \cos\theta)^T$, the choice 

(9.45) 


58 This is, in fact, just a concrete realization of the correspondence between SU(2) and SO(3) familiar from 
group theory. 

will suffice. However, for the moment we shall not need an explicit representation of the 
rotation matrix. Rather, substituting the general expression Eq. (9.44) into the action, we 
obtain 

$$S[\mathbf{n}] \to S[U] = -{\rm tr} \ln(\partial_\tau + \xi + \gamma\, U \sigma_3 U^{-1}) = -{\rm tr} \ln(U^{-1} \partial_\tau U + \xi + \gamma \sigma_3) = -{\rm tr} \ln(\partial_\tau + \xi + \gamma \sigma_3 + U^{-1} \dot U),$$

where, in the first equality, we have used the cyclic invariance of the trace and, in the last 
equality, we have defined $\partial_\tau U = \dot U$. To proceed, we assume that the two energies $\epsilon \pm \gamma$ 
are well separated from the chemical potential, $|\xi \pm \gamma| \gg T$. We also assume that the 
frequency scales $\omega$ on which the fields $\mathbf{n}$ or $U$ vary are so slow that they do not change 
the occupation of the fermionic levels: $\omega/\gamma \ll 1$. In these circumstances, we may proceed 
by a straightforward expansion of the logarithm: $S[U] = {\rm tr}(G\, U^{-1} \dot U) + \mathcal{O}(\omega/\gamma)^2$, where 
$G = (-\partial_\tau - \xi - \gamma \sigma_3)^{-1}$ and we have anticipated (check!) that higher-order terms of the 
expansion will be small in the parameter $\omega/\gamma$. Switching to a frequency representation, 

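$$S[U] = \sum_n {\rm tr}\left(G_n\, (U^{-1} \dot U)_{m=0}\right) = -\int d\tau\; {\rm tr}\left(n_F(\xi + \gamma \sigma_3)\, U^{-1} \partial_\tau U\right) = -\int d\tau\; {\rm tr}\left(\left[n_F(\xi + \gamma \sigma_3) - \tfrac{1}{2}\right] U^{-1} \partial_\tau U\right),$$

where we have used the fact that the frequency sum of a fermionic Green function introduces 
the Fermi distribution function, $n_F$. In the last line, we have shifted $n_F$ by $1/2$, which is 
permissible because ${\rm tr}(U^{-1} \dot U)$ is a boundary term: 

$$\int_0^\beta d\tau\; {\rm tr}(U^{-1} \dot U) = \int_0^\beta d\tau\; \partial_\tau\, {\rm tr}(\ln U) = {\rm tr} \ln(U)\Big|_0^\beta = 0. \tag{9.46}$$

Now, if both levels are either occupied ($n_F(\xi \pm \gamma) - 1/2 \approx 1/2$) or unoccupied ($n_F(\xi \pm \gamma) - 1/2 \approx -1/2$), 
the action vanishes 59 since ${\rm tr}\left(\left[n_F(\xi + \gamma \sigma_3) - \tfrac{1}{2}\right] U^{-1} \partial_\tau U\right) \propto {\rm tr}(U^{-1} \partial_\tau U)$ is 
again a boundary term. 

Assuming, however, that $\xi + \gamma$ is empty while $\xi - \gamma$ is occupied we obtain 

$$S[U]\Big|_{\xi + \gamma \gg 0 \gg \xi - \gamma} \equiv S_{\rm WZ}[U] = -\frac{1}{2} \int_0^\beta d\tau\; {\rm tr}(\sigma_3 U^{-1} \dot U). \tag{9.47}$$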


We now run a few tests on $S_{\rm WZ}$ to identify it as a topological term of Wess-Zumino type. 
For one thing, our base manifold $M \simeq S^1$ is one-dimensional and $S_{\rm WZ}$ involves a first-order 
derivative. This exemplifies the scale invariance characteristic of topological terms. The 
suspicion that $S_{\rm WZ}$ is topological is corroborated by the fact that it is purely imaginary. 
(The time-derivative acts on a unitary matrix, i.e. something like a generalized “phase” - for 
more details, see below.) But what type of topological term are we dealing with? To get an 
answer to this question, we first need to determine the dimensionality of the target manifold 
$T$. Clearly, $T \neq {\rm SU}(2)$ in spite of the fact that we are temporarily using $U \in {\rm SU}(2)$ for our 


59 Up to corrections of $\mathcal{O}(\exp(-|\xi \pm \gamma|/T))$ which, in any case, are beyond the scope of the first-order expansion 
of the “tr ln” above. 





fields: our original theory was defined for $\mathbf{n} \in S^2$. Indeed, the representation Eq. (9.44) is 
invariant under the “gauge transformation” $U \to U \exp(i\psi \sigma_3)$. This means that the “true” 
field manifold is ${\rm SU}(2)/{\rm U}(1) \simeq S^2$ as one should expect. Now, $\dim S^2 = 2 = \dim M + 1$, so 
that $S_{\rm WZ}$ is likely to be a Wess-Zumino term. 

Indeed, we observe that, under the gauge transformation above, ${\rm tr}(\sigma_3 U^{-1} \dot U) \to 
{\rm tr}(\sigma_3 U^{-1} \dot U) + 2i\dot\psi$, i.e. the topological density is not invariant. This indicates that we 
will not be able to find a coordinate-independent representation of $S_{\rm WZ}$ in terms of $\mathbf{n}$. As 
discussed in the previous chapter, the lack of parameterization invariance is a hallmark of 
WZ terms. (There is no need to worry, though, about the gauge invariance of the theory: 
the periodic boundary conditions imposed on $U$ imply that $\psi(\beta) = \psi(0) + 2\pi n$, where 
$n$ is some winding number. Under the gauge transformation, the action thus changes by 
$\delta S = -2\pi i n$, so that $\exp(-\delta S) = 1$ remains invariant.) 

To present $S_{\rm WZ}$ in a less abstract form, we can substitute the angular representation 
Eq. (9.45) into Eq. (9.47). The straightforward evaluation of the derivative then gives 

$$S_{\rm WZ}[\phi, \theta] = -\frac{i}{2} \int_0^\beta d\tau\, (1 - \cos\theta)\, \partial_\tau \phi. \tag{9.48}$$

We identify this expression as the coordinate representation Eq. (9.42) of the WZ term 
arising in theories with base $S^1$ and target $S^2$. Recall that $S_{\rm WZ}$ evaluates to $-i/2$ times the 
area swept out by the closed curve $(\phi, \theta)$ on the sphere. 
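
The step from Eq. (9.47) to Eq. (9.48) can be checked numerically for one explicit rotation matrix. The sketch below assumes the parameterization $U = e^{-i\phi\sigma_3/2}\, e^{-i\theta\sigma_2/2}\, e^{i\phi\sigma_3/2}$; this is a choice made here because it satisfies Eq. (9.44) and reproduces the density $i(1 - \cos\theta)\,\partial_\tau\phi$ of Eq. (9.48), and it is not claimed to be identical to the matrix of Eq. (9.45). All function names are illustrative.

```python
import numpy as np

S1 = np.array([[0, 1], [1, 0]], dtype=complex)
S2 = np.array([[0, -1j], [1j, 0]], dtype=complex)
S3 = np.array([[1, 0], [0, -1]], dtype=complex)

def rotation(angle, pauli):
    """exp(-i * angle * pauli / 2) for a Pauli matrix (uses pauli @ pauli = identity)."""
    return np.cos(angle / 2) * np.eye(2) - 1j * np.sin(angle / 2) * pauli

def U(theta, phi):
    """Assumed SU(2) representative of the direction n(theta, phi); see the lead-in."""
    return rotation(phi, S3) @ rotation(theta, S2) @ rotation(-phi, S3)

def n_dot_sigma(theta, phi):
    """The matrix n . sigma for n = (sin t cos p, sin t sin p, cos t)."""
    return (np.sin(theta) * np.cos(phi) * S1
            + np.sin(theta) * np.sin(phi) * S2
            + np.cos(theta) * S3)

def topological_density(theta, phi, dtheta, dphi, h=1e-6):
    """tr(sigma_3 U^{-1} dU/dtau), with dU/dtau from central differences.

    (dtheta, dphi) are the tau-derivatives of the angles at the point (theta, phi).
    """
    dU = (U(theta + h * dtheta, phi + h * dphi)
          - U(theta - h * dtheta, phi - h * dphi)) / (2 * h)
    return np.trace(S3 @ U(theta, phi).conj().T @ dU)
```

For any angles and velocities, `topological_density` should match `1j * (1 - np.cos(theta)) * dphi`, and `U @ S3 @ U^{-1}` should match `n_dot_sigma`; multiplying the density by $-1/2$ and integrating over $\tau$ then gives Eq. (9.48).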

In fact, we have already met with the action Eq. (9.48) in Section 3.3 when we discussed 
the path integral of a single spin. That the core contribution to the action of that problem 
reappears here should not come as too much of a surprise: the classical magnetic moment $\propto \mathbf{n}$ 
“enslaves” the electron spin. In the limit of perfect alignment - realized under the conditions 
assumed above - we are left with the dynamics of a quantum spin whose dynamics are tied 
to that of n. The resulting action therefore coincides with that of a free spin described by 
the unit vector n. 

Sir Michael Berry 1941- 
Theoretical physicist who has made groundbreaking contributions to the field of quantum nonlinear 
dynamics and optics. Berry introduced the concept of the Berry phase (or geometric phase 
as he himself prefers to call it) and explored its manifestations in various physical contexts. 
(Figure courtesy of Sir Michael Berry.) 

In the literature, the action Eq. (9.48) is frequently referred to as the Berry phase action. To 
understand the rationale behind this terminology, let us briefly recall (see the Info block below) a few 
facts about the quantum mechanical Berry phase. Consider a quantum particle subject to the Hamiltonian 
$H(\mathbf{x}(t))$, where the $D$-component vector $\mathbf{x}(t) \equiv \{x_i(t)\}$ parameterizes a weakly time-dependent 
contribution to $H$. As shown by Berry, the dynamical phase acquired during the evolution of the particle 
can be expressed as $\exp\left(-i \int_0^t dt'\, \epsilon_0(t') + i\gamma(t)\right)$, where $\epsilon_0(t)$ is the energy of the instantaneous ground state 
of $H(t)$. The first contribution to the exponent is the usual dynamical phase of quantum 
evolution. (Here, it is assumed that the time-dependence of $H$ is sufficiently weak for us to 
neglect transitions into excited levels; for a static Hamiltonian, this contribution reduces to 
the familiar phase $\epsilon_0 t$.) The second contribution, $\gamma(t)$, is of geometric origin, inasmuch as it 
depends on the path traced out by the vector $\mathbf{x}$ in parameter space, but not on dynamical 
details. (In particular, it is independent of the velocity at which the parameter path is 
traversed.) The geometric phase assumes a particularly simple form if the parameter 
dependence of $H$ is periodic in time, $\mathbf{x}(0) = \mathbf{x}(t)$. In this case, $\mathbf{x}(t)$ defines a closed curve 
and $\gamma$ can be expressed as an integral over any surface in parameter space bounded by 
that curve. Specifically, for a spin $S$ particle Zeeman-coupled to a weakly time-dependent 
magnetic field $\mathbf{B}(t) = \mathbf{n}(t) B$ of constant magnitude, $B$, 

7 = ^ J d 2 x n • din x c>2n, (9.49) 

is the area bounded by the curve n(t) on the 2-sphere (which, in this case, defines the 
parameter space). Comparison with Eq. (9.48) and the example discussed on page 538 
indeed identifies the WZ-term as the Berry phase of the spin problem. 60 

60 In the discussion above, we considered a particle coupled to a weakly time-dependent magnetic moment - a 
canonical setup for the appearance of Berry phases. But why did the Berry phase/WZ-action also appear in 
our earlier discussion of a spin coupled to a fixed magnetic field? To answer this question, consider the world 
from the point of view of the spin. In the reference frame of the spin, the magnetic field is dynamical and a 
Berry phase term will be generated. Moreover, the area traced out by the field vector (i.e. the area determining 
the geometric phase) is equal to the area traced out by the spin (now we are back in the fixed-field reference 
frame) in its motion around the field axis. 

INFO A brief reminder of the Berry phase in quantum mechanics: Consider a particle 
governed by a Hamiltonian $H(\mathbf{x}(t))$. It is assumed that the time-dependence of the parameter 
vector $\mathbf{x}$ is adiabatic, which means that a particle initially prepared in the ground state $|0(t = 0)\rangle$ 
of the Hamiltonian $H(\mathbf{x}(0))$ will remain in the instantaneous ground state $|0(t)\rangle$ (i.e. the lowest 
eigenstate $H(\mathbf{x}(t))|0(t)\rangle = \epsilon(t)|0(t)\rangle$ of the operator $H(\mathbf{x}(t))$) throughout the entire observation 
time. (In practice, this means that the rate $\omega$ at which the parameters change in time must be 
much smaller than the excitation gap of the system.) 

We are interested in computing the dynamical phase corresponding to the time evolution of 
$|0(t)\rangle$. To this end, let us represent the wavefunction of the particle as $|\psi(t)\rangle = e^{-i\phi(t)}|0(t)\rangle$ and 
consider the time-dependent Schrödinger equation 

$$H(\mathbf{x}(t))|\psi(t)\rangle = i\partial_t |\psi(t)\rangle.$$

Substitution of the representation above and multiplication by $\langle 0(t)|$ then leads to the equation 
$\partial_t \phi = \epsilon(t) - i\langle 0(t)|\partial_t|0(t)\rangle$. Integrating over time and comparing with our discussion above, we 
are led to the identification $\gamma(t) \equiv i \int_0^t dt'\, \langle 0(t')|\partial_{t'}|0(t')\rangle$ of the Berry phase (exercise: why is $\gamma$ 
real?). Now, the instantaneous ground state inherits its time dependence from the parameters 
$\mathbf{x}(t)$. We may thus write 

$$\gamma(t) = i \int dt'\, \langle 0(\mathbf{x}(t'))|\partial_{x_i(t')}|0(\mathbf{x}(t'))\rangle\, \partial_{t'} x_i(t') = i \int_c d\mathbf{x} \cdot \langle 0(\mathbf{x})|\partial_{\mathbf{x}}|0(\mathbf{x})\rangle = i \int_c \langle 0|d0\rangle.$$

Here, the second integral has to be interpreted as a line integral in parameter space. It is taken 
along a curve $c$ which starts at $\mathbf{x}(0)$, follows the evolution of the parameter vector, and ends at 
$\mathbf{x}(t)$. Importantly, the line integral depends only on the choice of $c$ but not on the velocity at 
which this curve is traversed (the dynamics of the process). In this sense, we are dealing with 
a phase of geometric origin. The third integral representation above emphasizes the geometric 
nature of the phase even more strongly: for any value of $\mathbf{x}$, we have a state $|0(\mathbf{x})\rangle$. We may then 
construct the differential 1-form $\langle 0(\mathbf{x})|d0(\mathbf{x})\rangle$. 61 The geometric phase is obtained by evaluating 
the integral of this form along the curve $c$. 

The advantage of the third representation above is that it suggests yet another formulation of
the geometric phase, at least in cases where a closed path in parameter space is traversed. For a
closed loop $c$, application of Stokes’ theorem gives

$$\gamma = i\oint_c\langle 0|d0\rangle = i\int_S\langle d0|\wedge|d0\rangle, \qquad (9.50)$$

where $S$ may be any surface in parameter space that is bounded by $c$. This last representation is
aesthetic, and easy to memorize, but also a bit too compact to be of real computational use. To
give it a more concrete meaning, we insert a spectral decomposition in terms of instantaneous
eigenstates,

$$\gamma = i\int_S\sum_{m\neq 0}\langle d0|m\rangle\wedge\langle m|d0\rangle.$$

(Exercise: Why does the $m = 0$ term vanish, i.e. $\langle d0|0\rangle\wedge\langle 0|d0\rangle = 0$? Hint: Make use of the fact
that $d\langle 0|0\rangle = \langle d0|0\rangle + \langle 0|d0\rangle = 0$ and the skew-symmetry of the $\wedge$-product.) We now evaluate
the equation $0 = \langle m|d[(H-\epsilon_0)|0\rangle]$ to obtain $\langle m|d0\rangle = (\epsilon_0-\epsilon_m)^{-1}\langle m|dH|0\rangle$ or

$$\gamma = i\int_S\sum_{m\neq 0}\frac{\langle 0|dH|m\rangle\wedge\langle m|dH|0\rangle}{(\epsilon_m-\epsilon_0)^2}.$$

This is about as far as we get in general terms. We have established the geometric nature of the 
Berry phase. However, inasmuch as it requires explicit knowledge of the spectrum of H(x(t)), 
the actual calculation of the phase remains a difficult problem. 
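
The step from the eigenvalue equation to the matrix element $\langle m|d0\rangle$ can be spelled out as follows. This is only a sketch: it assumes a non-degenerate ground state, so that $\epsilon_m\neq\epsilon_0$ for $m\neq 0$, and it uses $\langle m|0\rangle = 0$ for $m\neq 0$.

```latex
\begin{align}
0 &= \langle m|\,d\big[(H-\epsilon_0)|0\rangle\big]
   = \langle m|(dH-d\epsilon_0)|0\rangle + \langle m|(H-\epsilon_0)|d0\rangle \\
  &= \langle m|dH|0\rangle + (\epsilon_m-\epsilon_0)\,\langle m|d0\rangle
  \qquad (m\neq 0), \\
\Rightarrow\quad
\langle m|d0\rangle &= \frac{\langle m|dH|0\rangle}{\epsilon_0-\epsilon_m},
\qquad
\langle d0|m\rangle = \frac{\langle 0|dH|m\rangle}{\epsilon_0-\epsilon_m}.
\end{align}
```

The product of the two matrix elements carries the squared energy denominator $(\epsilon_m-\epsilon_0)^2$, which is why the phase is sensitive to the full instantaneous spectrum.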

There are cases, however, where the calculation of the geometric phase reduces to a straightforward
surface integral in parameter space. One such example is provided by a spin subject to a weakly
time-dependent magnetic field. Consider the Hamiltonian
$H = \mu\,\mathbf{n}(x)\cdot\boldsymbol{\sigma} = \mu\,U(x)\sigma_3U^{-1}(x)$, where $U$ is the rotation matrix introduced in
Eq. (9.44). The $2S+1$ instantaneous eigenstates of $H$ are given by $U|S_3\rangle$, where
$\sigma_3|S_3\rangle = S_3|S_3\rangle$ and $S_3 = -S,\ldots,S$ is the azimuthal spin quantum number. To compute the
Berry phase, we can consider the first of the two representations in Eq. (9.50). Noting that the
ground state is given by $|0\rangle = U|S_3 = -S\rangle$ (we assume that the magnetic moment $\mu > 0$) and
parameterizing the rotation matrix as in Eq. (9.45), one verifies that
$\langle 0|d0\rangle = \langle -S|U^{-1}dU|-S\rangle = -iS(1-\cos\theta)\,d\phi$. We thus obtain
$\gamma = S\oint_c d\phi\,(1-\cos\theta) = S\int dt\,(1-\cos\theta)\dot\phi$, which coincides (up to a Wick rotation
$t\to -i\tau$, which we know does not affect topological terms) with the $S = 1/2$ WZ-action Eq. (9.48).
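
As a numerical cross-check of $\gamma = S\oint d\phi\,(1-\cos\theta)$, the following sketch evaluates the Berry phase of a spin-1/2 ground state on a discretized loop at fixed polar angle $\theta$. It assumes $H = \mathbf{n}\cdot\boldsymbol{\sigma}$ with Pauli matrices (i.e. $\mu = 1$, $S = 1/2$) and uses the gauge-invariant product of overlaps between neighbouring ground states as a discretization of $i\oint\langle 0|d0\rangle$. The function names are illustrative and not taken from the text; the phase is only defined modulo $2\pi$.

```python
import numpy as np

# Pauli matrices
SX = np.array([[0, 1], [1, 0]], dtype=complex)
SY = np.array([[0, -1j], [1j, 0]], dtype=complex)
SZ = np.array([[1, 0], [0, -1]], dtype=complex)


def ground_state(theta, phi):
    """Lowest eigenvector of H = n.sigma for the unit vector n(theta, phi)."""
    h = (np.sin(theta) * np.cos(phi) * SX
         + np.sin(theta) * np.sin(phi) * SY
         + np.cos(theta) * SZ)
    _, vecs = np.linalg.eigh(h)  # eigenvalues are returned in ascending order
    return vecs[:, 0]


def berry_phase(theta, steps=2000):
    """Discretized i * closed-loop integral of <0|d0> at fixed theta, modulo 2*pi."""
    phis = np.linspace(0.0, 2.0 * np.pi, steps, endpoint=False)
    states = [ground_state(theta, p) for p in phis]
    prod = 1.0 + 0.0j
    for k in range(steps):
        # <0_k|0_{k+1}> ~ 1 + <0|d0>; arbitrary phases of the eigenvectors
        # cancel in the closed product.
        prod *= np.vdot(states[k], states[(k + 1) % steps])
    return (-np.angle(prod)) % (2.0 * np.pi)


theta = np.pi / 3
print(berry_phase(theta), np.pi * (1.0 - np.cos(theta)))
```

The two printed numbers should agree up to the discretization error, since for $S = 1/2$ the closed-loop formula gives $\gamma = \pi(1-\cos\theta)$.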


⁶¹ Should you find the representation too abstract, choose any basis $\{|\lambda\rangle\}$ and write
$\langle 0(x)|d0(x)\rangle = \sum_\lambda\langle 0(x)|\lambda\rangle\,d\langle\lambda|0(x)\rangle$, where $d\langle\lambda|0(x)\rangle$ is the exterior derivative of the
function $\langle\lambda|0(x)\rangle$.

9.4.4 Spin chains: beyond the semi-classical limit

In Section 9.3.3, we began to explore the physics of one-dimensional spin chains. Much of our
analysis was limited to the case $S\gg 1$, a semi-classical regime where quantum fluctuations
are weak. In this limit, the spin chain is described by an O(3) nonlinear σ-model with a
θ-term. This model, however, does not stay invariant under renormalization. It flows towards
a “strong coupling” regime where fluctuations are large and the effective value of the spin
(formally: the coupling constant of the gradient term in the model) becomes weak. Apart
from the conjecture that, for half-integer/integer initial spin, the model flows towards an
ordered/disordered phase with gapless/gapped excitations, there was nothing we could say
about the large-distance behavior of the model (or about the physics of chains consisting of
small spins $S = O(1)$).

In this section, we invest quite some effort in developing a fresh attack on the physics 
of the small S spin chain. This is motivated in part by the enormous amount of recent 
experimental activity on quasi-one-dimensional spin compounds. (“Real” spin chains are 
often realized as structural sub-units of transition metal compounds. In these systems, the 
spin - which is carried by Hund’s rule coupled inner shell electrons of transition metal 
atoms - may reach as high as 7/2.) Another, and more theory-related, motivation for our 
study is that the low-energy physics of the spin chain is governed by a fascinating interplay 
of different topologically non-trivial quantum field theories. 

Following Affleck and Haldane,⁶² the principal idea of our approach will be to exploit the
equivalence of the antiferromagnetic spin chain to a one-dimensional Hubbard model at half
filling.⁶³ The advantage gained by this digression is that the fermionic model is amenable
to various analytical tools which (at least in any known sense) do not apply to the
spin chain per se. We will find that the effective low-energy model describing the fermion
system is the WZ field theory conjectured to be the fixed point theory of the σ-model in
Section 9.3.3.


Fermion representation of the antiferromagnetic spin chain 
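Consider a one-dimensional chain of equidistant (spacing $a$) lattice sites $i$. Each site hosts
$n_c$ degenerate fermion states, or “orbitals.” Consider a half-filled situation, i.e. on average,
each orbital is occupied by a single spin-1/2 fermion. We now introduce some dynamics so
that the low-energy physics of the system is equal to that of an antiferromagnetic spin chain
with spin $S = n_c/2$. (Notice that this construction is not quite as artificial as it may seem:
the effective moments observed in “real” crystals are usually composite objects, composed
of more than one elementary electron spin and stabilized by electron correlations.)

To align the $n_c$ spins at each site, we introduce a strong ($U\gg J$) Hund’s rule coupling

$$H_{\rm int} = -U\sum_i\mathbf{S}_i\cdot\mathbf{S}_i = -\frac{U}{4}\sum_i\big(\psi^\dagger_{i\alpha a}\boldsymbol{\sigma}_{\alpha\beta}\psi_{i\beta a}\big)\cdot\big(\psi^\dagger_{i\alpha' b}\boldsymbol{\sigma}_{\alpha'\beta'}\psi_{i\beta' b}\big)$$

$$= \frac{U}{4}\sum_i\Big[2\big(\psi^\dagger_{i\alpha a}\psi_{i\alpha b}-\delta_{ab}\big)\big(\psi^\dagger_{i\beta b}\psi_{i\beta a}-\delta_{ba}\big) + \big(\psi^\dagger_{i\alpha a}\psi_{i\alpha a}-n_c\big)^2\Big] + \text{const.} \qquad (9.52)$$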


⁶² I. Affleck and F. D. M. Haldane, Critical theory of quantum spin chains, Phys. Rev. B 36 (1987), 5291-300.

⁶³ Recall that, for strong interaction, the one-dimensional Hubbard model maps onto the so-called $(t-J)$-model
which (at half filling) reduces to the antiferromagnetic Heisenberg model (the spin chain).

Figure 9.14 (a) A chain of sites, each containing $n_c$ fermions on average. (b) A strong Hund’s rule
coupling maximizes the spin carried by each state to $S = n_c/2$. (c) Upon the switching on of a
finite nearest neighbor hopping matrix element, the system becomes a spin $S$ antiferromagnet.


Here, $a = 1,\ldots,n_c$ is the orbital index, $\alpha,\beta$ are spin indices, and, to get from the first to
the second line, we have used the identity
$\boldsymbol{\sigma}_{\alpha\beta}\cdot\boldsymbol{\sigma}_{\alpha'\beta'} = 2\delta_{\alpha\beta'}\delta_{\beta\alpha'} - \delta_{\alpha\beta}\delta_{\alpha'\beta'}$. Of special
interest to our further discussion will be the last term in the second line. This Hubbard-type
interaction tells us that each site favors a site occupancy of $n_c$ electrons (i.e. half filling).
Indeed, $n_c$ electrons are needed to manufacture a net spin of maximum weight $S = n_c/2$
(see Fig. 9.14) and, thus, to optimize the Heisenberg interaction Eq. (9.52).
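
The Pauli-matrix identity used to pass from the first to the second line of Eq. (9.52) is easy to confirm numerically. The sketch below checks $\sum_k\sigma^k_{\alpha\beta}\sigma^k_{\alpha'\beta'} = 2\delta_{\alpha\beta'}\delta_{\beta\alpha'} - \delta_{\alpha\beta}\delta_{\alpha'\beta'}$ component by component; the function name is illustrative only.

```python
import numpy as np

# The three Pauli matrices stacked along the first axis: SIGMA[k, a, b] = sigma^k_{ab}
SIGMA = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)


def check_pauli_identity():
    """Return True if sum_k sigma^k_{ab} sigma^k_{cd} = 2 d_{ad} d_{bc} - d_{ab} d_{cd}."""
    delta = np.eye(2)
    lhs = np.einsum('kab,kcd->abcd', SIGMA, SIGMA)
    rhs = (2.0 * np.einsum('ad,bc->abcd', delta, delta)
           - np.einsum('ab,cd->abcd', delta, delta))
    return bool(np.allclose(lhs, rhs))


print(check_pauli_identity())
```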

Let us now introduce a small amount of inter-site hopping:

$$H_0 = -\frac{1}{2}(n_cJU)^{1/2}\sum_i\big[\psi^\dagger_{i\alpha a}\psi_{i+1,\alpha a} + \text{h.c.}\big], \qquad (9.53)$$

where the constant $J$ determines the hopping strength. In the limit $J/U\to 0$, the half-filled
system becomes equivalent to the spin $S$ antiferromagnetic chain,

$$H = H_0 + H_{\rm int} \longrightarrow H_{\rm af} = J\sum_i\mathbf{S}_i\cdot\mathbf{S}_{i+1} + O(J/U). \qquad (9.54)$$

The easiest way to see this is to recall the situation in the standard Hubbard model (the
$n_c = 1$ variant of our present model) at half filling (cf. Section 2.2). There, virtual deviations
from half filling led to an effective antiferromagnetic exchange coupling between the $S = 1/2$
spins carried by neighboring sites. The effective strength of this interaction was $J\sim t^2/U$,
where $t$ is the strength of the hopping term. Formally, the generalization of this mechanism
to the case $n_c > 1$ can be shown, for example, by subjecting the Hamiltonian $H$ to a
canonical transformation eliminating the hopping term (all in complete analogy to the
$n_c = 1$ canonical transformation discussed in Section 2.2).
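
The scaling $J\sim t^2/U$ can be checked on the smallest possible example. The sketch below diagonalizes a two-site, single-orbital Hubbard model at half filling in the $S_z = 0$ sector and compares the singlet-triplet splitting with the second-order estimate $4t^2/U$. The basis ordering and the sign convention of the hopping matrix elements are assumptions of this illustration (they do not affect the spectrum), and the function name is hypothetical.

```python
import numpy as np


def singlet_triplet_gap(t, U):
    """Exact singlet-triplet splitting of the two-site Hubbard model at half filling."""
    # Basis of the S_z = 0 sector: |up,dn>, |dn,up>, |updn,0>, |0,updn>
    h = np.array([
        [0.0, 0.0, -t, -t],
        [0.0, 0.0, t, t],
        [-t, t, U, 0.0],
        [-t, t, 0.0, U],
    ])
    energies = np.linalg.eigvalsh(h)
    # The lowest level is the singlet; the S_z = 0 triplet state has energy zero
    # because the hopping does not couple it to the doubly occupied states.
    return 0.0 - energies[0]


t, U = 0.05, 1.0
gap = singlet_triplet_gap(t, U)
estimate = 4.0 * t**2 / U
print(gap, estimate)
```

For $t\ll U$ the exact gap, $\tfrac{1}{2}\big(\sqrt{U^2+16t^2}-U\big)$, approaches the perturbative value $4t^2/U$, which plays the role of the exchange constant in this toy model.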

We have, thus, established the equivalence between the strongly interacting ($J\ll U$)
fermion Hamiltonian and the spin chain. Now there comes a major conceptual jump - and
admittedly one that is not backed up by quantitative reasoning: we postulate that the
equivalence between the two systems pertains to the case $J\gg U$, at least as far as the
relevant long-range excitations are concerned. If this were not the case, there should be some
kind of abrupt change (a phase transition) in the behavior of the system as the interaction
is increased. Although this scenario cannot be rigorously excluded, it seems too unlikely to be
taken seriously.


Non-abelian bosonization

Let us, then, consider the low-energy physics of the weakly interacting ($J\gg U$) fermion
system. In fact, we shall begin by considering the totally non-interacting case.⁶⁴ Switching
from a lattice to a continuum description and linearizing around the two Fermi points (for
details, see Section 2.2), we describe this prototypical system in terms of the action

$$S[\psi^\dagger,\psi] = \sum_{s=\pm 1}\int dx\,d\tau\,\psi^\dagger_{sr}(-isv_F\partial_x+\partial_\tau)\psi_{sr} = \int d^2x\,\bar\psi_r\,\partial\!\!\!/\,\psi_r, \qquad (9.55)$$

where $\psi_{s=\pm 1}$ are the left- and right-moving fermion fields. In the latter equality we have set
$v_F = 1$ and switched to the Dirac notation (for details, see Section 4.3), and $r\equiv(\alpha,a) =
1,\ldots,2n_c\equiv N$ is a composite index comprising spin and orbital components of the fermion
field. In previous chapters, we have seen that the one-component ($N = 1$) variant of this
model could be equivalently described in terms of a free bosonic action,

$$S[\bar\psi,\psi] = \int d^2x\,\bar\psi\,\partial\!\!\!/\,\psi \;\longleftrightarrow\; S[\theta] = \frac{1}{2\pi}\int d^2x\,\partial_\mu\theta\,\partial_\mu\theta, \qquad (9.56)$$


where the double arrow $\leftrightarrow$ indicates that all fermion operators $O[\bar\psi,\psi]$ (currents, densities,
etc.) can be expressed in terms of boson operators $O[\theta]$, and correlation functions
$\langle O_1O_2\cdots\rangle_\psi$ can be identically rewritten as $\langle O_1O_2\cdots\rangle_\theta$.

To what extent does this picture survive generalization to the many-channel case? The answer
to this question was given in a seminal paper by Witten. Witten found that⁶⁵

The free fermion action Eq. (9.55) can be equivalently described in terms of a
two-dimensional nonlinear σ-model with a Wess-Zumino term.

Edward Witten, 1951-
Mathematical physicist and string theorist. Awarded the 1990 Fields Medal for his ground-breaking
work in differential geometry. Witten contributed massively to the success of string theory. (Photo
by Randall Hagadorn. Courtesy of the Institute of Advanced Study.)


More precisely, he showed that

$$S[\bar\psi,\psi] = \int d^2x\,\bar\psi\,\partial\!\!\!/\,\psi \;\longleftrightarrow\; S_{\rm WZW}[g] = \frac{1}{8\pi}\int_{S^2}d^2x\,\mathrm{tr}\,(\partial_\mu g\,\partial_\mu g^{-1}) + \Gamma[g], \qquad (9.57)$$

where $g\in\mathrm{U}(N)$ and

$$\Gamma[g] = -\frac{i}{12\pi}\int_{B^3}d^3x\,\epsilon^{ijk}\,\mathrm{tr}\,(g^{-1}\partial_ig\,g^{-1}\partial_jg\,g^{-1}\partial_kg), \qquad (9.58)$$

denotes the WZ action.⁶⁶ On the right-hand side of Eq. (9.57), two-dimensional space-time
has been compactified to a 2-sphere $S^2$. This sphere is then understood as the boundary
of a three-dimensional unit ball, $B^3$, which serves as the integration domain of the
WZ functional. As in the one-component case, the double arrow in Eq. (9.57) implies equality
of all correlation functions upon suitable identification of operators (see the Info block
below).

⁶⁴ As in our previous discussion of the single-channel case, it will turn out that the inclusion of interactions is
straightforward once the effective bosonic degrees of freedom have been identified.

⁶⁵ E. Witten, Nonabelian bosonization in two dimensions, Commun. Math. Phys. 92 (1984), 455-72.

Equation (9.57) is the multi-component or “non-abelian” generalization of the prototypical
bosonization identity (9.56). To understand the connection between the two equations,
consider the restriction of U($N$) to its maximal abelian subgroup, i.e. the group of all diagonal
matrices $g = \mathrm{diag}(e^{i2\phi_1},\ldots,e^{i2\phi_N})$. Evaluated on such configurations, the WZ term
vanishes (exercise: show this) while the gradient term,

$$S = \sum_{i=1}^{N}\frac{1}{2\pi}\int d^2x\,(\partial_\mu\phi_i)^2, \qquad (9.59)$$

collapses to the sum of $N$ free boson actions. This is the description we would have obtained
had we applied Eq. (9.56) to each of the $N$ fermion components individually (which, after
all, is a perfectly legitimate thing to do!). But what, then, is the advantage of the generalized
variant of bosonization Eq. (9.57)? Referring for a more detailed discussion to the Info block
below, we here merely note that the action Eq. (9.57) possesses a huge symmetry group:
transformations $\psi_s\to g_s\psi_s$, $g_s\in\mathrm{U}(N)$, leave the action invariant, i.e. the symmetry group
of the problem is given by U($N$) × U($N$). Suppose we had bosonized each fermion in the
standard abelian manner, thus arriving at the action Eq. (9.59). We might now ask how the
symmetries of the problem - which of course must survive a change of representation - act
in the $\phi$-language. Frustratingly, there is no answer to this question;⁶⁷ the symmetries are
no longer manifestly present and there is no direct way to benefit from their existence. With
the non-abelian generalization Eq. (9.57), this is not so. As we shall see in a moment, the
symmetry group U($N$) × U($N$) acts on the $g$-degrees of freedom by left-right multiplication,
$g\to g_+gg_-^{-1}$.⁶⁸ In previous chapters we have emphasized time and again the tremendous
importance of symmetries and the resulting conservation laws. Indeed it turns out that, in
the present problem, the comparative complexity of the action Eq. (9.57) is far outweighed
by the manifest presence of the symmetries.
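
The reduction to $N$ free bosons can be made explicit. The following sketch assumes the $1/8\pi$ normalization of the gradient term in Eq. (9.57) and the parameterization $g = \mathrm{diag}(e^{i2\phi_1},\ldots,e^{i2\phi_N})$; the index $a$ labels the diagonal entries.

```latex
\begin{align}
\partial_\mu g\,\partial_\mu g^{-1}
  &= \mathrm{diag}\big(2i\,\partial_\mu\phi_a\,e^{i2\phi_a}\big)\,
     \mathrm{diag}\big(-2i\,\partial_\mu\phi_a\,e^{-i2\phi_a}\big)
   = \mathrm{diag}\big(4(\partial_\mu\phi_a)^2\big), \\
\frac{1}{8\pi}\int d^2x\,\mathrm{tr}\,(\partial_\mu g\,\partial_\mu g^{-1})
  &= \sum_{a=1}^{N}\frac{1}{2\pi}\int d^2x\,(\partial_\mu\phi_a)^2, \\
\epsilon^{ijk}\,\mathrm{tr}\,(g^{-1}\partial_i g\,g^{-1}\partial_j g\,g^{-1}\partial_k g)
  &= (2i)^3\sum_{a=1}^{N}\epsilon^{ijk}\,\partial_i\phi_a\,\partial_j\phi_a\,\partial_k\phi_a = 0 .
\end{align}
```

The last line vanishes because the totally antisymmetric tensor is contracted with a product that is symmetric in its indices, so the WZ term does not contribute on the abelian subgroup.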


⁶⁶ Deviating from our earlier conventions, we will denote the WZ-action by $\Gamma[g]$ throughout (instead of the
notation $S_{\rm WZ}[g]$ used above). In doing so, we follow a standard literature convention. Also, the full action
(including the gradient term) is commonly denoted by $S_{\rm WZW}$ (where the last “W” credits Witten’s contribution
to two-dimensional WZ field theory), and the resemblance between the two symbols $S_{\rm WZ}$ and $S_{\rm WZW}$ may cause
confusion.

⁶⁷ One may object that a theory of free bosons is so simple that one need not care about the conservation laws
introduced by symmetries. However, this argument is too shortsighted. It ignores the fact that most operators
of interest are transcendental in the $\phi$s, i.e. the full theory is not quite as simple as Eq. (9.59) would suggest.

⁶⁸ This provides another explanation of why the $\phi$-description is too narrow to accommodate the symmetry: a
diagonal matrix $g$ will not remain diagonal when acted upon by the transformation matrices.
INFO The proof of the bosonization identity (9.57) relies on an extension of the considerations
summarized in Section 4.3 to the case of non-abelian symmetries. Although a detailed discussion
of the construction (for which we refer to Witten’s original, yet highly pedagogical, paper) would
be beyond the scope of this text, let us briefly summarize some of the principal ideas behind
the method of non-abelian bosonization.

Let us begin by recalling that, in the abelian case $N = 1$, we had two basic symmetries,
$\psi_s\to g_s\psi_s$, where $g_s = \exp(i2\phi_s)$, $\phi_s$ are constant phases and $\psi_{s=\pm 1}$ the left- and right-moving
fermion fields. These symmetries express the independence of the left- and right-moving states
or, equivalently, the chirality of the problem. A straightforward application of Noether’s theorem
(or, equivalently, a direct variation of the action) shows that the conservation laws induced by
the chiral symmetry are $\partial_{\bar z}j = \partial_z\bar j = 0$, where $j = \psi_+^\dagger\psi_+$, $\bar j = \psi_-^\dagger\psi_-$, and we have introduced
complex coordinates $z = \frac{1}{\sqrt 2}(x_0+ix_1)$, $\partial_z = \frac{1}{\sqrt 2}(\partial_0-i\partial_1)$.⁶⁹

In Section 4.3 we saw that, on the bosonic level, the chiral symmetries act as
$\theta\to\theta+\phi_+-\phi_-$. The conservation laws corresponding to these transformations read as
$\partial_z\partial_{\bar z}\theta = 0$ (exercise: check this). At the same time we know that the transformation
generated by $\phi_+$ ($\phi_-$) generates the conservation law $\partial_{\bar z}j = 0$ ($\partial_z\bar j = 0$). Comparison with the
two equations above leads to the identification $j = \psi_+^\dagger\psi_+ = \frac{i}{\sqrt{2\pi}}\partial_z\theta$ and
$\bar j = \psi_-^\dagger\psi_- = -\frac{i}{\sqrt{2\pi}}\partial_{\bar z}\theta$, where the factor of $i/\sqrt{2\pi}$ has been included to obtain
consistency with the definition of the vectorial current $j_{v,0} = \rho\sim\partial_1\theta\sim j+\bar j$.⁶⁹

We now reformulate these results in a manner amenable to generalization: introducing
$g = \exp(i2\theta)\in\mathrm{U}(1)$, the bosonic action of Eq. (9.56) assumes the form
$S[g] = \frac{1}{8\pi}\int d^2x\,\partial_\mu g\,\partial_\mu g^{-1}$, while the symmetries now act by conjugation:
$g\to g_+gg_-^{-1}$. Here, $e^{i2\phi_\pm} = g_\pm$. Finally, the conserved currents can be defined as

$$j = \frac{1}{\sqrt{8\pi}}\,(\partial_zg)\,g^{-1},\qquad \bar j = -\frac{1}{\sqrt{8\pi}}\,g^{-1}\partial_{\bar z}g. \qquad (9.60)$$

At this stage, the ordering of the - abelian - factors $g$ is of course arbitrary. However, this will
change once we proceed to the multi-channel case. (Notice that the equations (9.60) are solved
by $g = g_+(z)\,g_-(\bar z)$, where the two independent factors $g_+$ and $g_-$ describe the right/left-moving
fermion states. This solution does not rely on the commutativity of the phases $g$ and generalizes
to the non-abelian case.)

For $N > 1$, the phases $g_s$ generalize to two independent unitary matrices $g_\pm\in\mathrm{U}(N)$. The
enlarged symmetry entails the conservation laws $\partial_{\bar z}j^{rr'} = \partial_z\bar j^{rr'} = 0$, $r,r' = 1,\ldots,N$, where the
currents are given by $j^{rr'} = \psi^\dagger_{+r}\psi_{+r'}$ and $\bar j^{rr'} = \psi^\dagger_{-r}\psi_{-r'}$.

EXERCISE To verify this last statement, consider the infinitesimal unitary transformation
$U = \exp(i\eta W)$ and expand the action Eq. (9.55) to first order in the Hermitian generator
matrices $W$.

What are the bosonic counterparts of these expressions? It is natural to generalize the phases
$g\in\mathrm{U}(1)$ of the abelian case to unitary matrices $g\in\mathrm{U}(N)$. As before, the symmetry group will
act by left-right multiplication, i.e. $g\to g_+gg_-^{-1}$, and we expect the conserved currents to be
given by Eq. (9.60).⁷⁰

⁶⁹ Throughout, it will be preferable to use this complex notation. Notice that the two conserved currents $j$ and $\bar j$
are related to the components of the vectorial currents discussed in Section 4.3 by $j_{v,0}\sim j+\bar j$ and $j_{v,1}\sim i(j-\bar j)$.

⁷⁰ Notice, however, that the currents are now matrices $j = \{j^{rr'}\}$ and that - unlike in the abelian case - the
ordering of the matrices $g$ on the right-hand side of the definition is crucial. The factors in Eq. (9.60) are
ordered in such a way that $\partial_{\bar z}j = 0$ and $\partial_z\bar j = 0$ are compatible with each other.
We now need an action that (a) is invariant under the multiplicative action of the symmetry
group and (b) for $N = 1$ reduces to $\int\partial_\mu g\,\partial_\mu g^{-1}$. An obvious candidate would be

$$S_0[g] = \frac{1}{8\pi}\int d^2x\,\mathrm{tr}\,(\partial_\mu g\,\partial_\mu g^{-1}). \qquad (9.61)$$

Yet, for a number of reasons, Eq. (9.61) does not suffice. Firstly, for this action, the conservation
laws derived from the symmetry of the problem read $\partial_\mu(g^{-1}\partial_\mu g) = 0$, which is inconsistent with
our result above, $\partial_z(g^{-1}\partial_{\bar z}g) = 0$. Secondly, we have seen in the previous chapter (cf. Section 8.5)
that the nonlinear σ-model Eq. (9.61) renormalizes at large length scales to smaller values of
the coupling constant. At the same time, it is supposed to describe the fermionic action (9.55),
which obviously does not renormalize. This tells us that $S_0[g]$ alone does not suffice to establish
the boson-fermion correspondence.

But let us now inspect the second term in the action proposed by Witten, the two-dimensional
WZ functional. The first thing we have to understand is why the second term of the action indeed
represents a WZ functional in the sense of the discussion of Section 9.4.2.⁷¹ To construct a WZ
term, we need a differential form $\omega$ on the target manifold that is closed, $d\omega = 0$, but only locally
exact ($\omega = d\kappa$ only locally). In Problem 9.7.2 it is shown that, on a group-valued target manifold,
these criteria are met by the form $\omega = \mathrm{tr}\,(g^{-1}dg\wedge g^{-1}dg\wedge g^{-1}dg)$. (Do not be confused by the
notation. What it really means is
$\omega = \sum_{ijklmn}(g^{-1})_{ij}\,dg_{jk}\wedge(g^{-1})_{kl}\,dg_{lm}\wedge(g^{-1})_{mn}\,dg_{ni}$, where
$dg_{jk}$ is the differential form of the function $g_{jk}$ assigning to each element $g\in\mathrm{U}(N)$ its matrix
component $g_{jk}$.) The general theory developed in Section 9.4.2 then tells us that $iC\int_{B^3}\tilde g^*\omega$ is
a WZ term, where $\tilde g$ is a smooth extension of the field $g: S^2\to\mathrm{U}(N)$ to a field defined on the
entire ball $B^3$. Expressed in terms of some coordinate functions $(x_1,x_2,x_3)$ of $B^3$, this expression
becomes identical to Eq. (9.58). (As to the quantization of the coupling constant, see Problem
9.7.2.) In passing we note that the WZ functional is manifestly invariant under the action of the
chiral symmetry group, as is required by the general structure of the theory.

It is instructive to inspect the equations of motion obtained from the WZ term. In Problem
9.7.2 we show that, upon variation $g\to e^Wg\simeq(1+W)g$ and expansion to first order in $W$,
we obtain

$$\Gamma[(1+W)g]-\Gamma[g] = \frac{i}{4\pi}\int d^2x\,\epsilon^{\mu\nu}\,\mathrm{tr}\,(W\,\partial_\mu g\,\partial_\nu g^{-1}) + O(W^2). \qquad (9.62)$$

A straightforward calculation shows that the variation of the gradient term $S_0$ is given by

$$S_0[(1+W)g]-S_0[g] = -\frac{1}{8\pi}\int d^2x\,\mathrm{tr}\,\big(W\,(g\,\partial_\mu^2g^{-1}-(\partial_\mu^2g)\,g^{-1})\big) + O(W^2).$$

Combining these two results, we obtain the equations of motion

$$-2i\,\epsilon^{\mu\nu}\,\partial_\mu g\,\partial_\nu g^{-1} + g\,\partial_\mu^2g^{-1} - (\partial_\mu^2g)\,g^{-1} = 0.$$

A straightforward calculation shows that these equations are indeed equivalent to the relation
$\partial_{\bar z}j\propto\partial_{\bar z}\big((\partial_zg)\,g^{-1}\big) = 0$.

Summarizing, we have succeeded in finding an action that (a) is manifestly chirally invariant
and (b) produces the same conservation laws as the free fermion theory. Notice, however, that
we have not “proven” the analogy Eq. (9.57); rather, our analysis was mostly based on drawing
illustrative analogies to the one-dimensional case. (For a more rigorous discussion, we refer to
Gogolin et al.⁷²)

⁷¹ Notice that, in Section 9.4.2, we focused on the case $\dim T = \dim M + 1$. Presently, however, $\dim M = 2$ while
$\dim T = \dim\mathrm{U}(N)$ can become arbitrarily large.

We conclude our preliminary survey of non-abelian bosonization by noting two crucial differences
to the abelian case:

▷ In the abelian case, a free fermion action was mapped onto an equally free boson action. Being
quadratic in the fields, the two theories manifestly do not renormalize. In the non-abelian
case, the situation is different: although it is not obvious, the right-hand side of Eq. (9.57) -
a highly nonlinear functional of the group-valued fields $g$ - does not renormalize. For a proof
of this feature we refer to Problem 9.7.3.

▷ Above, we have seen that bilinears $\sim\psi_s^\dagger\psi_s$ composed of left- or right-moving fermions afford
a representation in terms of the Bose fields. Without proof, we mention that even bilinears
involving fermions of different chirality can be expressed in terms of bosonic fields:

(9.63)

However, unlike in the abelian case (see Eq. (4.46)), no boson representation of individual
fermion operators is known. This is quite unfortunate as it excludes the applicability of the
formalism to several interesting fields of investigation, notably the physics of fermions in the
presence of disorder.

In fact, the U($N$) action introduced in Eq. (9.57) defines two independent field theories at
once: every matrix $g\in\mathrm{U}(N)$ can be decomposed as $g = e^{i2\phi}g'$ into a matrix $g'\in\mathrm{SU}(N)$
and a phase factor $e^{i2N\phi} = \det(g)\in\mathrm{U}(1)$. Substituting this decomposition into the action, we
obtain

$$S[g] = S[g'] + \frac{N}{2\pi}\int_{S^2}d^2x\,(\partial_\mu\phi)^2. \qquad (9.64)$$

Equation (9.64) tells us that the action decomposes into a WZW action for an SU($N$)-valued
field variable and an independent second action for the phase degree of freedom. Recalling that
the invariance of the action under a homogeneous phase (gauge) transformation corresponds
to the conservation of electric charge, we identify the phase action as that of the collective
charge degrees of freedom (the charge density waves), while the SU($N$)-action describes the
spin degrees of freedom.
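
For the gradient term, the decoupling of phase and SU($N$) sectors follows in a few lines. The sketch below assumes the $1/8\pi$ normalization of Eq. (9.57) and the decomposition $g = e^{i2\phi}g'$ with $\det g' = 1$; it covers only the gradient term and does not show that the WZ term is insensitive to the phase.

```latex
\begin{align}
g^{-1}\partial_\mu g &= 2i\,\partial_\mu\phi\,\mathbb{1}_N + g'^{-1}\partial_\mu g', \\
\mathrm{tr}\,(\partial_\mu g\,\partial_\mu g^{-1})
  &= -\mathrm{tr}\,\big[(g^{-1}\partial_\mu g)^2\big]
   = 4N(\partial_\mu\phi)^2 - 4i\,\partial_\mu\phi\;\mathrm{tr}\,(g'^{-1}\partial_\mu g')
     + \mathrm{tr}\,(\partial_\mu g'\,\partial_\mu g'^{-1}), \\
\mathrm{tr}\,(g'^{-1}\partial_\mu g') &= \partial_\mu\ln\det g' = 0
  \quad\Rightarrow\quad
\frac{1}{8\pi}\,\mathrm{tr}\,(\partial_\mu g\,\partial_\mu g^{-1})
  = \frac{1}{8\pi}\,\mathrm{tr}\,(\partial_\mu g'\,\partial_\mu g'^{-1}) + \frac{N}{2\pi}\,(\partial_\mu\phi)^2 .
\end{align}
```

The cross term drops out because $g'$ has unit determinant, which is what makes the charge and spin sectors independent at the level of the free action.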

Renormalization group flow of the WZW model

The action (9.57) describes a free fermion fixed point, if not in an obvious manner. To
understand better the behavior of the WZW model under renormalization, let us generalize
the free fermion action by introducing an arbitrary coupling constant $\lambda^{-1}$ in front of the
gradient term. We thus consider the action

$$S = \frac{1}{\lambda}\int_{S^2}d^2x\,\mathrm{tr}\,(\partial_\mu g\,\partial_\mu g^{-1}) - \frac{i}{12\pi}\int_{B^3}d^3x\,\epsilon^{ijk}\,\mathrm{tr}\,(g^{-1}\partial_ig\,g^{-1}\partial_jg\,g^{-1}\partial_kg), \qquad (9.65)$$

where $g\in\mathrm{SU}(N)$ (and the complementary U(1) action is trivially free). Except for the
presence of the WZ term, the model defined by this action is equivalent to the SU($N$)
nonlinear σ-model studied in Section 8.5. However, the latter is known not to have a fixed
point at finite values of $\lambda$. This tells us that the WZ term must have a crucial impact
on the RG flow. Suppose, then, we started renormalizing the model at small values of $\lambda$.
In this regime, the gradient term dominantly suppresses field fluctuations and we expect
the WZ term to be of little significance. Consequently, the RG flow will initially resemble
that of the standard SU($N$) model - towards larger values of $\lambda$. Eventually, however, $\lambda$
and the coupling constant $i/12\pi$ of the WZ term will become of the same order. At this
point, at last, the coupling constant will interfere with the flow. As we know that
$\lambda = 8\pi$ defines a fixed point, we expect that it will simply truncate the flow of the coupling
constant.

⁷² A. O. Gogolin, A. A. Nersesyan, and A. M. Tsvelik, Bosonization and Strongly Correlated Systems (Cambridge
University Press, 1998).

To confirm this expectation, we need to go through the RG program, at least to one-loop
order. Fortunately, however, the RG analysis of the WZW model (see Problem 9.7.3) almost
exactly parallels that of the standard SU($N$) model. As a result, we can infer the scaling
equation

$$\frac{d\lambda}{d\ln b} = \ldots \qquad (9.66)$$


This result confirms our qualitative expectation: the value $\lambda^* = 8\pi$ defines an (attractive)
fixed point at which the upwards flow of $\lambda$ comes to an end. Using the methods of conformal
field theory, one can indeed show (see Witten’s original paper⁶⁵) that $\lambda^*$ defines an exactly
solvable reference point. (This is important additional information inasmuch as our one-loop
analysis does not rigorously prove that $\lambda^*$ is a fixed point.)
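
The qualitative picture of a flow that grows from weak coupling and is then arrested at $\lambda^* = 8\pi$ can be illustrated with a toy integration. The sketch below does not use the scaling equation of the text: it assumes a schematic right-hand side $c\,\lambda^2\,[1-(\lambda/\lambda^*)^2]$, chosen only because it grows like the plain σ-model flow at small $\lambda$ and vanishes at $\lambda^*$. Both this functional form and the prefactor `c` are assumptions of the illustration, and the function names are hypothetical.

```python
import math

LAMBDA_STAR = 8.0 * math.pi  # fixed-point value quoted in the text


def beta(lam, c=0.05):
    """Schematic flow d(lambda)/d(ln b); the functional form and c are assumptions."""
    return c * lam**2 * (1.0 - (lam / LAMBDA_STAR) ** 2)


def flow(lam0, l_max=60.0, dl=0.01, c=0.05):
    """Integrate the assumed flow from lam0 over l = ln b in [0, l_max] (4th-order Runge-Kutta)."""
    lam = lam0
    for _ in range(int(round(l_max / dl))):
        k1 = beta(lam, c)
        k2 = beta(lam + 0.5 * dl * k1, c)
        k3 = beta(lam + 0.5 * dl * k2, c)
        k4 = beta(lam + dl * k3, c)
        lam += dl * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return lam


# Starting below or above the fixed point, the coupling ends up at lambda* = 8*pi.
print(flow(1.0), flow(40.0), LAMBDA_STAR)
```

Within this toy model the fixed point is attractive from both sides, mirroring the statement that the upward flow of $\lambda$ comes to an end at $\lambda^*$.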


WZW model of interacting fermions 

All we have accomplished so far is a highly complicated reformulation of the trivial free 
fermion problem. However, as we shall see in a moment, this exercise has been far from 
useless: as with the abelian case, it will turn out that the boson language is of unsurpassed 
efficiency when it comes to the discussion of particle interactions. 

As usual, the most relevant particle interactions are mediated by certain four-fermion
operators. Important constraints on the structure of these operators follow, once again, from
the symmetries of the model. Above we have seen that the non-interacting model is invariant
under transformations by U($2N_c$) × U($2N_c$), where the first/second factor acts on spin and
color indices of the left-/right-moving fermions. However, the Hubbard-type interaction
Eq. (9.52) reduces this symmetry. Inspection of the interaction operator shows that chiral
symmetry gets lost (i.e. only transformations that act identically on the left-/right-moving
sector are permitted, U($2N_c$) × U($2N_c$) → U($2N_c$)), and the remaining U($2N_c$) → U(1) ×
SU(2) × SU($N_c$) gets reduced to symmetry transformations that act on the charge/spin/color
sector separately.

There are a number of continuum interaction operators that are compatible with the
symmetries of the lattice system. Consider, for example, the bilinears
$j^q_s = \psi^\dagger_{s\alpha a}\psi_{s\alpha a}$, $j^s_s = \psi^\dagger_{s\alpha a}\boldsymbol{\sigma}_{\alpha\beta}\psi_{s\beta a}$, and
$j^{c,j}_s = \psi^\dagger_{s\alpha a}T^j_{aa'}\psi_{s\alpha a'}$, where $T^j\in\mathrm{su}(N_c)$, $j = 1,\ldots,N_c^2-1$ are the generators
of SU($N_c$) transformations, and $s = +/-$. These are the left-/right-moving components
of the conserved currents $j^{q,s,c} = j_+^{q,s,c}+j_-^{q,s,c}$, generated by U(1), SU(2), and SU($N_c$)
transformations, respectively. As in the abelian case, interactions solely between left- or
right-moving fermions are largely inessential. However, the operators

$$\mathcal{O}_q = j^q_+j^q_-,\qquad \mathcal{O}_s = \mathrm{tr}\,(j^s_+j^s_-),\qquad \mathcal{O}_c = \mathrm{tr}\,(j^c_+j^c_-), \qquad (9.67)$$

are physically relevant and compatible with the symmetries of the model. Another relevant
player is the “umklapp operator”
$(\psi^\dagger_{+\alpha}\sigma^2_{\alpha\beta}\psi^\dagger_{+\beta})(\psi_{-\alpha'}\sigma^2_{\alpha'\beta'}\psi_{-\beta'}) + \text{h.c.}$⁷³ For the
discussion of a few more allowed interaction operators, we refer to the original reference of
Affleck and Haldane.⁶²


EXERCISE Show that the umklapp operator is invariant under the action of all three symmetry 
groups. 

Expressed in terms of the continuum fields, the Hund’s rule coupling Eq. (9.52) translates
to a sum of the interaction operators listed above. The question we now have to answer is
how these operators - which at sufficiently large strength will turn the free fermion model
into the Heisenberg model in which we are interested - affect the long-distance dynamics
of the model. Naturally, we shall address this question in the bosonized language developed
in the previous section. As a warm-up to the case of arbitrary spin, we shall begin with the
discussion of the colorless case, $N_c = 1$, i.e. the spin-1/2 chain.

Writing $g = e^{i2\phi}g'$, where $g'\in\mathrm{SU}(2)$ acts on the spin indices, and using Eq. (9.60) for
the bosonic representation of the currents, we obtain $j^q\sim\partial_z\phi$ and $j^s\sim(\partial_zg')\,g'^{-1}$
for the spin current. Reflecting its invariance under spin transformations, the bosonic
representation of the umklapp operator reads $\sim\lambda_{\rm uk}\cos 4\phi$.

EXERCISE To obtain the bosonic representation of the umklapp operator, substitute Eq. (9.63)
into its definition and obtain $\sim\lambda_{\rm uk}\,e^{i4\phi}\,\mathrm{tr}\,(g'\sigma_2g'^T\sigma_2) + \text{h.c.}\sim\lambda_{\rm uk}\cos(4\phi)$. Here the last equality
is best proven by using the representation $g' = \exp(i\mathbf{v}\cdot\boldsymbol{\sigma})$.
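
The key step of the exercise, namely that $\mathrm{tr}\,(g'\sigma_2g'^T\sigma_2)$ does not depend on $g'\in\mathrm{SU}(2)$, can be checked numerically before proving it. The sketch below builds $g' = \exp(i\mathbf{v}\cdot\boldsymbol{\sigma})$ from the closed form $\cos|\mathbf{v}| + i\sin|\mathbf{v}|\,\hat{\mathbf{v}}\cdot\boldsymbol{\sigma}$ and evaluates the trace for an arbitrary real vector $\mathbf{v}$; the function names are illustrative only.

```python
import numpy as np

# Pauli matrices: SIGMA[k] = sigma_{k+1}
SIGMA = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)


def su2_element(v):
    """Return g = exp(i v.sigma) = cos|v| + i sin|v| (v.sigma)/|v| for a real 3-vector v."""
    v = np.asarray(v, dtype=float)
    norm = np.linalg.norm(v)
    if norm == 0.0:
        return np.eye(2, dtype=complex)
    v_dot_sigma = np.einsum('k,kab->ab', v, SIGMA)
    return np.cos(norm) * np.eye(2) + 1j * np.sin(norm) * v_dot_sigma / norm


def umklapp_trace(v):
    """Evaluate tr(g sigma_2 g^T sigma_2) for g = exp(i v.sigma)."""
    g = su2_element(v)
    return np.trace(g @ SIGMA[1] @ g.T @ SIGMA[1])


print(umklapp_trace([0.3, -1.2, 0.7]))
```

Since $\sigma_2\,g'^T\sigma_2 = g'^{-1}$ for SU(2) matrices, the trace equals 2 for every $g'$, so only the phase factor $e^{i4\phi}$ and its conjugate survive, which is the origin of the $\cos 4\phi$.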


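The action generalized for the presence of these interaction operators takes the form
$S[\phi,g'] = S[\phi] + S[g']$, where

$$S[\phi] = \frac{1}{\pi}(1-\lambda_q)\int d^2x\,\big((\partial_\mu\phi)^2 + C\lambda_{\rm uk}\cos(4\phi)\big),$$

$$S[g'] = S_{\rm WZW}[g'] + \lambda_s\int d^2x\,\mathrm{tr}\,(\partial_\mu g'\,\partial_\mu g'^{-1}). \qquad (9.68)$$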


Notably, the action still decouples into a spin and a charge sector (the separation of spin
and charge characteristic of one-dimensional systems). Further, both $S[\phi]$ and $S[g']$ are old
acquaintances: $S[\phi]$ is the action of the two-dimensional sine-Gordon model. In Section 8.6
we have seen that, for $\lambda_\rho < 0$ (repulsive interactions), it flows towards a phase with a
mass gap. This flow is driven by the umklapp operator. What this tells us is that umklapp
scattering leads to the presence of an excitation gap for charge density waves. This gap
is nothing but the Mott-Hubbard gap present in the spectrum of interacting fermions
at half filling. (Recall that the presence of umklapp processes is tied to the case of half
filling.) Turning to the action $S[g']$, we note that the $\lambda_s$-perturbation merely renormalizes
the coupling constant of the gradient term of $S_{\mathrm{WZW}}$, $1/8\pi \to (1/8\pi) + \lambda_s$. However, in the
previous section, we have seen that this change does not alter the long-range behavior of the
system: irrespective of the value of the coupling constant, the model will flow back towards
the free fermion fixed point at $\lambda^* = 1/8\pi$.

Recall that umklapp scattering is the scattering of two fermion states of opposite spin from one point of the
Fermi surface ($\pm k_F$) to the other ($\mp k_F$). At half filling, $k_F = \pi/2a$, the momentum transferred in this process
is of magnitude $2(\pi/2a - (-\pi/2a)) = 2\pi/a = G$, where $G = 2\pi/a$ is the reciprocal lattice vector. Since lattice
momentum is conserved only up to multiples of $G$, umklapp scattering (at half filling) is a permissible process.

Summarizing, we have found that the $N_c = 1$ interacting fermion system builds up a
Mott-Hubbard gap for its charge excitations while the spin excitations are described by
a critical WZW theory. Now, our entire analysis was based on the presumed equivalence
($N_c = 1$ interacting fermion system) $\leftrightarrow$ ($S = 1/2$ Heisenberg chain). The existence of
long-range excitations in the former then implies that the latter must be in an ordered phase
(since a disordered phase would be defined by the absence of long-range excitations). This,
however, is a result with which we are familiar. 74 The truly interesting question is what
happens for larger values of the spin (i.e. larger values of $N_c$).

At $N_c > 1$, the structure of the theory gets significantly more involved. Referring for a
more detailed analysis to the original paper by Affleck and Haldane, 62 we restrict ourselves
here to a qualitative discussion of the most relevant aspects.

▷ Consider a matrix field $g' \in \mathrm{SU}(2) \times \mathrm{SU}(N_c)$. 75 Among the various interaction operators
there is one that drives excitations of the color sector into a massive phase. This means
that the soft excitations of the model can be parameterized as $g' = g \otimes \mathbb{1}_{N_c}$, where $g$ acts
in the spin sector. Substituting these configurations into the free fermion reference action,
we obtain $S_{\mathrm{WZW}}[g'] = N_c S_{\mathrm{WZW}}[g]$, where the prefactor $N_c$ arises from the tracing out
of the color sector. Remembering our discussion of the quantization of the WZ coupling
constant in Section 9.4.2, we identify this model as a WZW model of level $k = N_c$.
▷ Among the plethora of conceivable perturbations of the critical theory, one operator family
deserves special attention: $\mathrm{tr}(g^n)$, where $g \in \mathrm{SU}(2)$. (Here, we are writing $g$ instead of
$g'$ for notational simplicity and $n$ is some integer.) For two reasons, these are interesting
operators: firstly, they represent the most relevant perturbations of the theory (due to
the absence of derivatives); secondly, they relate to an important discrete symmetry of
the model. To understand why, let us return to the lattice version of the theory and
consider the fermion representation of the spin operator $S_l^a = \psi^\dagger_{l\alpha} \sigma^a_{\alpha\beta} \psi_{l\beta}$. Decomposing
the fermion into left- and right-moving components, $\psi_l = e^{ik_F x_l} \psi_+(x_l) + e^{-ik_F x_l} \psi_-(x_l)$,
where $x_l = la$, we obtain
$S_l^a = [\psi_+^\dagger \sigma^a \psi_+ + \psi_-^\dagger \sigma^a \psi_-] + [(-)^l \psi_+^\dagger(x_l) \sigma^a \psi_-(x_l) + \mathrm{h.c.}] = \mathrm{tr}([c(j_+ + j_-) + c'(-)^l (g + g^{-1})]\sigma^a)$,
where $c$ and $c'$ are numerical constants. Here, we have
used the relation $e^{2ik_F x_l} = e^{i(\pi/a) l a} = (-)^l$ as well as the bosonization identity Eq. (9.63).
The second term under the trace is of particular interest. It tells us that translation by
one site, $l \to l + 1$, corresponds to a sign change of the field $g$. This observation can be
read in different ways. For example, contributions to the action that are not invariant
under $g \to -g$ explicitly break translational invariance on the lattice. Similarly, a
ground state that is not sign invariant cannot be translationally invariant, etc.

74 Recall that the $S = 1/2$ chain can be subjected to a Jordan-Wigner transformation whereupon it becomes a
model of spinless interacting fermions. The existence of gapless excitations (charge density waves) thus proves
the existence of long-range order in the spin chain.

75 We will ignore the gapped charge sector throughout.

▷ After these preparatory remarks, let us return to the discussion of the operators $\mathrm{tr}(g^n)$.
As we are not interested in modeling situations where the Hamiltonian explicitly breaks
translational invariance, only contributions with $n$ even will be allowed. Physically, these
operators correspond to products of $n$ left-moving and $n$ right-moving fermion states.
In the case considered previously, $N_c = 1$, there is only one such contribution, namely
the backscattering operator discussed above. However, for $N_c > 1$, terms $\mathrm{tr}(g^{2n \le N_c})$ are
physically allowed. Let us, then, consider the action $S[g] = N_c S_{\mathrm{WZW}}[g] + \lambda \int d^2x \, \mathrm{tr}(g^2)$. 76
Void of derivatives, the new contribution acts as a potential contribution to the action
(although it is still invariant under the SU(2) symmetry transformation $g \to h^{-1} g h$). This
being so, the physical behavior of the model crucially depends on the sign of the coupling
constant $\lambda$.

▷ For $\lambda < 0$, the term $\lambda \int \mathrm{tr}\, g^2$ favors the mean-field configurations $\bar g = 1$ or $\bar g = -1$.
Both ground states break the discrete sign inversion symmetry $g \to -g$ and, therefore,
they cannot be translationally invariant. Before identifying the physical meaning of these
ground states, let us briefly discuss the issue of fluctuations. Writing $g = e^{i\phi^a \sigma^a/2} \bar g$ and
expanding to quadratic order in $\phi$, we obtain a mass term $|\lambda| \int d^2x \, (\phi^a)^2$. This means
that spin fluctuations around $\bar g$ are gapped. There is, indeed, one well known low-energy
state of the spin chain that displays these features, the dimer phase. In this phase
(see Fig. 9.15), spins at neighboring sites form spin singlets. Obviously, this state is not
invariant under translation by one site (while translation by two sites is a symmetry).
Further, there is no room for massless spin fluctuations, i.e. the system indeed shows an
excitation gap.

▷ We next discuss the complementary case, $\lambda > 0$. In this case, configurations $\bar g$ with
eigenvalues $(i, -i)$ are energetically favored. There is a continuous family of such states, namely
$\bar g = \exp(i\pi\, \mathbf{n} \cdot \boldsymbol{\sigma}/2)$, where $|\mathbf{n}| = 1$. Again, these states break translational invariance
($\bar g \to -\bar g$). However, they also break the SU(2) symmetry $g \to h g h^{-1}$. This means that, unlike
the dimerized case, the continuous spin rotation symmetry is spontaneously broken. Its
breaking of both translational and spin rotational invariance identifies $\bar g$ as the Néel
state of the chain.

▷ To describe the physics of the Goldstone modes corresponding to the symmetry broken
state, we note that $h \exp(i\pi\, \mathbf{n} \cdot \boldsymbol{\sigma}/2) h^{-1} = \exp(i\pi\, \mathbf{n}' \cdot \boldsymbol{\sigma}/2)$, where the unit vector $\mathbf{n}' = R\mathbf{n}$ and
$R \in \mathrm{SO}(3)$ is a rotation matrix canonically corresponding to the SU(2)-matrix $h$. 77 Each
$\mathbf{n}'$ defines a new ground state, i.e. we have identified the 2-sphere as the Goldstone mode
manifold.
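The role of the sign of $\lambda$ can be checked numerically. The following is a minimal sketch, assuming the parameterization $g = \exp(i\phi\, \mathbf{n} \cdot \boldsymbol{\sigma}/2)$ with a unit vector $\mathbf{n}$; the helper names (`su2_element`, `trace_g_squared`, `favored_angle`) are illustrative and not part of the text. It evaluates $\mathrm{tr}(g^2) = 2\cos\phi$ by explicit matrix multiplication and scans for the angle that minimizes the potential $\lambda\, \mathrm{tr}(g^2)$.

```python
import math

def n_dot_sigma(n):
    """2x2 matrix n.sigma for a real 3-vector n, stored as nested tuples."""
    n1, n2, n3 = n
    return ((complex(n3), complex(n1, -n2)),
            (complex(n1, n2), complex(-n3)))

def su2_element(phi, n):
    """g = exp(i*phi*n.sigma/2) = cos(phi/2)*1 + i*sin(phi/2)*n.sigma, valid for |n| = 1."""
    c, s = math.cos(phi / 2), math.sin(phi / 2)
    ns = n_dot_sigma(n)
    return tuple(tuple((c if i == j else 0.0) + 1j * s * ns[i][j]
                       for j in range(2)) for i in range(2))

def matmul(a, b):
    """Product of two 2x2 matrices."""
    return tuple(tuple(sum(a[i][k] * b[k][j] for k in range(2))
                       for j in range(2)) for i in range(2))

def trace_g_squared(phi, n):
    """Real number tr(g^2) for g = exp(i*phi*n.sigma/2)."""
    g = su2_element(phi, n)
    g2 = matmul(g, g)
    return (g2[0][0] + g2[1][1]).real

def favored_angle(lam, n=(0.0, 0.0, 1.0), steps=360):
    """Angle phi on a grid over [0, 2*pi] that minimizes lam * tr(g^2)."""
    grid = [2 * math.pi * k / steps for k in range(steps + 1)]
    return min(grid, key=lambda phi: lam * trace_g_squared(phi, n))
```

For $\lambda < 0$ the scan selects $g = \pm 1$ (where $\mathrm{tr}\, g^2 = 2$), while for $\lambda > 0$ it selects $\phi = \pi$, i.e. $g = i\, \mathbf{n} \cdot \boldsymbol{\sigma}$ with eigenvalues $(i, -i)$ and $\mathrm{tr}\, g^2 = -2$, independently of the direction $\mathbf{n}$.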

EXERCISE Try to guess what the action of the Goldstone modes might be! 

To identify the Goldstone mode action, we substitute the soft field configurations
$\exp(i\pi\, \mathbf{n}(x) \cdot \boldsymbol{\sigma}/2) = i\, \mathbf{n} \cdot \boldsymbol{\sigma}$ into the action. It is straightforward to verify that the gradient
term of the WZW action (including the prefactor $N_c$) simply becomes $\frac{N_c}{4\pi} \int d^2x \, (\partial \mathbf{n})^2$. As to the WZ
functional, we show in Problem 9.7.2 that, evaluated on $i\, \mathbf{n} \cdot \boldsymbol{\sigma}$, it reduces to an integral of
$\mathbf{n} \cdot (\partial_1 \mathbf{n} \times \partial_2 \mathbf{n})$ over the 2-sphere, equal to $\pi N_c S_{\mathrm{top}}[\mathbf{n}]$,
i.e. the $\theta$-term of the 2-sphere. Recalling that $N_c = 2S$, we conclude that, for large values
of $S$ (this is where the massive fluctuations around the Goldstone mode manifold can be
neglected), the system is controlled by the action of the O(3) nonlinear $\sigma$-model with
topological angle $2\pi S$,

$$S[\mathbf{n}] = \frac{S}{2\pi} \int d^2x \, (\partial \mathbf{n})^2 + 2\pi S \, S_{\mathrm{top}}[\mathbf{n}], \tag{9.69}$$

the same $S[\mathbf{n}]$ identified earlier on semi-classical grounds as the low-energy action of the
spin chain.

76 The operator $\mathrm{tr}(g^2)$ can be shown to be the most relevant of the family.

77 To any element $h \in \mathrm{SU}(2)$, the prescription above assigns an element $R \in \mathrm{SO}(3)$. The correspondence is 2-1
and not 1-1 because both $h$ and $-h$ map onto the same $R$.

Figure 9.15 Dimerized phase of the spin chain (a) vs. Néel phase (b).

This is now a good point to pause and consider what we have obtained. At first sight, it
seems as if nothing much has been achieved: as a result of a long series of derivations we
have arrived back at the semi-classical representation of the chain, the $\sigma$-model. However,
it turns out that it is the interplay between the WZW and the $\sigma$-model that enables us
to really understand the physics of the problem. To see this, let us consider the $S = 1/2$
chain in the presence of a next-nearest neighbor ferromagnetic coupling - admittedly an
artificial model but, for the sake of the present argument, this does not matter. One can show
that the long-range physics of this system is described by the O(3)-model with topological
angle $2\pi S = \pi$. On the other hand, we might have approached the problem via the WZW
route discussed above. Within that context, the ferromagnetic perturbation turns out to be
irrelevant (see Affleck and Haldane 62 ), i.e. the long-range physics of the system is described
by the critical free fermion WZW action. Comparing these two findings we can conclude
that


The O(3) nonlinear $\sigma$-model with topological angle $\pi \times$ (odd integer) is equivalent
to the SU(2) WZW action at critical coupling,


(although no explicit field theoretical proof of this equivalence is known). This was the last
missing piece of information needed to understand the long-range behavior of the spin chain.
To summarize, we have found that (see Fig. 9.16) the antiferromagnetic spin $S$ chain can
be described in terms of a perturbed WZW action of level $k = 2S$. Depending on the sign
of the most relevant perturbation $\sim \lambda \int \mathrm{tr}(g^2)$ (which is set by the material parameters of
the problem), this model can be either in a globally gapped phase - the dimer phase of the
chain - or in a Néel phase. The fluctuations superimposed on the Néel phase are described
by an O(3) model with topological angle $2\pi S$. For integer spin, this model flows towards a
strong coupling phase, i.e. the spin chain is in a disordered state. However, for $S$ half-integer,
it becomes equivalent to the level 1 WZW model at criticality, implying ordered behavior
of the chain.

Figure 9.16 On the long-range physics of the antiferromagnetic spin chain. For a discussion, see
the main text.
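The classification just summarized can be condensed into a few lines. This is only a bookkeeping sketch of the statements above (level $k = 2S$, topological angle $2\pi S$ taken mod $2\pi$), not a calculation; the function name `spin_chain_summary` and the returned labels are illustrative assumptions.

```python
from fractions import Fraction

def spin_chain_summary(spin):
    """Summarize the field-theory data quoted in the text for a spin-S chain.

    spin: the spin quantum number S, e.g. Fraction(1, 2), 1, Fraction(3, 2).
    Returns the WZW level k = 2S, the topological angle 2*pi*S reduced
    mod 2*pi (in units of pi), and the resulting sigma-model behavior.
    """
    two_s = Fraction(spin) * 2
    if two_s.denominator != 1 or two_s <= 0:
        raise ValueError("S must be a positive integer or half-integer")
    level = int(two_s)
    half_integer = level % 2 == 1
    return {
        "level": level,
        # 2*pi*S mod 2*pi is pi for half-integer S and 0 for integer S
        "theta_over_pi": 1 if half_integer else 0,
        "sigma_model": ("critical (level 1 WZW)" if half_integer
                        else "gapped (strong coupling)"),
    }
```

For example, `spin_chain_summary(Fraction(1, 2))` reports level 1 and angle $\pi$, while `spin_chain_summary(1)` reports level 2 and a trivial angle, in line with the distinction between half-integer and integer spin drawn above.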

Notice that none of the conclusions summarized above was established rigorously. More
often than not we had to trust in the principle of adiabatic continuity, i.e. the belief that the
physics of a system does not change qualitatively upon interpolating between the regime
of weak interactions (where various approximation schemes work) and the regime of strong
interactions (in which we are actually interested). Similarly, connections between different
theories were constructed on the basis of symmetry arguments and indirect reasoning
(rather than by “hard-boiled” calculations). In this way various pieces of evidence were
pieced together to form a network that was intrinsically consistent and made physical sense.
In recent years, this type of semi-quantitative research has become more and more prevalent
in various areas of condensed matter physics. This development is driven by the increasing
complexity of the questions and, relatedly, the absence of straightforward perturbative
schemes. In the next section, we shall turn to another problem field where such “detective
work” has been successful, the fractional quantum Hall effect.


9.5 Chern-Simons terms

As our third and last example of topological field theories with relevance to condensed 
matter physics, we now turn to the discussion of Chern-Simons (CS) theories. However, 
deviating from the strategy pursued in previous sections, this time we do not begin with 
a formal analysis of the underlying geometrical framework. Instead, we directly turn to 
a review of the master application of CS field theory in condensed matter physics, the 
fractional quantum Hall effect. It will then turn out that, once we have left the qualitative 
level and turned to the field integral description, we readily wind up in the basin of attraction 
of CS field theory.

9.5.1 Fractional quantum Hall effect (FQHE)


In Section 9.3.4 we interpreted the QHE as a topological phenomenon. Depending on the 
chosen perspective, the integer number setting the Hall conductance can be interpreted 
as the number of occupied edge channels or, more formally, as the number of instanton 
excitations in Pruisken’s field theory. Both figures appear to be largely impervious to changes 
in the parameters of the system. 

In essence, the basic mechanism behind the formation of integer-valued Hall conductance 
had been understood shortly after the discovery of the effect. It thus came as a surprise when 
Tsui et al. 78 discovered a sequence of plateaus at fractional values of the Hall conductance. 
More specifically, it turned out that: 

▷ Fractional values $\sigma_{xy} = \nu \frac{e^2}{h}$, $\nu = \frac{n}{m}$, of the Hall conductance are observed only in the
purest samples. This indicates that, unlike with the integer effect, disorder does not
stabilize the FQHE.

▷ Not every rational $n/m$ qualifies as a plateau value. The most prominent plateaus are
observed for the “principal sequence” $1/m$, where $m$ is odd. More generally, plateaus have
been observed for $\nu = \frac{p}{2sp + 1}$, $s, p \in \mathbb{N}$. 79

▷ Curiously, it has turned out that, at certain even-denominator fractions (formally, the
limit $p \to \infty$ in the hierarchy) the system largely behaves as if no magnetic field were
present at all! For instance, in the vicinity of $\nu = 1/2$, pronounced Shubnikov-de Haas
oscillations (otherwise shown by Fermi liquids subject to a weak field) are observed.

At first sight, the coexistence of phenomena of that degree of complexity with the (seemingly 
so robust) integer QHE may cause some consternation. But, then, let us recall that we are 
considering an isolated Landau level at fractional filling! In other words, we are considering a 
hugely degenerate quantum state that is only partially populated. Within that environment, 
a macroscopic restructuring of the electron gas can be afforded at little cost (which will 
be set by “residual” mechanisms such as electron-electron interactions and/or disorder 
scattering). On the same footing, it is clear that perturbative expansions around any given 
trial ground state will be pretty fragile. (Perturbations enjoy a huge phase volume while 
“energy denominators” are small.) 

Keeping these things in mind, it is perhaps no longer surprising that dramatic things happen
in the fractionally filled Landau level. The considerations above suggest that elementary
electrons - subject to the full strength of Coulomb interactions, at fully “quenched” kinetic 
energy - will hardly qualify as stable elementary excitations of the system. At the same 
time, the appearance of Fermi-liquid-like states at least for some filling fractions suggests 
that the dominant players in the game are elementary fermionic particles. 


78 D.C. Tsui, H.L. Stormer, and A.C. Gossard, Two-dimensional magnetotransport in the extreme quantum limit,
Phys. Rev. Lett. 48 (1982), 1559-62.

79 In fact, the two-parameter “hierarchy” defined by the right-hand side of this equation still is not general enough
to account for all experimentally observed values. (The prominent exception is a fragile plateau at $\nu = 5/2$.)
However, to explain these anomalous values of the Hall conductance, one has to keep track of the polarization
of the electron spin, an extra level of complexity which we would here rather avoid.

Figure 9.17 Illustrating the idea of the composite fermion approach. Imagine that an even number
of flux quanta constituting the magnetic field get tied to the electrons in the system. The new
composite particles (“electron + (2n) flux quanta”) continue to be fermions. They only see the
remaining “free” flux quanta, i.e. a reduced external field (indicated by the lighter shading in part
(b)).

Remarkably, it is not at all difficult 80 to conceive of a fermionic quasi-particle picture
wherein - at least at a mean-field level - the plethora of phenomena above admits a
straightforward explanation: imagine the external magnetic field as a large number of flux tubes
piercing the plane of the two-dimensional electron gas (see Fig. 9.17). The ratio $\nu = N/N_\phi$,
where $N$ denotes the number of electrons and $N_\phi$ the number of flux quanta, defines the filling
fraction of the system. Since we are working under “fractional” conditions, $0 < \nu < 1$, the number
of flux quanta exceeds the number of charge carriers. 81 Now assume that, by some mechanism, each
electron matches up with an even number of flux quanta to form a composite particle. (By way of example,
consider $\nu = 1/3$, in which case there would be enough flux quanta around to let every
electron pair with two flux tubes.) What can be said about the properties of these composite
objects?

▷ Firstly, they would still be fermions. To understand why, recall that the statistics of
particles can be probed by exchanging their position in space. However, our composite
particles and the original electrons differ only in the presence of an even number of integer
flux tubes. Upon exchange, the flux tubes give rise to additional phase factors which, for an
even number of flux quanta, equal unity, so that our new particles are again fermionic: one
speaks of composite fermions (CFs).

▷ The CFs see an effectively reduced external field. For example, for $\nu = 1/3$, each electron has
absorbed two flux quanta. Thus, the residual field seen by the CFs is three times lower than
the original field. In other words, the number of remaining flux quanta, $N_\phi - 2N = N$, is equal
to the number of CFs. Forgetting for a moment about the origin of the composite particles,
we are considering a large number $N$ of (composite) fermions subject to $N$ flux quanta, i.e.
a situation where the integer QHE should arise. This picture suggests an interpretation of
the FQHE as an IQHE of composite fermions. 82 In the specific case of a half-filled ($\nu = 1/2$)
band, $N_\phi - 2N = 0$, i.e. the CFs experience a mean-field of vanishing strength. This nicely
conforms with the experimental observation of Fermi-liquid-like behavior (no QHE) close to
half filling.

▷ We have seen in Section 9.3.4 that, when a flux quantum is adiabatically pushed through
an annular quantum Hall geometry, an electron charge flows from the inner to the outer
perimeter of the sample. In a way, the phase vortex created by the addition of the gauge
flux leads to an expulsion of electronic charge. Similarly, the flux tubes involved in the
construction of the CF picture effectively carry a positive charge $\nu$. These “screen” the
charge carried by the bare electron, so that the CF is not exposed to the full strength of
the unit-charge Coulomb interaction.

80 However, our discussion should not deceive the reader about the fact that the composite fermion picture
represents the outgrowth of years of most intensive research!

81 If $\nu > 1$, $\nu - [\nu]$, where $[\nu]$ is the largest integer smaller than $\nu$, sets the filling fraction of the highest occupied
Landau level and our discussion applies to that level.

82 More generally, let us assume that every fermion binds $2s$ flux quanta to it. Further suppose that $N/(N_\phi - 2sN) = p$,
i.e. that $p$ Landau levels of the residual field are occupied. In this case, the CFs will also display the IQHE. This
happens for filling fractions $\nu = N/N_\phi = p/(2sp + 1)$. In essence, this simple picture explains the structure of the
rationals where the FQHE is observed.

In the history of the FQHE, the introduction of the CF picture by Jain 83 was preceded
by a number of other important developments. Shortly after the experimental discovery
of the effect, Laughlin 84 proposed a trial wavefunction which, in a close-to-optimal
way, 85 minimizes the Coulomb repulsion between the quasi-particles in the lowest Landau
level. Most of the concepts central to the subsequent introduction of the CF picture -
fractionally charged quasi-particles, incompressibility of the QH state, the importance of
correlations, etc. - have effectively been motivated by this trial wavefunction. Later, the
theory of the (many-body) CF system was formulated in terms of a Chern-Simons type
field theory. 86 This effective field theory has become the basis of many subsequent analyses
of FQHE phenomena.

In this text, we shall turn the sequence of historical developments upside down: starting
from a field theoretical description, we shall identify the CF degrees of freedom and then
rediscover Laughlin’s wavefunction. Once these structures are in place, the computation of
the fractional Hall conductance will be little more than a straightforward exercise.

83 J. K. Jain, Composite-fermion approach for the fractional quantum Hall effect, Phys. Rev. Lett. 63 (1989),
199-202.

84 R. B. Laughlin, Anomalous quantum Hall effect: an incompressible quantum fluid with fractionally charged
excitations, Phys. Rev. Lett. 50 (1983), 1395-8.

85 For certain short-range correlated model interactions, the Laughlin wavefunction can even be shown to be an
exact ground state, see F. D. M. Haldane, Fractional quantization of the Hall effect: a hierarchy of incompressible
quantum fluid states, Phys. Rev. Lett. 51 (1983), 605-8.

86 A. Lopez and E. Fradkin, Fractional quantum Hall effect and Chern-Simons gauge theories, Phys. Rev. B 44
(1991), 5246-62.


9.5.2 Chern-Simons field theory: construction

Consider the Hamiltonian of two-dimensional interacting electrons subject to a perpendicular
magnetic field of strength $B$: $H = H_0 + H_{\mathrm{int}}$, where

$$H_0 = \int d^2x \, a^\dagger(x) \left[ \frac{1}{2m} (-i\partial_x + A_{\mathrm{ext}})^2 + V(x) \right] a(x),$$
$$H_{\mathrm{int}} = \frac{1}{2} \int d^2x \, d^2x' \, (\hat\rho(x) - \rho_0) V_{\mathrm{int}}(x - x') (\hat\rho(x') - \rho_0).$$

Here, $A_{\mathrm{ext}} = \frac{B}{2}(y, -x)^T$ is the vector potential of the external magnetic field (symmetric
gauge), $V$ is a single-particle potential created by the presence of, for example,
impurities, and $H_{\mathrm{int}}$ describes the particle interaction on the background of a constant
counter-density $\rho_0$.
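Before the flux attachment is implemented formally, the counting argument of Section 9.5.1 can be made concrete. The sketch below uses exact rational arithmetic to compute the residual flux $N_\phi - 2sN$ seen by the composite fermions and the filling fractions $\nu = p/(2sp + 1)$; the function names are illustrative choices, and the particle numbers in the usage note are arbitrary.

```python
from fractions import Fraction

def residual_flux(n_electrons, n_flux, s):
    """Flux quanta left over after each electron has bound 2s of them."""
    return n_flux - 2 * s * n_electrons

def composite_fermion_filling(n_electrons, n_flux, s):
    """Filling fraction of the CFs with respect to the residual field.

    Returns None when the residual field vanishes (the Fermi-liquid-like case).
    """
    rest = residual_flux(n_electrons, n_flux, s)
    return None if rest == 0 else Fraction(n_electrons, rest)

def hierarchy_filling(s, p):
    """Electron filling fraction at which the CFs fill p Landau levels."""
    return Fraction(p, 2 * s * p + 1)
```

With, say, 100 electrons and 300 flux quanta ($\nu = 1/3$, $s = 1$), `composite_fermion_filling` returns 1, i.e. one filled CF Landau level; with 200 flux quanta ($\nu = 1/2$) the residual flux vanishes and the function returns `None`. The sequence `hierarchy_filling(1, p)` for p = 1, 2, 3 gives 1/3, 2/5, 3/7.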


Singular gauge transformation 

After our discussion in the previous section, we do not expect bare electrons to be a useful
reference for the construction of a low-energy field theory. Certainly, it will be more promising
to start out from composite fermions as discussed above. As we have seen, a CF is an
electron with $2s$ integer flux tubes attached to it. Other fermions moving around the CF
along a circular contour will acquire the winding phase $2s\phi$ indicative of the presence of $2s$
flux quanta. Within a first-quantized framework, these phase vortices can be attached 87 to
the position of each fermion by virtue of the “gauge transformation”

$$\Psi(x_1, \ldots, x_N) \to \Psi(x_1, \ldots, x_N) \exp\left[ -2is \sum_{i<j} \arg(x_i - x_j) \right], \tag{9.70}$$

where $\Psi$ is the many-body wavefunction, and $\arg(x) = \tan^{-1}(x_2/x_1)$ is the angle enclosed
between $x$ and the positive real axis. The transformation Eq. (9.70) becomes singular
whenever two coordinates $x_i \to x_j$ approach each other, i.e. it does not represent an orthodox
gauge transformation. In fact, the vector potential corresponding to the phase factor
above, $a = -2s\, \partial_x \sum_i \arg(x - x_i) = -2s \sum_i \frac{(x_1 - x_{i,1}) e_2 - (x_2 - x_{i,2}) e_1}{|x - x_i|^2}$, creates a perpendicular
magnetic field 88 of strength

$$b = \epsilon_{ij} \partial_i a_j = -4\pi s \sum_i \delta(x - x_i), \tag{9.71}$$

i.e. the field corresponding to $2sN$ flux tubes centered at the coordinates of the fermions.
Summarizing, the singular gauge transformation above converts the $N$ fermions into a
system of CFs (fermions with $2s$ flux lines attached).
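The flux content of Eq. (9.71) can be verified numerically for a single fermion. The sketch below, with illustrative function names and an arbitrary discretization, integrates the vector potential $a = -2s\, \partial_x \arg(x - x_0)$ around a circle centered on the fermion position; by Stokes' theorem the circulation equals the enclosed flux, which should come out as $-4\pi s$ regardless of the radius.

```python
import math

def attached_vector_potential(x, y, s, x0=0.0, y0=0.0):
    """a = -2s * grad arg(x - x0), with grad arg = (-dy, dx) / r^2."""
    dx, dy = x - x0, y - y0
    r2 = dx * dx + dy * dy
    return (2 * s * dy / r2, -2 * s * dx / r2)

def circulation(s, radius=1.0, steps=2000):
    """Riemann-sum approximation of the line integral of a around a circle."""
    dt = 2 * math.pi / steps
    total = 0.0
    for k in range(steps):
        t = k * dt
        x, y = radius * math.cos(t), radius * math.sin(t)
        ax, ay = attached_vector_potential(x, y, s)
        # tangent vector of the counter-clockwise circle times dt
        total += ax * (-radius * math.sin(t) * dt) + ay * (radius * math.cos(t) * dt)
    return total
```

Calling `circulation(1)` or `circulation(1, radius=0.01)` returns approximately $-4\pi$ in both cases: the flux is concentrated entirely at the position of the fermion, as expected for a $\delta$-function field.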


Derivation of the Chern-Simons action

Within the framework of the second quantization, the transformation Eq. (9.70) amounts
to the replacement

$$a^\dagger(x) \to a^\dagger(x) \exp\left[ -2is \int d^2x' \, \arg(x - x') \hat\rho(x') \right],$$

where, as usual, $\hat\rho = a^\dagger a$.

EXERCISE Check that, with this definition, a quantum many-body wavefunction
$|\Psi\rangle \equiv \prod_{i=1}^{N} a^\dagger_{\lambda_i} |0\rangle$ transforms according to Eq. (9.70). (The index $\lambda$ refers to the states $|\lambda\rangle$ of a suitably
chosen single-particle basis [e.g. a basis of Landau states].)

Substituting the transformed operators into the Hamiltonian, one obtains

$$H_0 \to \int d^2x \, a^\dagger(x) \left[ \frac{1}{2m} (-i\partial_x + A)^2 + V(x) \right] a(x),$$

where $A = A_{\mathrm{ext}} + a$ and

$$a(x) = -2s \int d^2x' \, \frac{(x_1 - x_1') e_2 - (x_2 - x_2') e_1}{|x - x'|^2} \, \hat\rho(x'). \tag{9.72}$$

87 In fact, the attachment of these phases is equivalent to placing flux quanta at the winding center.

88 This would, of course, not be possible for a non-singular gauge transformation.


At this stage it is convenient to switch to a real-time field integral representation. We
thus introduce the partition function $Z = \mathcal{N} \int D(\bar\psi, \psi) \, e^{iS[\bar\psi, \psi]}$, where the action $S[\bar\psi, \psi] =
S_0[\bar\psi, \psi] + S_{\mathrm{int}}[\bar\psi, \psi]$, with

$$S_0[\bar\psi, \psi] = \int dt \, d^2x \, \bar\psi \left[ i\partial_t + \mu - \frac{1}{2m} (-i\partial_x + A[\bar\psi, \psi])^2 - V(x) \right] \psi,$$
$$S_{\mathrm{int}}[\bar\psi, \psi] = -\frac{1}{2} \int dt \int d^2x \, d^2x' \, (\rho(x) - \rho_0) V_{\mathrm{int}}(x - x') (\rho(x') - \rho_0),$$

is obtained in the usual way by trading field operators for coherent state amplitudes (specifically,
$\rho = \bar\psi\psi$ and $A[\bar\psi, \psi] = A_{\mathrm{ext}} + a[\bar\psi, \psi]$). Thanks to the presence of the vector potential
$A[\bar\psi, \psi]$, the kinetic energy operator has become a pretty unpleasant object, depending
non-locally on up to six field amplitudes $\bar\psi, \psi$. To avoid this complication 89 let us shift
the nonlinearities implied by the singular gauge transformation to some other place in the
action. This can be achieved by promoting the vector potential to an integration variable
whose value is set so as to generate the flux pattern: i.e. multiply the partition function
by $1 = \mathcal{N} \int Da_\perp \prod_{x,t} \delta(b(x, t) + 4\pi s \rho(x, t))$, where $b = \epsilon_{ij} \partial_i a_{\perp, j}$ and the subscript “$\perp$”
indicates that the integration extends only over transversal configurations of the vector
potential (that is, configurations obeying $\partial_i a_i = 0$). 90 As a result, we obtain the double
functional integral,


$$Z = \mathcal{N} \int D(\bar\psi, \psi) \, Da_\perp \prod_{x,t} \delta(b(x, t) + 4\pi s \rho(x, t)) \exp\left( iS[\bar\psi, \psi, a_\perp] \right)$$
$$= \mathcal{N} \int D(\bar\psi, \psi) \, Da_\perp \, D\phi \, \exp\left( iS[\bar\psi, \psi, a_\perp] - i \int d^2x \, dt \, \phi \left( \frac{b}{4\pi s} + \rho \right) \right)$$
$$\equiv \mathcal{N} \int D(\bar\psi, \psi) \, Da_\perp \, D\phi \, \exp\left( iS_{\mathrm{CF}}[\bar\psi, \psi, a_\perp, \phi] + i\frac{\theta}{2} S_{\mathrm{CS}}[a_\perp, \phi] \right),$$

where we have introduced the common shorthand notation $\theta = 1/2\pi s$, the action
$S[\bar\psi, \psi, a_\perp] \equiv S[\bar\psi, \psi]\big|_{A_{\mathrm{ext}} + a[\bar\psi, \psi] \to A_{\mathrm{ext}} + a_\perp}$ is obtained by replacing the fixed vector potential
$a[\bar\psi, \psi]$ by the integration variable $a_\perp$, 91

$$S_{\mathrm{CF}}[\bar\psi, \psi, a_\perp, \phi] \equiv \int d^2x \, dt \, \bar\psi \left( i\partial_t + \mu - \phi - \frac{1}{2m} (-i\partial_x + A)^2 - V \right) \psi + S_{\mathrm{int}}[\bar\psi, \psi], \tag{9.73}$$
$$S_{\mathrm{CS}}[a_\perp, \phi] \equiv -\int d^2x \, dt \, \phi \, \epsilon_{ij} \partial_i a_{\perp, j}, \tag{9.74}$$

and $A = A_{\mathrm{ext}} + a_\perp$. At this stage, we have fulfilled our intermediate goal; a field integral
representation has been derived wherein particles are tightly bound to fluxes. There is,
however, something unsatisfactory about the present representation of the theory: the way
the variables $a_\perp$ and $\phi$ enter the action (9.74) suggests an interpretation of the theory as one
of fermions coupled to a (2 + 1)-dimensional electromagnetic gauge potential $a_\mu = (\phi, a_\perp)$.
However, the action $S_{\mathrm{CF}} + S_{\mathrm{CS}}$ falls short of the two standard criteria any decent theory
of electromagnetism should obey: (i) in its present representation, it has nothing to say
about the longitudinal degrees of freedom $a_\parallel$ of the vector potential and, relatedly, (ii) the
contribution $S_{\mathrm{CS}}$ is not gauge invariant (which means that, using a covariant representation
$x = \{x_\mu\} = (x_0 \equiv t, x_1, x_2)$, under a transformation $a_\mu \to a_\mu + \partial_\mu f$, $\mu = 0, 1, 2$, it changes
value).

89 Technically, it is not advisable to disturb the structure of the most basic operator of the theory.

90 This latter condition is necessary because the $\delta$-distribution does not fix the longitudinal, or gauge, freedoms of
the potential. In two dimensions, the decomposition of the vector potential into longitudinal ($\phi$) and transverse
($\theta$) components is achieved by setting $a_i = \partial_i \phi + \epsilon_{ij} \partial_j \theta$.

We now claim that the action $S_{\mathrm{CS}}$ possesses a natural gauge-invariant extension, namely
the well-studied Chern-Simons action 92

$$S_{\mathrm{CS}}[a] = -\frac{1}{2} \int d^3x \, \epsilon^{\mu\nu\sigma} a_\mu \partial_\nu a_\sigma. \tag{9.75}$$

Firstly, a straightforward integration by parts shows that $S_{\mathrm{CS}}$ is gauge invariant. Secondly,
one verifies (exercise!) that, for a purely transverse configuration $a_\perp = (\phi, \partial_2\theta, -\partial_1\theta)$,
$S_{\mathrm{CS}}[a_\perp]$ reduces to the form given in Eq. (9.74). Put differently, the prototypical action
(9.74) is but the gauge-invariant Chern-Simons action evaluated in a particular gauge,
namely the Coulomb or radiation gauge $a_\parallel = 0$. The gauge-invariant extension of the
theory is obtained by integration over all gauge sectors,

$$Z = \mathcal{N} \int D(\bar\psi, \psi) \, Da \, \exp\left( iS_{\mathrm{CF}}[\bar\psi, \psi, a] + i\frac{\theta}{2} S_{\mathrm{CS}}[a] \right), \tag{9.76}$$

where

$$S_{\mathrm{CF}}[\bar\psi, \psi, a] = \int d^3x \, \bar\psi \left( i\partial_0 + \mu - \phi - \frac{1}{2m} (-i\partial_x + A_{\mathrm{ext}} + a)^2 - V \right) \psi + S_{\mathrm{int}}[\bar\psi, \psi], \tag{9.77}$$

is but the action (9.73) generalized to arbitrary gauge field configurations. 93 In the following
we will investigate what information can be obtained from the functional integral Eq. (9.76)
about the physical behavior of the FQH system.

91 Notice that Eq. (9.72) is purely transversal, so that the replacement $a \to a_\perp$ makes sense.

92 Recall that space-time vectorial components are defined as $x_\mu = (x_0, x_1, x_2)$, while $\partial_\mu = (-\partial_0, \partial_1, \partial_2)$.

Figure 9.18 Different ways of realizing constructive particle exchange along transmutation paths.

Particle exchange in two dimensions 

Before embarking on this program, it may be of interest to discuss a few general aspects
of Chern-Simons field theory and particle transmutation statistics. In undergraduate
quantum mechanics courses we learn to discriminate between particles with bosonic and
fermionic statistics. On the formal level, the distinction between the two is made by considering
the behavior of the wavefunction under an “exchange of particles,” namely

$$\Psi(\ldots, x^1, \ldots, x^2, \ldots) = \pm \Psi(\ldots, x^2, \ldots, x^1, \ldots), \tag{9.78}$$

for bosons/fermions respectively. In fact, however, this definition should leave one with a
certain feeling of uneasiness: what is actually meant by the phrase “exchange of particles”?
Surely, the definition above does not imply a concrete physical prescription, i.e. strictly
speaking it does not make sense.

To appreciate the fact that we are not just discussing a formal subtlety, let us try to give
the exchange of particles a more physical meaning. (The construction below follows closely
an argument from chapter 1.2 of Wilczek. 94 ) Consider two quantum particles occupying
positions $x^1, x^2$ in a two-dimensional system. Suppose we were interested in computing the
amplitude for these particles to re-occupy the positions $x^1, x^2$ after some time $t$. Clearly,
these amplitudes receive two distinct contributions, (i) $x^1 \to x^1$, $x^2 \to x^2$ and (ii) $x^1 \to x^2$,
$x^2 \to x^1$, where $x \to y$ denotes the single-particle amplitude for propagation from $x$ to $y$.
Interpreting the second process as an operation of particle exchange, we are interested in
identifying a fixed relative phase between (i) and (ii). Thinking about the total transition
process in terms of a coherent double sum over single-particle paths, it is clear that contributions
from (i) cannot be continuously deformed into those of type (ii). The path-sum
falls into disconnected pieces, implying that there will be no variational (or classical) principles
telling us about the relative phase. Yet, quantum mechanics itself provides us with
an important clue: from the point of view of the first of the two particles, the net result of
a (i)-process will be a rotation of particle 2 (around the position of particle 1) by an angle
$\phi \in 2\pi\mathbb{Z}$. This is illustrated for $\phi = 0$ (left) and $\phi = 2\pi$ (right) in Fig. 9.18. Conversely, a
(ii)-process corresponds to a rotation by $\phi \in (2\mathbb{Z} + 1)\pi$. (For $\phi = \pi$, see the center of the
figure.)

93 As usual with gauge theories, the integral Eq. (9.76) must be interpreted in a qualified sense: the very fact that
the action is gauge invariant implies that the integration over all the different gauge realizations yields the
(infinite) volume of the gauge sector. To give the functional integral some meaning, a gauge fixing contribution
$S_{\mathrm{fix}}[a]$ has to be added to the action, i.e. a contribution that restricts the integration to a specific reference
gauge. Mostly, however, the presence of the gauge fixing action is not indicated explicitly.

94 F. Wilczek, Fractional Statistics and Anyon Superconductivity (World Scientific Publishing, 1990).

Now, let us suppose that the topologically distinct processes differ by some phase $\kappa(\phi)$. 
If we iterate transmutation processes, $(x^1,x^2) \to (x^2,x^1) \to (x^1,x^2)$, the winding angles 
add while quantum mechanics requires that the topological phases multiply, $\kappa(\phi+\phi') = 
\kappa(\phi)\kappa(\phi')$, implying that $\kappa(\phi) = \exp(i\psi\phi)$, where $\psi \in [0,2]$ is some parameter. 95 
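
The composition rule can be checked numerically. The following is a minimal sketch, assuming only the form $\kappa(\phi) = \exp(i\psi\phi)$ stated above; the function names `kappa` and `double_exchange_is_trivial` are illustrative and not part of the original text.

```python
import cmath
import math


def kappa(phi, psi):
    """Topological phase exp(i*psi*phi) for winding angle phi and statistics parameter psi."""
    return cmath.exp(1j * psi * phi)


def double_exchange_is_trivial(psi, tol=1e-12):
    """Return True if a twofold exchange (total winding angle 2*pi) gives the phase 1."""
    return abs(kappa(2 * math.pi, psi) - 1) < tol


if __name__ == "__main__":
    # Winding angles add, phases multiply.
    print(abs(kappa(0.3, 0.7) * kappa(1.1, 0.7) - kappa(1.4, 0.7)) < 1e-12)
    # Only psi = 0 (bosons) and psi = 1 (fermions) make the double exchange trivial.
    for psi in (0.0, 0.5, 1.0):
        print(psi, double_exchange_is_trivial(psi))
```

For generic $\psi$ the twofold exchange leaves a non-trivial phase, which is the anyonic case described in the surrounding text.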

Specifically, a single exchange operation $(x^1,x^2) \to (x^2,x^1)$ corresponds to a phase $e^{i\pi\psi}$. 
Let us compare this with the formal exchange definition above. According to Eq. (9.78), 
a twofold exchange, $(x^1,x^2) \to (x^2,x^1) \to (x^1,x^2)$, leaves the wavefunction unaltered. In 
contrast, even the most elementary twofold “physical” exchange procedure corresponds to a winding 
angle $\phi = 2\pi$ and, therefore, to a topological phase $\exp(2\pi i\psi)$. Only for the special choices 
$\psi = 0$ (bosons) or $\psi = 1$ (fermions) do we recover the result of the formal exchange. 
For a beautiful (yet non-path-integral-oriented) extension of the arguments above to a 
physically meaningful exchange prescription for general $N$-particle systems we refer to the 
seminal paper of Leinaas and Myrheim. 96 Presently, all we need to appreciate is that a 
constructive exchange operation appears to leave more room for non-trivial (i.e. $\neq 1, -1$) 
transmutation statistics. Particles with $\psi \neq 0, 1$ have been dubbed anyons, where the 
“any” stands for “any exchange statistics.” Skeptical readers may justly object that the 
overwhelming majority of particles observed in physics are of either bosonic or fermionic 
type. So where, then, does anyonic exchange statistics play a role? The short answer to 
this question is that our discussion above was critically tied to the two-dimensional system. 
In three or more dimensions, the winding angle $\phi$ is defined only mod $2\pi$. For example, 
in the three-dimensional world, you might use the dimension perpendicular to the paper 
plane to contract the process shown on the right-hand side of Fig. 9.18, $\phi = 2\pi$, to a $\phi = 0$ 
type process. This forces $\psi$ to be an integer, i.e. $\psi = 0$ (bosons) and $\psi = 1$ (fermions) 
simply exhaust the list of possible options: 97 in $d \neq 2$ anyons do not exist. 98 Yet, in various 
two-dimensional applications, anyonic excitations do play an important role. For example, 
a theoretical approach to the FQHE competing with the CF approach discussed in the text 
is based on composite bosons. 99 Although these are not anyons in the strict sense, here, too, 
the exchange statistics has been externally modified (in this case to manufacture bosons 
from fermions). Indeed, for a while, attempts were made to link anyons to the subject 


Since $\phi$ is a multiple of $\pi$, $\psi$ is defined only mod 2. 

J. M. Leinaas and J. Myrheim, On the theory of identical particles, Il Nuovo Cimento B 37 (1977), 1-23. 

In d = 1, particles cannot be exchanged anyway, which accounts for the interchangeability of bosonic and 
fermionic modelings. 

Another argument can be developed to the same effect: according to the spin-statistics theorem, the exchange 
statistics of particles is intimately tied to the quantization of angular momentum. The latter, in turn, is a 
direct consequence of the SU(2) commutation relations $[J_i, J_j] = i\epsilon_{ijk}J_k$: the structure of the right-hand side 
fixes the dimension of the irreducible representations of SU(2) and, thereby, the particle statistics. Yet, when 
restricted to a two-dimensional world, the algebra of angular momentum becomes one-dimensional, i.e. abelian. 
The absence of angular momentum quantization then implies that the spin statistics theorem loses its meaning. 
S. C. Zhang, T. H. Hansson, and S. Kivelson, Effective field theory model for the fractional quantum Hall effect, 
Phys. Rev. Lett. 62 (1989), 82-5. 
of high-temperature superconductivity. 100 Further, there is compelling evidence that the 
quasi-particle excitations in quasi-one-dimensional FQHE systems show fractional statistics, 
etc. Finally, field theories of anyons have been considered as prototypical model systems of 
matter fields coupled to massive gauge field excitations. These applications are motivation 
enough to briefly discuss the quantitative theoretical framework of anyon dynamics in two- 
dimensional systems. 

Let us start out from the following functional setup: 

$$Z = \int Da\, D\phi\, \exp\left( iS_0[\phi] + i\int d^3x\, j_\mu a_\mu + \frac{i\theta}{4} S_{\rm CS}[a] \right),$$

where either $\int D\phi = \int D(\bar\psi, \psi)$ may stand for a coherent state field integral, or $\int D\phi = 
\int \prod_{i=1}^N D(x_1^i, x_2^i)$ for a multiple path integral over the configuration space trajectories $\{x^i\}$ 
of $N$ particles. In the latter case (see the exercise below), $j(x) = \sum_i (1, \dot{x}^i)\,\delta(x - x^i(t))$ is 
the current density carried by the world lines of $N$ particles. 

EXERCISE Subject the first-quantized many-particle Hamiltonian $H = \sum_{i=1}^N \left( \frac{\mathbf{p}_i^2}{2m} + V(x^i) \right)$ to 
the singular gauge transformation (9.70). Then construct a path integral representation for the 
transition amplitude $\langle x^1, \ldots, x^N|U(t)|y^1, \ldots, y^N\rangle$. Show that the decoupling of the nonlinear 
gauge field contribution acquired by the Hamiltonian exactly parallels the field integral scheme 
discussed above. As before, the action contains the Chern-Simons contribution Eq. (9.75) and a 
current vector potential coupling. The latter is given by the canonical expression 

$$\begin{aligned}
&\sum_{i=1}^{N} \int dt\, a_\mu(x^i)\, j_\mu(x^i) \\
&\qquad = \int d^3x\, a_\mu(x)\, j_\mu(x).
\end{aligned}$$

In the first line, $j(x) = (1, \dot{x})$ denotes the (2+1)-dimensional current carried by a point particle 
while, in the second line, $j(x) = \sum_i (1, \dot{x}^i)\,\delta(x - x^i)$ represents the corresponding current density. 
(Upon space integration, $\int d^2x\, j_\mu(x) = \sum_i j_\mu(x^i)$.) 

We next employ the path integral variant to show that the statistical angle $\theta$ controls 
the transmutation statistics of the planar particle system. Let us consider a system of 
$N = 2$ particles initially ($t = t_i$) prepared to occupy the coordinates $(y^1, y^2)$. We want to 
analyze the transition amplitude $\langle y^1, y^2|U(t_f, t_i)|y^1, y^2\rangle$ to reoccupy the same state at a time 
$t_f$. As in our qualitative construction above, the path integral describing this amplitude 
receives contributions from topologically distinct pairs of trajectories. (The world lines of 
two trajectories with winding angle $\phi = 0$ (left) and $\phi = 2\pi$ (right) are shown in the figure.) 

To explore the impact of the statistical vector potential on the transition amplitude it is 
convenient to switch to an imaginary time formalism. Upon analytic continuation $t \to -i\tau$, 
$a_0 \to i a_0$, the path integral for the two transition amplitudes assumes the form 

$$Z = \int D(x^1, x^2)\, \exp\left( -S_0[x] + \int d^3x\, j_\mu a_\mu - \frac{i\theta}{4} S_{\rm CS}[a] \right),$$


R. B. Laughlin, The relationship between high-temperature superconductivity and the fractional quantum Hall 
effect, Science 242 (1988), 525-33. 
where $j_\mu = i\sum_{a=1,2} \partial_\tau x^a_\mu\, \delta(\mathbf{x} - \mathbf{x}^a(\tau))$ is the (2+1)-dimensional imaginary time current. The most 
important consequence of the transition $t \to -i\tau \in -i[0, \beta]$ is that the initial and final points of our two 
space-time trajectories now have to be identified (the usual temporally periodic boundary conditions 
of the imaginary time formulation 101 ). A glance at the figure shows that this leads to a pair of two 
closed world line curves which, for a winding angle $\phi = 2\pi n$, are $n$-fold intertwined. 102 Focusing on the 
two most elementary variants $\phi = 0$ and $\phi = 2\pi$, we next evaluate the Chern-Simons action on these 
trajectory pairs. 

We begin by integrating out the statistical vector potential. For now, it will be convenient to perform 
this integration in the radiation gauge, $\partial_\mu a_\mu = 0$. (Remember that, to integrate over a gauge 
field, a specific gauge has to be chosen.) This gauge can be selected by adding a gauge fixing 
contribution $\alpha \int d^3x\, (\partial_\mu a_\mu)^2$ to the action. In the limit $\alpha \to \infty$, configurations with $\partial_\mu a_\mu \neq 0$ no 
longer contribute to the integration and the gauge is effectively fixed. Equivalently, one may 
limit the integration from the outset so that only the two components of $a(q)$ perpendicular 
to $q$ are integrated over. Either way, one finds (exercise) $\langle a_\mu(q) a_\nu(q')\rangle_a = \frac{2}{\theta}\,\epsilon_{\mu\nu\sigma}\frac{q_\sigma}{q^2}\,\delta_{q+q'}$, 
where $\langle\cdots\rangle_a$ denotes functional averaging over the (imaginary time) Chern-Simons action. 
Using this result, we obtain 



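$$\begin{aligned}
Z &= \int D(x^1, x^2)\, \exp\left( -S_0[x] + \frac{1}{2}\int d^3x\, d^3x'\, j_\mu(x)\, j_\nu(x')\, \langle a_\mu(x) a_\nu(x')\rangle_a \right) \\
  &= \int D(x^1, x^2)\, \exp\left( -S_0[x] + \frac{1}{2}\sum_{q,q'} j_\mu(-q)\, j_\nu(-q')\, \langle a_\mu(q) a_\nu(q')\rangle_a \right) \\
  &= \int D(x^1, x^2)\, \exp\left( -S_0[x] + \frac{1}{\theta}\sum_{q} j_\mu(-q)\, \epsilon_{\mu\nu\sigma}\frac{q_\sigma}{q^2}\, j_\nu(q) \right).
\end{aligned}$$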




Let us now try to understand the meaning of the last term in the action, the remnant of 
the Chern-Simons integration. In fact, this term will turn out to be a topological invariant 
which tells us about the degree of knotting of the two integration paths. To see this, let us 
define 

$$j_\mu \to -i j_\mu = \sum_{a=1,2} \partial_\tau x^a_\mu(\tau)\, \delta(x_1 - x^a_1(\tau))\, \delta(x_2 - x^a_2(\tau)),$$

where $x = (\tau, x_1, x_2)$. Temporarily forgetting about the time-like origin of the first compo- 
nent, we may think of $j$ as the current vector field created by two loops in three-dimensional 


For the present, however, the parameter $\beta$ does not carry any physical significance. 

For the moment, we do not consider odd multiples $\phi = (2n + 1)\pi$ (corresponding to a particle exchange) since 
for $y^1 \neq y^2$ these cases do not have a meaningful imaginary time extension. 
Figure 9.19 Two intertwined current loops in (2+1)-dimensional Euclidean space-time. 


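space, each carrying a current $I = 1$ of unit 103 strength (see Fig. 9.19). The key point now 
is that, according to Ampère’s law, $b_\mu = -i\epsilon_{\mu\nu\sigma}\frac{q_\nu}{q^2}\, j_\sigma$ is the magnetic field created by 
a static 104 current distribution $j$. We may thus write the topological contribution to the 
action as 

$$\begin{aligned}
S_{\rm top}[x^1, x^2] &= -\frac{i}{\theta}\int d^3x\, j_\mu(x)\, b_\mu(x) = -\frac{i}{\theta}\sum_{a,a'}\oint d\tau\, \dot{x}^{a'}_\mu\, b^{a}_\mu(x^{a'}) \\
 &= -\frac{i}{\theta}\sum_{a,a'}\oint_{\gamma^{a'}} d\mathbf{s}\cdot\mathbf{b}^{a} = -\frac{i}{\theta}\sum_{a,a'}\int_{S^{a'}} d\mathbf{S}\cdot\mathrm{curl}\,\mathbf{b}^{a} = -\frac{i}{\theta}\sum_{a,a'}\int_{S^{a'}} d\mathbf{S}\cdot\mathbf{j}^{a} \\
 &= -\frac{i}{\theta}\sum_{a,a'} I(\gamma^{a}, \gamma^{a'}).
\end{aligned}$$

Here, $\gamma^a$ is a shorthand for the curve $\{x^a(\tau)\}$, $S^a$ is a surface spanned by $\gamma^a$, and $\mathbf{b}^a$, $\mathbf{j}^a$ denote 
the field and the current due to loop $a$. The crucial last line states that the topological action is 
proportional to the current $I(\gamma^a, \gamma^{a'})$ flowing through the area spanned by loop $a'$ due to the presence 
of loop $a$ (see Fig. 9.19). Obviously, $I(\gamma^a, \gamma^a) = 0$. For $a \neq a'$, $I(\gamma^a, \gamma^{a'}) \in \mathbb{Z}$ is equal to the number 
of times $\gamma^a$ pierces $S^{a'}$ or, in other words, the degree to which the two loops are intertwined. 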
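
The statement that this quantity counts how often one loop winds around the other can be illustrated numerically with the Gauss linking integral, $\frac{1}{4\pi}\oint\oint \frac{(\mathbf{r}_a - \mathbf{r}_b)\cdot(d\mathbf{r}_a \times d\mathbf{r}_b)}{|\mathbf{r}_a - \mathbf{r}_b|^3}$. The sketch below is an illustration only: it treats the two world lines as closed curves in ordinary three-dimensional space, uses a midpoint discretization, and the test curves (`core_loop`, `winding_loop`) are hypothetical examples, not trajectories taken from the text. The overall sign depends on the orientation convention, so only the magnitude is compared.

```python
import numpy as np


def gauss_linking_number(curve_a, curve_b):
    """Approximate the Gauss linking integral of two closed curves.

    curve_a, curve_b: arrays of shape (n, 3) holding points sampled along each loop.
    """
    # Segment vectors (closing the loop) and segment midpoints.
    da = np.roll(curve_a, -1, axis=0) - curve_a
    db = np.roll(curve_b, -1, axis=0) - curve_b
    ma = curve_a + 0.5 * da
    mb = curve_b + 0.5 * db
    # Pairwise separation vectors and cross products of the line elements.
    r = ma[:, None, :] - mb[None, :, :]
    cross = np.cross(da[:, None, :], db[None, :, :])
    integrand = np.einsum("ijk,ijk->ij", r, cross) / np.linalg.norm(r, axis=2) ** 3
    return integrand.sum() / (4.0 * np.pi)


def core_loop(n_points=400):
    """Unit circle in the plane of the last two coordinates set to (x, y), z = 0."""
    t = np.linspace(0.0, 2.0 * np.pi, n_points, endpoint=False)
    return np.stack([np.cos(t), np.sin(t), np.zeros_like(t)], axis=1)


def winding_loop(n, n_points=400, r=0.3):
    """Closed curve that winds n times around the core loop (n = 0: unlinked)."""
    t = np.linspace(0.0, 2.0 * np.pi, n_points, endpoint=False)
    rho = 1.0 + r * np.cos(n * t)
    return np.stack([rho * np.cos(t), rho * np.sin(t), r * np.sin(n * t)], axis=1)


if __name__ == "__main__":
    for n in range(4):
        lk = gauss_linking_number(core_loop(), winding_loop(n))
        print(n, abs(lk))
```

Up to discretization error, the magnitude of the result reproduces the integer winding number $n$ of the second loop around the first.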

Specifically, amplitudes where one of the particles encircles the other $n$ times acquire a 
phase $\kappa(2\pi n) = e^{2\pi i\psi n}$ where $\psi = 1/(\pi\theta)$. We have thus found that the Chern-Simons 
action does the book-keeping of the anyonic exchange phases discussed qualitatively above. 
In fact, it is relatively straightforward to extend our present two-particle analysis to $N$ 
particles, or to the fully-fledged formalism of the coherent state field integral. In the specific 
case of the Chern-Simons field theory of the FQHE, $\theta = (2\pi s)^{-1}$, i.e. fermions get trans- 
formed to (composite) fermions. In general, however, $\theta$ may be tuned so as to generate any 
form of exchange statistics. 
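
As a quick consistency check of the FQHE case, one may insert the quoted value of the statistical angle into the relation between $\psi$ and $\theta$. This is a sketch using only the two relations stated above, with $s$ assumed to be a positive integer:

```latex
\psi = \frac{1}{\pi\theta}\bigg|_{\theta = (2\pi s)^{-1}} = 2s,
\qquad
e^{i\pi\psi} = e^{2\pi i s} = 1 .
```

The attached flux thus contributes a trivial extra exchange factor, which is why fermions remain (composite) fermions, whereas a generic $\theta$ yields a fractional $\psi$.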


To compute the strength of the current, one may integrate $j$ over a space-like surface $x_0 = \tau = \mathrm{const.}$ 
intersecting the loop (see Fig. 9.19). The integral gives $I = \int dx_1\, dx_2\, j_0(x) = j_0(\tau, x_1(\tau), x_2(\tau)) = 1$. 

Keep in mind that, in our present picture, the 0-direction of space no longer carries the significance of time. 
9.5.3 Chern-Simons field theory II: analysis 


Equation (9.76) defines an exact reformulation of the FQHE field integral in terms of a 
Chern-Simons action. Trusting that the CS gauge field degree of freedom is sufficiently 
benign, we now proceed to our familiar program “mean-field + fluctuations.” We begin by 
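subjecting the Coulomb interaction $S_{\rm int}$ to a Hubbard-Stratonovich transformation, 

$$e^{iS_{\rm int}[\bar\psi,\psi]} = \int D\sigma\, \exp\left( \frac{i}{2}\int d^3x\, d^3x'\, \sigma(x) V^{-1}(\mathbf{x}-\mathbf{x}')\delta(x_0 - x_0')\sigma(x') + i\int d^3x\, (\rho(x) - \rho_0)\,\sigma(x) \right),$$

where $V^{-1}$ denotes the inverse of the interaction kernel. Integration over the - now 
Gaussian - CF degrees of freedom $\psi$ then brings us to the partition function, $Z = 
\mathcal{N}\int Da\, D\sigma\, e^{iS[a,\sigma]}$, where 

$$\begin{aligned}
S[a, \sigma] = {} & -i\,\mathrm{tr}\ln\left( i\partial_0 + \mu - \phi - \sigma - \frac{1}{2m}(-i\nabla + \mathbf{A})^2 - V \right) - \rho_0\int d^3x\, \sigma(x) \\
 & + \frac{1}{2}\int d^3x\, d^3x'\, \sigma(x) V^{-1}(\mathbf{x}-\mathbf{x}')\delta(x_0 - x_0')\sigma(x') + \frac{\theta}{4} S_{\rm CS}[a].
\end{aligned} \qquad (9.79)$$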


Starting from this representation, we now subject the theory to a mean-field analysis. 


Mean-field equations 

Let us seek solutions of the equations 

$$\frac{\delta S[a, \sigma]}{\delta a_\mu(x)}\bigg|_{\bar a, \bar\sigma} = \frac{\delta S[a, \sigma]}{\delta \sigma(x)}\bigg|_{\bar a, \bar\sigma} = 0.$$

Explicitly performing the differentiation with respect to $a_0 = \phi$, one may see that the first 
of these equations translates to the - by now familiar - form 

$$\rho[\bar a, \bar\sigma] = \frac{1}{4\pi s}\,\bar b, \qquad (9.80)$$

where $\rho[a, \sigma](x) = i\left( i\partial_0 + \mu - \phi - \sigma - \frac{1}{2m}(-i\nabla + \mathbf{A})^2 - V \right)^{-1}(x, x)$ denotes the local density 
of CFs, and the notation emphasizes the functional dependence of $\rho$ on $a$ and the Hubbard- 
Stratonovich potential $\sigma$. The differentiation with respect to the space-like components $\mathbf{a}$ 
does not yield independent new information; all it gives us is two relations expressing the 
compatibility of Eq. (9.80) with the continuity equation. Finally, differentiation with respect 
to $\sigma$ gives 

$$\bar\sigma(x) = -\int d^2x'\, V(\mathbf{x}-\mathbf{x}')\left( \rho[\bar a, \bar\sigma](x') - \rho_0 \right)\Big|_{x_0' = x_0}. \qquad (9.81)$$

This equation also affords a transparent interpretation: on the mean-field level the potential 
$\sim \int V[\rho - \rho_0]$ created by local density fluctuations compensates the interaction potential. In 
the absence of fluctuations of the external potential $V(x)$, Eq. (9.80) and (9.81) possess the 
obvious homogeneous solution 

$$\rho[\bar a, 0] = \rho_0 = \mathrm{const.}, \qquad \bar\sigma = \bar\phi = 0, \qquad \bar b = 4\pi s\rho_0, \qquad \bar{\mathbf{a}} = 2s\nu\,\mathbf{A}_{\rm ext}.$$

The particle density is homogeneous, implying that no mean-field interaction $\bar\sigma$ is generated. 
The strength of the equally homogeneous CS mean-field $\bar b$ is set by the average density $\rho_0$ of 
composite fermions. As expected, the effective field seen by the CFs, $B = B_{\rm ext} - \bar b$, turns 
out to be lower than that of the external magnetic field. Specifically, for a half-filled band, 
$\nu = 1/2$, $\rho_0 = B_{\rm ext}/4\pi$. In this case, CFs with two flux quanta attached, $s = 1$, experience 
a mean-field $B$ of vanishing strength. 
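
The cancellation at half filling can be made explicit in one line. The sketch below uses only the relations quoted in the text and assumes the units implied by $\rho_0 = B_{\rm ext}/4\pi$ at $\nu = 1/2$, i.e. a flux quantum of $2\pi$:

```latex
\nu = \frac{2\pi\rho_0}{B_{\rm ext}} = \frac{1}{2}
\;\Rightarrow\;
\rho_0 = \frac{B_{\rm ext}}{4\pi},
\qquad
\bar b = 4\pi s\rho_0 = s\,B_{\rm ext},
\qquad
B = B_{\rm ext} - \bar b = (1 - s)\,B_{\rm ext} \overset{s=1}{=} 0 .
```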

Now let us investigate the stability of the mean-field with respect to fluctuations. In 
many respects the N CFs behave like ordinary fermions subject to a perpendicular field 
of strength B. In particular, they will undergo a Landau quantization where, typically, the 
highest Landau level will be only partially occupied. Foreseeably, these generic configura- 
tions will be highly susceptible to all sorts of fluctuation effects (i.e. poor candidates for 
mean-field schemes): the massive degeneracy of the Landau levels leads to “small-energy 
denominators,” so that even a slight perturbation/fluctuation may cause dramatic effects. 
We thus expect our present mean-field scheme to work only in the vicinity of filling fractions 
$\nu$ where an integer number $p$ of CF Landau levels are fully occupied. These filling fractions 
are determined by the equation $\nu_{\rm eff} = p$ or $\Phi_{\rm eff} = 2\pi N/p$, where $\Phi_{\rm eff} = BL^2$ denotes the 
total effective flux piercing the system. Using the relation $B = B_{\rm ext} - b$ where $b = 4\pi sNL^{-2}$, 
it is straightforward to solve these equations to obtain 

$$\nu = \frac{2\pi N}{B_{\rm ext}L^2} = \frac{p}{2sp + 1},$$

in agreement with the experimental observation. Summarizing, the mean-field analysis of 
the Chern-Simons action confirms the basic expectation formulated on page 571 that 

the fractional QHE can be interpreted as an integer QHE of composite fermions. 
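
The resulting series of filling fractions is easy to tabulate. The following is a minimal sketch in exact rational arithmetic; the function names are illustrative, and the second function simply re-derives $\nu$ from the flux balance $B_{\rm ext}L^2 = 2\pi N/p + 4\pi sN$ quoted above, measured in units of $2\pi$.

```python
from fractions import Fraction


def cf_filling_fraction(p, s):
    """Electron filling fraction nu = p / (2 s p + 1) for p filled CF Landau levels."""
    return Fraction(p, 2 * s * p + 1)


def filling_from_flux_balance(p, s, n_particles=1):
    """Re-derive nu from the flux balance, with all fluxes measured in units of 2*pi."""
    # Effective flux N/p plus statistical flux 2 s N gives the external flux.
    external_flux = Fraction(n_particles, p) + 2 * s * n_particles
    return Fraction(n_particles) / external_flux


if __name__ == "__main__":
    for s in (1, 2):
        print(s, [str(cf_filling_fraction(p, s)) for p in range(1, 5)])
```

For $s = 1$ this reproduces the fractions 1/3, 2/5, 3/7, ..., which approach 1/2 for large $p$.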

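INFO There is one more aspect of the theory that can be explored on the level of plain mean- 
field theory, namely the charge of the composite fermions. To this end, let us consider the 
functional expectation value of the operator 

$$O(x, x') \equiv \langle \bar\psi(x)\psi(x') \rangle.$$

The correlation function $O$ describes the amplitude for creation of a CF at a space-time point 
$x$ and its annihilation at $x'$. Let us regard this amplitude as the coherent sum over all paths $\gamma$ 
connecting $x$ and $x'$. Due to the presence of the external magnetic field, each such contribution 
acquires an Aharonov-Bohm phase $\varphi_\gamma = -q_{\rm eff}\int_\gamma dx_\mu A^\mu_{\rm ext}$, where the coefficient $q_{\rm eff}$ defines the 
effective charge of the particle. (As we shall see, the presence of the statistical field makes $q_{\rm eff}$ 
different from the bare electron value $q = 1$.) Performing the Gaussian integral over $\psi$, we 
represent $O$ as 

$$\begin{aligned}
O(x, x') &= \left\langle \langle x|\left( i\partial_0 + \mu - \phi - \sigma - \frac{1}{2m}(-i\nabla + \mathbf{A})^2 - V \right)^{-1}|x'\rangle \right\rangle_{a,\sigma} \\
 &= \left\langle \langle \mathbf{x}|U(t, t')|\mathbf{x}'\rangle \right\rangle_{a,\sigma} \\
 &= \left\langle \int D\mathbf{x}\, \exp\left( iS_0[\mathbf{x}, \sigma] - i\int dt\, (\phi - \mathbf{A}\cdot\dot{\mathbf{x}}) \right) \right\rangle_{a,\sigma} \\
 &\simeq \mathrm{const.} \times \int D\mathbf{x}\, \exp\left( iS_0[\mathbf{x}, 0] + i q_{\rm eff}\int dt\, \mathbf{A}_{\rm ext}\cdot\dot{\mathbf{x}} \right).
\end{aligned}$$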
In the first line, the angular brackets denote the functional averaging over the Chern-Simons 
action ($a$) and the interaction kernel ($\sigma$). Further, we have made use of the fact that the (retarded) 
real time Green function is equal to the time evolution operator $U$. The latter is represented 
as a path (as opposed to a “field”) integral. The action of the path integral contains a field- 
independent contribution $S_0$, the coupling to the scalar field component $\phi$, and the canonical 
coupling to the vector potential $\mathbf{A}$. Until now, all manipulations have been exact. In the crucial 
last line, we then evaluate the $(a, \sigma)$-integration in a mean-field approximation. Assuming that 
fluctuations (lumped into the “const.” in front of the integral) will be small, this amounts to a 
substitution $(a, \sigma) \to (\bar a, 0)$ in the action. Using the fact that $\bar{\mathbf{a}} = 2s\nu\,\mathbf{A}_{\rm ext}$, we arrive at the 
last line, where the coupling constant 

$$q_{\rm eff} = 1 - 2s\nu \qquad (9.82)$$

is identified as the effective charge of the CF. The line of reasoning above tells us that CFs 
effectively carry fractional charge. The partial “screening” of the bare electron charge, $0 < 
q_{\rm eff} < 1$, is explained by the tendency of the phase vortices to expel electronic charge. In the 
specific case, $\nu = 1/2$, $q_{\rm eff} = 0$, the electron charge becomes completely screened by $2s = 
2$ vortices, a result that led some authors 105 to interpret the $\nu = 1/2$ FQHE as a dynamic 
phenomenon of charge dipoles - each comprising one electron ($q = 1$) and two vortices (“$q = 
-2 \times 1/2 = -1$”). 
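
To see how the effective charge behaves along the principal series, one may combine it with the mean-field filling fractions $\nu = p/(2sp+1)$. The following worked line is a sketch that assumes the mean-field form $q_{\rm eff} = 1 - 2s\nu$, which follows from $\bar{\mathbf{a}} = 2s\nu\,\mathbf{A}_{\rm ext}$:

```latex
q_{\rm eff} = 1 - 2s\nu
            = 1 - \frac{2sp}{2sp + 1}
            = \frac{1}{2sp + 1},
\qquad\text{e.g.}\quad
\nu = \tfrac{1}{3}\;(s = p = 1):\; q_{\rm eff} = \tfrac{1}{3},
\qquad
p \to \infty\;(\nu \to \tfrac{1}{2s}):\; q_{\rm eff} \to 0 .
```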


Fluctuations 

We now proceed to explore the role of quadratic fluctuations around the homogeneous 
mean-field. In particular: 

▷ What does the field theory have to say about the electromagnetic response of the system? 

▷ What else (beyond the fractional transmutation statistics) does it tell us about the micro- 
scopic features of the composite fermions? 

To answer these questions, we do not have to go into much quantitative detail. Rather the 
ubiquitous condition of gauge invariance and the presence of an excitation gap (an integer 
number of CF Landau levels are fully occupied!) suffice to fix the structure of the quadratic 
action. 

Shifting $a \to \bar a + a$ and $\sigma \to \bar\sigma + \sigma$, our goal is to expand the action to second order in 
the deviations $(a, \sigma)$. Let us begin by considering the interaction field $\sigma$. A shift $a_0 = \phi \to 
\phi + \sigma$ removes $\sigma$ from the “tr ln” component of the action and makes it reappear in the 
Chern-Simons action. More specifically, the Chern-Simons action acquires a linear contribution 106 
$S_{\rm CS}[a] \to S_{\rm CS}[a] - 2\int d^3x\, \sigma b$, so that the total $\sigma$-expansion of the action now reads 

$$S[a, \sigma] = S[a] - \frac{\theta}{2}\int d^3x\, \sigma(x) b(x) + \frac{1}{2}\int d^3x\, d^3x'\, \sigma(x) V^{-1}(\mathbf{x}-\mathbf{x}')\delta(x_0 - x_0')\sigma(x').$$


A. Stern, B. I. Halperin, F. von Oppen, and S. H. Simon, Half-filled Landau level as a Fermi liquid of dipolar 
quasiparticles, Phys. Rev. B 59 (1999), 12547-67. 

Here $b = \epsilon_{ij}\partial_i a_j$ describes the fluctuations of the statistical magnetic field around its mean value $\bar b$. 
After the straightforward Gaussian integration over $\sigma$ we then obtain 

$$S[a, \sigma] \to S[a] - \frac{\theta^2}{8}\int d^3x\, d^3x'\, b(x) V(\mathbf{x}-\mathbf{x}')\delta(x_0 - x_0') b(x').$$

The physical interpretation of the induced term is obvious: in the CS theory, fluctuations 
of the particle density are tied to fluctuations of the statistical magnetic field. Accordingly, 
spatial inhomogeneities of the statistical magnetic field get penalized by an interaction 
contribution, as described by the second term above. However, as far as the two basic 
questions above are concerned, we may temporarily forget about this contribution (see, 
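however, the concluding remarks below) and focus attention on the action $S[a]$. 

Let us, then, consider the action $S[a, A'] = S_{\rm CF}[\bar a + a + A'] + \frac{\theta}{4} S_{\rm CS}[\bar a + a]$, where 

$$S_{\rm CF}[a] = -i\,\mathrm{tr}\ln\left( i\partial_0 + \mu - a_0 - \frac{1}{2m}(-i\nabla - \mathbf{a})^2 - V \right)$$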

denotes the CF contribution to the action to which we have coupled a source potential $A'$. 
As usual, a two-fold differentiation with respect to elements of $A'$ will later tell us about 
the relevant transport coefficients of the system. Next, we develop the formal expansion 
$S[a, A'] = \sum_n S^{(n)}[a, A']$, where $S^{(n)}[a, A']$ is of total order $n$ in $a$ and $A'$. The zeroth- 
order term $S^{(0)}$ describes the CF system on the mean-field level and will not be of further 
interest to us. The first-order term $S^{(1)}$ does not contain $a$ as we are expanding around a 
stationary configuration. Moreover, its $A'$ dependence $S^{(1)}[A'] = \int A'_\mu j^\mu$ is inessential because 
the mean-field CF density $j^0$ is structureless and a mean-field current $\mathbf{j} = 0$ does not flow. 
However, at the second-order level $S^{(2)}$, things start becoming more interesting. Formally, 
the second-order contribution can be represented as 

$$S^{(2)}[a, A'] = \frac{1}{2}\int d^3x\, d^3x'\, (a + A')_\mu(x)\, K_{\mu\nu}(x, x')\, (a + A')_\nu(x') + \frac{\theta}{4} S_{\rm CS}[a], \qquad (9.83)$$

where $K_{\mu\nu}(x, x') = \frac{\delta^2 S_{\rm CF}[a]}{\delta a_\mu(x)\,\delta a_\nu(x')}\Big|_{a = \bar a}$. By construction, $K$ is but the linear response kernel 
discussed in Section 7.2. For the moment, all we need to recall about this object is that 
it (a) is gauge invariant, $\partial_\mu K_{\mu\nu} = K_{\mu\nu}\partial_\nu = 0$, (b) is generally short-range ($K(q)$ can be 
expanded in powers of $q$), and (c) contains information about both the polarizability of the 
medium and its conductivity. 

Building on property (b), one can expand the second-order action in derivatives: 
$S^{(2)}[a, A'] = \sum_{l=0}^{\infty} S^{(2,l)}[a, A']$, where $S^{(2,l)}[a, A']$ is of $l$th order in derivatives ($\partial_0, \partial_{\mathbf{x}}$) 
and, as usual, one may focus attention on the contribution with the least number of 
derivatives. As discussed before, no gauge-invariant zero derivative term $S^{(2,0)}$ can be 
constructed. However, with the contribution linear in the number of derivatives, $S^{(2,1)}$, 
the situation is more tricky. While in general there are no gauge invariant contributions 
of first order in $q \sim -i\nabla$, terms nominally scaling as $|q|$ for $q \to 0$ exist. (For example, 
in systems with a non-vanishing longitudinal conductivity, the action takes the form 
$S^{(2,1)}[A'] \sim A'_\mu(q)\,(\cdots)\,A'_\nu(-q)$, with a kernel $(\cdots)$ that involves the diffusion constant $D$.) However, the 
system at hand is an insulator, so that no such term is present. 

Yet, in two-dimensional systems, there exists one more gauge-invariant first-order deriva- 
tive term, namely the notorious Chern-Simons term! Thus, we are led to the preliminary 
identification $S^{(2,1)}[a, A'] = c\, S_{\rm CS}[a + A'] + \frac{\theta}{4} S_{\rm CS}[a]$, where the first contribution is obtained 
by a first-order gradient expansion of the CF action while the second contribution has been 
present from the outset. Of course, the coupling constant $c$ remains to be determined. This 
can be done by explicit - and quite laborious - calculation or by the following more elegant 
construction: remembering that $S_{\rm CS}[a] = -\int d^3x\, \epsilon_{\mu\nu\sigma} a_\mu \partial_\nu a_\sigma = -i\sum_q \epsilon_{\mu\nu\sigma}\, a_\mu(q)\, q_\nu\, a_\sigma(-q)$, 
and comparing with Eq. (9.83), our findings so far translate to the relation 

$$K_{\mu\nu}(q) = -i2c\,\epsilon_{\mu\nu\sigma} q_\sigma + O(q^2). \qquad (9.84)$$

INFO To fully describe the electromagnetism of the statistical gauge field, we should 
also include the second-order derivative contribution $S^{(2,2)}$ in our analysis: gauge-invariant con- 
tributions of second order are obtained as bilinears formed from elements of the field tensor 
$F_{\mu\nu} = \partial_\mu a_\nu - \partial_\nu a_\mu$. These terms are then combined into an action $S^{(2,2)} = \frac{1}{2}\int d^3x\, (\epsilon\,\mathbf{e}\cdot\mathbf{e} + \chi b^2)$, 
where $e_i = \partial_0 a_i - \partial_i a_0$ is the electric field derived from the gauge field $a$, and $\epsilon$ and $\chi$ are 
the electric and magnetic permeabilities, respectively. Physically, the action $S^{(2,2)}$ describes the 
electric and magnetic susceptibility of the (mean-field) CF medium to the presence of gauge field 
fluctuations. However, as far as our present objectives are concerned, these effects turn out to 
be of lesser relevance. 

On the other hand, we know the conductivity of the system relates to the linear response 
kernel through the relation $\sigma^0_{12} = -i\lim_{q \to 0} q_0^{-1} K_{21}(q)$. For the present, we have to 
interpret $\sigma^0_{12} = p/2\pi$ as the quantized Hall conductivity carried by the CF system at the 
mean-field level. Comparison with Eq. (9.84) then leads to the identification $c = \sigma^0_{12}/2$, or 

$$S^{(2,1)}[a, A'] = \frac{\sigma^0_{12}}{2} S_{\rm CS}[a + A'] + \frac{\theta}{4} S_{\rm CS}[a].$$

Of course, $\sigma^0_{12}$ does not coincide with the actual Hall conductance $\sigma_{12}$ carried by the system - 
within our present level of approximation, the latter is obtained by two-fold differentiation 
with respect to $A'$ after the statistical gauge field has been integrated out. Now, there is 
a simple general formula telling us what happens to two quadratic actions (such as the 
Chern-Simons actions) upon integration over one half of the fields: 

$$c_1 S[a + b] + c_2 S[a] \;\xrightarrow{\;\int Da\;}\; \left( c_1^{-1} + c_2^{-1} \right)^{-1} S[b]. \qquad (9.85)$$

EXERCISE Of course, Eq. (9.85) can be proven by straightforward Gaussian integration over $a$. 
A more elegant procedure is based on the fact that, for any quadratic action, $S[a, b] \xrightarrow{\int Da} S[\bar a[b], b]$, 
where $\bar a[b]$ is the solution of the mean-field equation $\delta_a S[a, b]|_{a = \bar a} = 0$. To make use of this 
identity, let us formally write $S[a] = a^T K a$, where $K$ is a non-degenerate 107 operator kernel. 
Find the solution of the mean-field equations corresponding to the left-hand side of Eq. (9.85) 
and show that, upon substitution back into the action, we obtain the right-hand side. If you are 
critical, you may object that our CS actions are not, in fact, non-degenerate: when evaluated on 


Otherwise the field integration would create a headache! 
a pure gauge configuration $a_\mu = \partial_\mu f$, they vanish. Nevertheless, you may convince yourself that 
Eq. (9.85) remains valid if the integration over $a$ is performed in some fixed reference gauge. 
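
For orientation, the saddle-point route suggested in the exercise can be sketched as follows. This is an outline under the stated assumption $S[a] = a^T K a$ with a symmetric, non-degenerate kernel $K$; the gauge-fixing subtlety mentioned above is not addressed.

```latex
\begin{aligned}
0 &= \delta_a\bigl(c_1 S[a+b] + c_2 S[a]\bigr)\Big|_{a=\bar a}
   = 2K\bigl(c_1(\bar a + b) + c_2\,\bar a\bigr)
   \;\Rightarrow\;
   \bar a = -\frac{c_1}{c_1 + c_2}\,b,
   \qquad
   \bar a + b = \frac{c_2}{c_1 + c_2}\,b, \\
c_1 S[\bar a + b] + c_2 S[\bar a]
  &= \frac{c_1 c_2^2 + c_2 c_1^2}{(c_1 + c_2)^2}\,S[b]
   = \frac{c_1 c_2}{c_1 + c_2}\,S[b]
   = \bigl(c_1^{-1} + c_2^{-1}\bigr)^{-1} S[b].
\end{aligned}
```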

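When applied to our action $S^{(2,1)}$ above, the auxiliary identity Eq. (9.85) tells us that 

$$S^{(2,1)}[a, A'] \;\xrightarrow{\;\int Da\;}\; S_{\rm eff}[A'] = \frac{\sigma_{12}}{2} S_{\rm CS}[A'],$$

where $\sigma_{12} = \left( \frac{1}{\sigma^0_{12}} + \frac{2}{\theta} \right)^{-1}$. The terminology “$\sigma_{12}$” is justified because, as seen above, the 
constant in front of a CS source action is but the Hall conductance of the system. Recalling 
that $\theta = 1/2\pi s$ and $\sigma^0_{12} = p/2\pi$, we find 

$$\sigma_{12} = \frac{1}{2\pi}\,\frac{p}{1 + 2sp},$$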
i.e. we confirm the expectation of fractional Hall quantization. Let us recall that this result 
is critically linked to the absence of a longitudinal conductance $\sigma_{11}$. In our analysis 
above, alluding to the striking analogies that exist between CFs and ordinary electrons 
in a magnetic field, we simply postulated $\sigma_{11} = 0$ at CF filling $p$. (Technically, this 
happened when we said that the CF “tr ln” does not support a longitudinal current-voltage 
relation.) However, one may note that the actual problem of Anderson localization of 
CFs has not yet been attacked on a truly microscopic level. What makes the problem so 
difficult is the massive inter-CF correlations induced by fluctuations of the statistical gauge 
field. In the absence of disorder, the full extent of this correlation mechanism does not yet 
become clear (at least not on the level of our simplistic “mean-field plus quadratic fluc- 
tuations” analysis). However, once external inhomogeneities are present, things instantly 
become more complicated. For example, a static impurity potential will be screened by an 
inhomogeneous CF distribution. This creates an accumulation of statistical flux that in turn 
acts as a scattering center of CFs, etc. Thus, we readily wind up with a full-blown problem 
“interaction + disorder + strong magnetic field” whose rigorous microscopic solution seems 
to be elusive. Nonetheless, all evidence suggests that eventually the CFs will be localized, 
so that the analogy (FQHE of fermions) $\leftrightarrow$ (IQHE of composite fermions) remains valid. 
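
The arithmetic behind the fractional Hall conductance can be spelled out in a single line. The sketch below assumes the values $\theta = 1/2\pi s$ and $\sigma^0_{12} = p/2\pi$ quoted in the text and the combination rule $\sigma_{12}^{-1} = (\sigma^0_{12})^{-1} + 2/\theta$ that results from Eq. (9.85):

```latex
\frac{1}{\sigma_{12}} = \frac{1}{\sigma^0_{12}} + \frac{2}{\theta}
 = \frac{2\pi}{p} + 4\pi s
 = 2\pi\,\frac{1 + 2sp}{p}
\;\Rightarrow\;
\sigma_{12} = \frac{1}{2\pi}\,\frac{p}{1 + 2sp},
\qquad\text{e.g.}\quad
(s, p) = (1, 1):\ \sigma_{12} = \frac{1}{2\pi}\cdot\frac{1}{3},
\quad
(s, p) = (1, 2):\ \sigma_{12} = \frac{1}{2\pi}\cdot\frac{2}{5}.
```

In units of the conductance quantum $1/2\pi$, the CF and statistical contributions therefore add like resistances in series.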

INFO As mentioned above, the field theory approach to the FQHE was preceded by a number of 
other developments. Perhaps most importantly, shortly after the experimental discovery of the 
effect, Laughlin proposed a trial wavefunction - nowadays generally referred to as Laughlin’s 
wavefunction 108 - which did a fantastic job at explaining much of the phenomenology of the 
FQHE. 

With the benefit of hindsight (!) it is not difficult to motivate the structure of the 
Laughlin wavefunctions from a few simple considerations. Consider a clean FQHE system at a filling 
fraction $\nu = 1/(2s+1)$ belonging to the principal series. 109 Any ground state many-body eigen- 
function should (a) be built by superposition of single-particle states belonging to the lowest 


R. B. Laughlin, Anomalous quantum Hall effect: an incompressible quantum fluid with fractionally charged 
excitations, Phys. Rev. Lett. 50 (1983), 1395-8. 

For an extension to more complex fractions, see F. D. M. Haldane, Fractional quantization of the Hall effect: 
a hierarchy of incompressible quantum fluid states, Phys. Rev. Lett. 51 (1983), 605-8, and B. I. Halperin, 
Statistics of quasiparticles and the hierarchy of fractional quantized Hall states, Phys. Rev. Lett. 52 (1984), 
1583-6. 
Landau level, (b) optimally account for the effects of Coulomb repulsion (recall that the strongest 
player in the Hamiltonian, the kinetic energy, is completely degenerate), and (c) obey Fermi 
statistics. 

As to (a), we have seen in Section 9.3.4 that any function of the form $\psi(z) = 
f(z)\exp(-|z|^2/(4l_0^2))$ automatically belongs to the lowest Landau level, provided that 
$f(z)$ is analytic. 110 So let us seek a many-body wavefunction of the form 

$$\psi(z_1, \ldots, z_N) \sim F(\{z_i - z_j\})\, e^{-\frac{1}{4l_0^2}\sum_i |z_i|^2},$$

where $F$ must depend only on differences of coordinates (the translational invariance of 
the clean system). The antisymmetry of the wavefunction (c) requires that $F$ is skew- 
symmetric under any exchange $z_i \leftrightarrow z_j$. Also, (b) $F$ should vanish whenever two 
coordinates approach each other. Taken together, (a)-(c) motivate the ansatz 

$$\psi(z_1, \ldots, z_N) = \prod_{i<j} (z_i - z_j)^m\, e^{-\frac{1}{4l_0^2}\sum_i |z_i|^2}, \qquad (9.86)$$

i<j 

with some a priori undetermined integer coefficient m. Equation (9.86) defines Laughlin’s wave- 
function. Notice its high degree of universality (except for the integer m, the trial wavefunction 
does not contain a single adjustable parameter!) and the simplicity of its structure. Nonetheless, 
the ansatz Eq. (9.86) offers a straightforward explanation of many observable features of the 
FQHE. Specifically, it can be shown that: 

▷ The integer $m$ relates to the filling factor as $m = \nu^{-1} = 2s + 1$. 

▷ For certain types of short-range interaction, $\psi$ is an exact ground state of the Hamiltonian. 
Numerical analyses have shown that, even for the long-range Coulomb interaction, $\psi$ has a 
close to perfect overlap with the exact ground state. 

▷ Single-particle excitations superimposed on $\psi$ are gapped and fractionally charged. 

In fact, one may readily rewrite $\psi$ in a way that suggests an interpretation in terms of composite 
fermions; simply factor out a power $(z_i - z_j)^{m-1}$ to obtain 

$$\psi(z_1, \ldots, z_N) = \prod_{i<j} (z_i - z_j)^{m-1}\, \psi_{m=1}(z_1, \ldots, z_N).$$

Here, $\psi_{m=1}$ is the wavefunction at an integer filling factor while the prefactor adds $2s$ winding 
phases to each particle coordinate. In other words, the prefactor converts fermions to composite 
fermions, so that $\psi$ can be interpreted as a CF wavefunction at integer filling. Indeed, this 
anticipates the field theoretical picture constructed above. 
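
The structural properties of the trial wavefunction can be verified numerically for a few particles. The sketch below assumes the unnormalized form $\prod_{i<j}(z_i - z_j)^m \exp(-\sum_i |z_i|^2/4l_0^2)$; the function names and the default $l_0 = 1$ are illustrative choices, not part of the original text.

```python
import itertools
import math


def jastrow(zs, power):
    """Product over pairs i < j of (z_i - z_j)**power for complex coordinates zs."""
    result = 1.0 + 0.0j
    for zi, zj in itertools.combinations(zs, 2):
        result *= (zi - zj) ** power
    return result


def laughlin(zs, m, l0=1.0):
    """Unnormalized Laughlin trial wavefunction with integer exponent m."""
    gaussian = math.exp(-sum(abs(z) ** 2 for z in zs) / (4.0 * l0 ** 2))
    return jastrow(zs, m) * gaussian


if __name__ == "__main__":
    zs = [0.3 + 0.1j, -0.5 + 0.7j, 1.1 - 0.4j]
    swapped = [zs[1], zs[0], zs[2]]
    # Odd m: the wavefunction changes sign under exchange of two coordinates.
    print(abs(laughlin(swapped, 3) + laughlin(zs, 3)) < 1e-12)
    # Factorization into a Jastrow prefactor times the m = 1 wavefunction.
    print(abs(jastrow(zs, 2) * laughlin(zs, 1) - laughlin(zs, 3)) < 1e-12)
```

The second check mirrors the composite-fermion reading: for $m = 3$ the prefactor $\prod_{i<j}(z_i - z_j)^2$ attaches $2s = 2$ winding phases to the $m = 1$ state.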

To be precise, in Section 9.3.4, we considered wavefunctions $\psi(z) = f(\bar z)\exp(-|z|^2/(4l_0^2))$ with an anti- 
holomorphic prefactor. One can switch from one form to the other by inversion of the external field $B \to -B$. 
For notational simplicity, we shall use the analytic form throughout. 




Robert B. Laughlin 1950- 

Theoretical condensed matter physicist. 
Laughlin was awarded one third of the 1998
Nobel Prize for his groundbreaking contributions
to the explanation of the FQHE. The
complementary share went to the experimentalists
Horst L. Störmer (1949-) and Daniel
C. Tsui (1939-) for the experimental discovery
of the effect.
9.6 Summary and outlook

This concludes our introduction to topological quantum field theory. 111 We have seen the 
way “large-scale” geometric structures of quantum fields may lead to intriguing physical 
phenomena - phenomena, in fact, whose understanding required the application of the entire 
spectrum of field integral techniques developed earlier in this text. Once again, this chapter 
could provide no more than a very preliminary impression of the diversity of topology-related 
quantum phenomena in condensed matter physics. Given the rapid development of the field, 
we encourage readers motivated to deepen their knowledge in topological condensed matter 
field theory to turn to the original literature. (For an excellent particle-physics-oriented 
introduction we refer to Ryder. 13 A comprehensive discussion of topological textures in 
classical physics can be found in Efetov. 112 ) 


9.7 Problems 

9.7.1 Persistent current of a disordered ring 

In the main text we have argued that, contrary to a long-standing belief, the presence of disorder 
does not conflict with the formation of persistent currents in normal metal rings. Here we support this 
assertion by a microscopic calculation. 

Consider a quasi-one-dimensional ring of circumference $L \gg \ell$ and transverse extension
$\lambda_F \ll L_\perp \ll \ell$, where $\ell$ is the mean free path. We assume that the ring is pierced by a
magnetic flux $\Phi$. Using the fact that $L_\perp/L \ll 1$, variations in transverse direction of the
vector potential A may be neglected, i.e. $A = (\Phi/L)e_\parallel$, where $e_\parallel$ is the unit vector in
the longitudinal direction and units $e = \hbar = 1$ are used. We are interested in the typical
value $I_{typ} = \sqrt{\langle I^2\rangle}$ of the persistent current $I = -\partial_\Phi F$, where F is the flux-dependent free
energy. 113 Assuming that the ring is metallic in the sense that its dimensionless conductance
$g \gg 1$, we will be content with computing the first contribution to $I_{typ}$ in an expansion in
powers of $g^{-1}$. For notational simplicity, we assume the flux $\Phi$ to be measured in units of
$\Phi_0$ throughout.

(a) Represent $\langle I(\Phi_1)I(\Phi_2)\rangle$ as a correlation function of two non-interacting Green functions.
Without going into quantitative detail, convince yourself that the dominant contribution
to the correlation function is given by the sum of diagrams shown in Fig. 9.20, where
the dots denote the operator $(-i\partial_\parallel - \Phi/L)/m \equiv v_\parallel$ and $\partial_\parallel$ is the derivative along
the ring. Estimate the relative contributions of the four diagrams. (Hint: Recall the
derivation of low-momentum diffusion modes in Section 6.5. Owing to the thinness of
the ring, fluctuations of the diffuson/cooperon modes in the radial direction may safely
be neglected.)


111 Strictly speaking, the terminology “topological quantum field theory” is reserved for field theories whose 
behavior is solely determined by topological terms. However, we use it here in a more liberal sense to denote 
field theories where topological aspects play a significant role. 

112 K. B. Efetov, Supersymmetry in Disorder and Chaos (Cambridge University Press, 1996). 

113 In the presence of static disorder, the sign of I depends sensitively on the impurity configuration, i.e. the
disorder average $\langle I\rangle$ is much smaller than the typical value $I_{typ}$ of the current for a fixed impurity configuration.
Figure 9.20 Four diagrams (c1, c2, d1, d2) contributing at leading order to the correlation function
$\langle I(\Phi_1)I(\Phi_2)\rangle$ of the persistent current in a disordered ring.


(b) To quantitatively compute the correlation function, we employ the formalism of the
field integral. Using the fact that $F = -T\lim_{R\to 0}\frac{1}{R}(Z^R - 1)$, construct a generalization
of the nonlinear $\sigma$-model action $S[Q]$ suitable to compute the current correlation
function. (Hint: Introduce a two-component index t = 1,2 discriminating between the
two currents, and derive the form of the vector potential by minimal substitution, cf.
Eq. (6.64), i.e. by recalling that the flux dependence of the Hamiltonian may be formally
removed by a gauge transformation.)

(c) As in Problem 8.8.4, introduce the parameterization $Q = e^{iW}\Lambda e^{-iW} = \Lambda(1 - 2iW -
2W^2 + \cdots)$, where the generators W are given in Eq. (8.53) (block structure in Matsubara
space). Notice that $B = \{B^{nat\sigma,\,n'a't'\sigma'}\}$ are matrices in Matsubara (n), replica (a),
current (t), and time-reversal ($\sigma$) spaces. Expand the action to second order in B. It is
convenient to split $B = B_d + B_c$ into contributions diagonal and off-diagonal in time
reversal space. Explore the symmetries of the constituent matrices $B_{d,c}$ and perform
the Gaussian integrals. Show that


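$$\langle I(\Phi_1)I(\Phi_2)\rangle \sim \partial^2_{\Phi_1\Phi_2}\, T^2 \sum_{q,\,\omega_n>0,\,\omega_{n'}<0} \ln\Gamma_{q-(\Phi_1-\Phi_2)/L,\,nn'} + (\Phi_1 - \Phi_2 \to \Phi_1 + \Phi_2), \tag{9.87}$$

where $\Gamma_{q,nn'} = [Dq^2 + |\omega_n - \omega_{n'}|]^{-1}$ is the diffusion mode.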

(d) The final step in the evaluation of the correlation function, the summation over
$q, \omega_n, \omega_{n'}$, is a good exercise in executing tricky integrals (here defined as integrals
which have to be simplified by physically motivated approximations).

EXERCISE Differentiate with respect to the flux before doing the frequency/momentum 
summations; identify four contributions corresponding to the diagrams above in Fig. 9.20. 

What prevents the expression above from vanishing is the discreteness of the momentum
sum; were we able to approximate the sum by an integral, a shift $q \to q + \Phi/L$ would
remove the flux dependence of the integrand. The validity of an integral approximation
in turn depends on the value of $|\omega|$. For $|\omega| > (2\pi)^2 D/L^2 \equiv E_c$ much larger than
the magnitude of the lowest non-vanishing quantized q-mode (physically, the Thouless
energy, i.e. the inverse time of diffusion around the ring), many modes contribute to the
integral and a continuum approximation is valid. Thus, the dominant contribution to
the correlation function comes from frequencies $|\omega| < E_c$, for which the discreteness of
the sum really matters. 114

Carry out the sum over momenta by the methods otherwise employed in performing
frequency summations. Approximate the subsequent sum over Matsubara frequencies
$|\omega| < E_c$ by an integral. Show that, for small values of the flux, $I_{typ} \sim E_c\,\Phi\,(-\ln\Phi)^{1/2}$.
(Hint: Do not try to do the frequency integral rigorously; keeping the relevant information
on board, simplify the integrand to obtain a manageable integral. As to the
q-summation, use the freedom to do the flux derivatives before or after the summation
to obtain a contour integral that is as simple as possible.)

The most important feature of this result is that it depends only algebraically on the
disorder concentration, i.e. through $E_c \sim D \sim \tau$. In contrast, the long-standing yet
erroneous expectation that impurity scattering destroys the phase coherence required to
maintain a stable persistent current inevitably leads to the prediction $I_{typ} \sim \exp(-L/\ell)$.
For experimentally “realistic” values $L/\ell = O(10^2)$, the difference between the two
results is dramatic. Irritatingly, the actually observed value of the current turns out to
be roughly two orders of magnitude larger than our result above. Although the origin of
this discrepancy is unknown, it is clear that it cannot be resolved within the framework
of a non-interacting theory.

Answer: 


(a) Using the fact that the free energy of a non-interacting fermion system is given by
$F = -T\sum_n \mathrm{tr}\ln(i\omega_n - H)$, where $H = \frac{1}{2m}(p - A)^2 + V$ and V represents the single-particle
potential of the problem (the sum of disorder and confining potentials), we
obtain $I(\Phi) = -T\sum_n \mathrm{tr}(G_n\partial_\Phi H) = \frac{T}{L}\sum_n \mathrm{tr}(G_n v_\parallel)$. Expanding the Green function in
the impurity operator and constructing ladder diagrams at one-loop order, we identify
the two cooperon (c1 and c2) and two diffuson (d1 and d2) diagrams in Fig. 9.20.
Focusing on the cooperon sector, we note that the ladder diagram depends only on the
sum $q - (\Phi_1+\Phi_2)/L$ of the fast momenta $p - (\Phi_i/L)e_\parallel$ carried by the Green functions. 115
More specifically, an individual cooperon ladder contributes a factor (see Section 6.5)
$\sim \Gamma_{q-(\Phi_1+\Phi_2)/L,\,nn'} \propto [D(q - (\Phi_1+\Phi_2)/L)^2 + |\omega_n - \omega_{n'}|]^{-1}$. Up to constants, the
contribution of diagram c1 is thus given by $\Gamma_{q-(\Phi_1+\Phi_2)/L,\,nn'}$. Turning to diagram c2,
we need to take into account the fact that the vector vertices $\sim v_\parallel$ indicated by the dots
are now integrated independently over the fast momenta (because they are separated
by impurity lines; recall the momentum structure of a ladder diagram). In the limit of
zero momentum difference, $q - (\Phi_1 + \Phi_2)/L \to 0$, the two Green function insertions
carrying a dot vanish (think about it). This means that diagram c2 will be proportional


114 This statement conforms with the expectation that the minimal time $t \sim \omega^{-1}$ required to sense the flux
through a disordered ring is the diffusion time, i.e. the time for a quantum particle to traverse the ring at least
once.

115 Due to the thinness of the ring, fluctuations of the cooperon in the transverse direction are negligible, i.e. the
momentum q is a scalar quantity measuring fluctuations in the longitudinal direction.
to $\sim (q - (\Phi_1+\Phi_2)/L)^2\,\Gamma^2_{q-(\Phi_1+\Phi_2)/L,\,nn'}$, where the factor $\Gamma^2$ accounts for the two
ladders. The diffuson diagrams d1, d2 are obtained by replacing $\Phi_1 + \Phi_2 \to \Phi_1 - \Phi_2$.

(b) Defining $Z = \int D(\bar\psi, \psi)\exp(-S[\bar\psi, \psi])$, where

$$S[\bar\psi, \psi] = \int d^dr\, d\tau\; \bar\psi^{at}\left(\partial_\tau + H(A_t) - \mu\right)\psi^{at},$$

and $A_t = (\Phi_t/L)e_\parallel$, we have the representation $\langle I(\Phi_1)I(\Phi_2)\rangle = T^2\lim_{R\to 0} R^{-2}\,\partial^2_{\Phi_1\Phi_2}\langle Z\rangle$.
All we need to do to deduce the structure of the low-energy action is to notice that
the composite index (a, t) can be identified with a “replica index” of doubled dimension
2R. Noting that the flux dependence of the Hamiltonian may be formally (i.e. at the
price of changing boundary conditions) removed by the transformation $\psi^{at} \to e^{i\Phi_t r_\parallel/L}\psi^{at}$,
where $r_\parallel$ is the coordinate along the ring, and recalling the discussion above Eq. (6.64),
we conclude that the action is given by S'fQ] = ^ / dry tr ^ (( d — i[A, ])Q) 2 — u>qJ , 
where the vector potential A = {5 at ’ a t ' (<& t /L)} 0 af. 

(c) Using the symmetry relation W = — a^Wa^ => B T = —cr^B^cr^, we verify that 

B d = E\\ b d - 7?22 &d , B c = E | fee + £21 K i 

where $b_{d,c} = \{b_{d,c}^{nat,\,n'a't'}\}$ are complex matrices. Substituting this representation into
the action and expanding to quadratic order, it is a straightforward matter to obtain

SW[B,B'] = A ]T tr((fe d )«;,r ? _ ($t _ $t/)W (fet){:,g 

q,n>0,n'<0 

+ (fed fee, + $i')- 


We finally integrate over the fields $b_{d,c}$ and arrive at

$$Z = \prod_{q,\,n>0,\,n'<0}\left(\Gamma_{q-(\Phi_1-\Phi_2)/L,\,nn'}\right)^{R^2} + (\Phi_1 - \Phi_2 \to \Phi_1 + \Phi_2),$$

where the factor of $R^2$ counts the independent replica channels, constant factors
$c^{R^2} \to 1$ have been omitted, and we also omitted those contributions to the integral
that depend exclusively on $\Phi_1$ or on $\Phi_2$ (as they do not contribute to the two-fold
derivative $\lim_{R\to 0} R^{-2}\partial^2_{\Phi_1\Phi_2}$ [check!]). Differentiating with respect to the flux and
sending $R \to 0$, we obtain Eq. (9.87).

(d) Our strategy will be to do the momentum sum by contour integral methods. As it is not
convenient to integrate over a function containing branch cuts (such as our logarithm), we
do one of the flux derivatives, $\partial_{\Phi_1}$, say, first. Focusing on the diffuson contribution, and
introducing the shorthand notation $\Phi = \Phi_1 - \Phi_2$, $|\omega| = |\omega_n - \omega_{n'}|$, we have


D{q-*/L) 

* 2 ^ D(q-<f>/L) 2 + M 


2 Am 


dz coth(zL/2) 


D(-iz-<f>/L) 
D{-iz-A>/L) 2 + \uj\ 


— I m coth([(|u;|/E c ) 1 ' /2 + ?$]/ 2), 



where the integration contour is a circle at infinity avoiding the two poles of the integrand. 116
Notice that our result is $2\pi$-periodic in the flux $\Phi$, as it should be.
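The momentum sum can be cross-checked numerically. The sketch below compares a symmetric truncated sum over $q = 2\pi k/L$ with the closed form $\mathrm{Im}\coth([L(|\omega|/D)^{1/2} + i\Phi]/2)$. The overall sign convention, the explicit factor $L(|\omega|/D)^{1/2} = 2\pi(|\omega|/E_c)^{1/2}$, the function names, and the parameter values are assumptions of this sketch rather than statements of the text; it also illustrates the $2\pi$-periodicity in the flux and the suppression of the flux dependence for $|\omega| > E_c$.

```python
import cmath
import math


def momentum_sum(phi, omega, D=1.0, L=1.0, kmax=200_000):
    """Symmetric sum over q = 2*pi*k/L, |k| <= kmax, of
    (2/L) * D*(q - phi/L) / (D*(q - phi/L)**2 + omega)."""
    total = 0.0
    for k in range(-kmax, kmax + 1):
        u = (2.0 * math.pi * k - phi) / L  # q - phi/L
        total += (2.0 / L) * D * u / (D * u * u + omega)
    return total


def closed_form(phi, omega, D=1.0, L=1.0):
    """Im coth([L*sqrt(omega/D) + i*phi] / 2)."""
    a = 0.5 * (L * math.sqrt(omega / D) + 1j * phi)
    return (cmath.cosh(a) / cmath.sinh(a)).imag


D, L = 1.0, 1.0
E_c = (2.0 * math.pi) ** 2 * D / L ** 2  # Thouless energy as defined in the text
phi = 0.7

low = closed_form(phi, 0.01 * E_c)   # |omega| << E_c: strong flux dependence
high = closed_form(phi, 4.0 * E_c)   # |omega| > E_c: flux dependence is tiny
print(momentum_sum(phi, 0.01 * E_c), low, high)
```

The truncated sum converges only when positive and negative k are summed symmetrically, which mirrors the conditional convergence mentioned in footnote 116.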

Turning to the frequency summation, we use the fact that $|\omega|/E_c < 1$ to approximate
$\coth([(|\omega|/E_c)^{1/2} + i\Phi]/2) \simeq 2[(|\omega|/E_c)^{1/2} + i(\Phi \bmod 2\pi)]^{-1}$. This leads to the
expression


d%^T 2 - -d* 2 (2tt) 2 Im 

q,n,n' 

= d<i, 2 (2nE c ) 2 <f> J dx ^ = {2-KE c fd^ 2 - $ 3 ln 


duo 


(M /£ c )V 2 + i$ 

$ 2 + l> 

$2 


For small values of the flux, the logarithm can be approximated as $-\ln(\Phi^2)$ and
$\partial_{\Phi_2}\Phi^3\ln(\Phi^2) \sim 3\Phi^2\ln(\Phi^2)$. Using this approximation and adding the cooperon contribution,
we obtain

$$\langle I(\Phi_1)I(\Phi_2)\rangle \simeq -(2\pi E_c)^2\left(3(\Phi_1 - \Phi_2)^2\ln((\Phi_1 - \Phi_2)^2) - 3(\Phi_1 + \Phi_2)^2\ln((\Phi_1 + \Phi_2)^2)\right).$$

Setting $\Phi_1 = \Phi_2$ we arrive at the required result.


9.7.2 Working with the SU(N) Wess-Zumino term

In this problem, we develop some of the amazing mathematical features of the SU(N) WZ term
Eq. (9.58). The problem also illustrates the superiority of the coordinate-free calculus of differential
forms in topological quantum field theory. (Expressed in the standard languages of calculus, all formulae
below become intolerably messy - not to mention the fact that the underlying structures are much more
difficult to understand!)

(a) WZ terms generally originate from a closed (but only locally exact) differential form $\omega$ on
the target manifold. Show that, on SU(N), the form $\omega = \mathrm{tr}(g^{-1}dg \wedge g^{-1}dg \wedge g^{-1}dg)$ is
closed. (Why is it only locally exact? This question will be addressed in part (c) below.)

(b) Verify Eq. (9.61). Let us try to understand the normalization of the WZW action.
To start with, let us recall a statement from group theory stated in the Info block on
page 549: as far as topology is concerned, the subgroup SU(2) $\subset$ SU(N) preempts the
structure of SU(N), i.e. without loss of generality, we may restrict our discussion to the
SU(2) WZ action.

We next turn to the normalization of the WZW action. To start with, let us consider
$S^2$ as the boundary of the northern hemisphere $S^{3+}$ of the 3-sphere. We define the
WZW term by integration over $S^{3+}$. 117 To parameterize $S^{3+}$, we introduce a third
coordinate $x_3$ (in addition to the two coordinates $(x_1, x_2)$ parameterizing $S^2$), such that
$x_3 = 0$ defines the north pole and $x_3 = \pi/2$, the equatorial plane, $S^2$. For any field
$g(x_1, x_2) \in$ SU(2), the WZW action then becomes $\Gamma_+[g] = i\mathcal{N}\int_0^{\pi/2} dx_3\, \Gamma[g, x_3]$, where


116 In spite of the weak convergence $\sim |z|^{-1}$ of the integrand at large values of z, it is permissible to do the integral
in this manner: the second flux derivative implicitly levels the decay up to $\sim |z|^{-2}$.

117 Topologically, this domain is equivalent to the three-dimensional unit ball $B^3$ used in the text (just as the
northern hemisphere of the 2-sphere is topologically equivalent to the unit disk $B^2$).
$\Gamma[g, x_3] = \int dx_1 dx_2\, \epsilon^{\mu\nu\sigma}\,\mathrm{tr}(g^{-1}\partial_\mu g\, g^{-1}\partial_\nu g\, g^{-1}\partial_\sigma g)$, $\mu, \nu, \sigma = 1, 2, 3$, and $g(x_1, x_2, x_3)$
is a smooth extension of $g(x_1, x_2)$ from $S^2$ to $S^{3+}$. But, of course, it would have been
just as good to take the southern hemisphere $S^{3-}$ as our reference domain. To explore
the consequences of this ambiguity, we extend the definition of the third coordinate in
such a way that $x_3 = \pi$ corresponds to the south pole of $S^3$. In this case, the WZW
action would assume the form $\Gamma_-[g] = -i\mathcal{N}\int_{\pi/2}^{\pi} dx_3\, \Gamma[g, x_3]$, where $\Gamma[g, x_3]$ is defined
as above (albeit for an extension $g(x_1, x_2, x_3 > \pi/2)$).

For any reference field $g(x_1, x_2)$, the ambiguity,

$$\Gamma_+ - \Gamma_- = i\mathcal{N}\int_0^{\pi} dx_3 \int dx_1 dx_2\, \epsilon^{\mu\nu\sigma}\,\mathrm{tr}(g^{-1}\partial_\mu g\, g^{-1}\partial_\nu g\, g^{-1}\partial_\sigma g) = 2\pi n,$$

must be an integer multiple of $2\pi$. 118 Consider, thus, the field $g(x_1, x_2) = i\,n(x_1, x_2)\cdot\sigma$,
where $n(x_1, x_2)$ is a unit vector. As its extension into the northern/southern hemisphere,
we choose $g(x_1, x_2, x_3) = \cos(x_3) + i\sin(x_3)\,n(x_1, x_2)\cdot\sigma = \exp(i x_3\, n(x_1, x_2)\cdot\sigma) \in$ SU(2).
This is an SU(2)-instanton, i.e. a mapping $S^3 \to$ SU(2) that cannot be continuously
deformed to the unit mapping.

(c) Show that $\Gamma_+[g] - \Gamma_-[g] = i\mathcal{N}\,24\pi^2 \times$ (integer), which enforces $\mathcal{N} = 1/12\pi$ as the
normalization of the WZW action. As a corollary, we note that the form $\omega$ on which
all of our discussion is based cannot be globally exact. (If it had been, Stokes’ theorem
would imply a vanishing of the integral over the boundary-less manifold $S^3$.)

(d) Show that, for the particular reference configuration considered in part (c), $g = i\,n\cdot\sigma \in$
SU(2), the WZW action reduces to the $\theta$-term for the unit-modular field $n \in S^2$:
r[in • cr] = -.S' top iii|; where Stop^] = ^ J g2 d 2 x n • (<9 x n x d 2 n). 
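The properties of the extension $g(x_1, x_2, x_3) = \cos(x_3) + i\sin(x_3)\,n\cdot\sigma$ used in this problem can be verified numerically at a single point. The sketch below checks that the extension is an SU(2) matrix, that it coincides with $\exp(i x_3\, n\cdot\sigma)$, and that it interpolates from the unit matrix at the north pole through $i\,n\cdot\sigma$ at the equator to minus the unit matrix at the south pole. The helper names, the sample vector, and the Taylor-series exponential are illustrative assumptions.

```python
import numpy as np

# Pauli matrices sigma_1, sigma_2, sigma_3
SIGMA = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)


def n_dot_sigma(n):
    """Contract a real 3-vector with the Pauli matrices."""
    return np.einsum("a,aij->ij", n, SIGMA)


def extension(n, x3):
    """g(x1, x2, x3) = cos(x3) + i sin(x3) n.sigma for a unit vector n."""
    return np.cos(x3) * np.eye(2) + 1j * np.sin(x3) * n_dot_sigma(n)


def exp_series(M, terms=40):
    """Matrix exponential from its Taylor series (adequate for small 2x2 matrices)."""
    out = np.eye(2, dtype=complex)
    term = np.eye(2, dtype=complex)
    for k in range(1, terms):
        term = term @ M / k
        out = out + term
    return out


n = np.array([0.3, -0.5, 0.4])
n = n / np.linalg.norm(n)  # unit vector
x3 = 1.1
g = extension(n, x3)

unitarity_error = np.abs(g @ g.conj().T - np.eye(2)).max()
det_error = abs(np.linalg.det(g) - 1.0)
exp_error = np.abs(g - exp_series(1j * x3 * n_dot_sigma(n))).max()
equator_error = np.abs(extension(n, np.pi / 2) - 1j * n_dot_sigma(n)).max()
south_pole_error = np.abs(extension(n, np.pi) + np.eye(2)).max()

print(unitarity_error, det_error, exp_error, equator_error, south_pole_error)
```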


Answer: 


(a) Using the fact that $g^{-1}dg\,g^{-1} = -dg^{-1}$ (why?) the form $\omega$ can be rewritten as $\omega =
\mathrm{tr}(dg^{-1} \wedge dg \wedge dg^{-1}g)$, i.e. $d\omega = -\mathrm{tr}(dg^{-1} \wedge dg \wedge dg^{-1} \wedge dg)$. To show that this expression
vanishes, one may make use of the fact that, for arbitrary matrix-valued forms, $\mathrm{tr}(\omega_1 \wedge
\omega_2) = (-)^{\deg(\omega_1)\deg(\omega_2)}\,\mathrm{tr}(\omega_2 \wedge \omega_1)$. 119 Applied to $d\omega$, this yields $d\omega = +\mathrm{tr}(dg \wedge dg^{-1} \wedge
dg \wedge dg^{-1}) = \mathrm{tr}(dg^{-1} \wedge dg \wedge dg^{-1} \wedge dg) = -d\omega$, where in the last step we applied the
formula $g^{-1}dg = -dg^{-1}g$ to all derivatives.
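The identity $dg^{-1} = -g^{-1}dg\,g^{-1}$ behind the “(why?)” follows from differentiating $g\,g^{-1} = 1$. It can also be checked with a finite difference, as in the following sketch; the particular matrices and the step size are arbitrary illustrative choices.

```python
import numpy as np

# An invertible matrix (det = 7) and an arbitrary direction of variation.
g0 = np.array([[2.0, 1.0, 0.0],
               [0.0, 1.0, 1.0],
               [1.0, 0.0, 3.0]])
dg = np.array([[0.0, 1.0, 2.0],
               [1.0, 0.0, -1.0],
               [3.0, 1.0, 0.0]])
h = 1e-6

# Central finite difference of the inverse along the path g(t) = g0 + t*dg.
d_inv_numeric = (np.linalg.inv(g0 + h * dg) - np.linalg.inv(g0 - h * dg)) / (2 * h)

# Closed form: d(g^{-1}) = -g^{-1} dg g^{-1}.
g_inv = np.linalg.inv(g0)
d_inv_formula = -g_inv @ dg @ g_inv

error = np.abs(d_inv_numeric - d_inv_formula).max()
print(error)
```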


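118 Geometrically, this is the integral of the differential 3-form $g^*\omega$ (i.e. the pullback of the SU(2) form $\omega$ by the
field g to a form on $S^3$) over $S^3$.

119 Proof: $\mathrm{tr}(\omega \wedge \eta) = \omega_{ij} \wedge \eta_{ji} = (-)^{\deg(\omega)\deg(\eta)}\,\eta_{ji} \wedge \omega_{ij} = (-)^{\deg(\omega)\deg(\eta)}\,\mathrm{tr}(\eta \wedge \omega)$.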
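(b) We want to explore what happens to the integral $\Gamma[g] = -\frac{i}{12\pi}\int_{B^3} g^*\omega$ upon variation
$g \to e^W g \simeq (1 + W)g$. Using the relation $g^{-1}dg \to g^{-1}dg + g^{-1}dW\,g$, we obtain

$$\Gamma[(1 + W)g] - \Gamma[g] \simeq -\frac{3i}{12\pi}\int_{B^3} \mathrm{tr}(g^{-1}dW\,g \wedge g^{-1}dg \wedge g^{-1}dg)$$
$$= \frac{i}{4\pi}\int_{B^3} \mathrm{tr}(dW \wedge dg \wedge dg^{-1}) = \frac{i}{4\pi}\int_{B^3} d\left[\mathrm{tr}(W \wedge dg \wedge dg^{-1})\right]$$
$$= \frac{i}{4\pi}\int_{S^2} \mathrm{tr}(W \wedge dg \wedge dg^{-1}),$$

where, in the last line, we have made use of Stokes’ theorem.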

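(c) By an elementary rearrangement of fields,

$$\Gamma_+ - \Gamma_- = 3i\mathcal{N}\int_0^{\pi} dx_3 \int dx_1 dx_2\, \epsilon^{ij}\,\mathrm{tr}(g\,\partial_3 g^{-1}\,\partial_i g\,\partial_j g^{-1}),$$

where the coordinates i, j = 1, 2. A straightforward calculation shows that $g\,\partial_3 g^{-1} =
-i\,n\cdot\sigma$. Using the auxiliary identities $\partial_i g = i\sin(x_3)\,\partial_i n\cdot\sigma$ and $\mathrm{tr}(n_1\cdot\sigma\; n_2\cdot\sigma\; n_3\cdot\sigma) =
2i\,n_1\cdot(n_2 \times n_3)$, we thus obtain

$$\Gamma_+ - \Gamma_- = 6i\mathcal{N}\int_0^{\pi} dx_3\, \sin^2(x_3) \int dx_1 dx_2\, \epsilon^{ij}\, n\cdot(\partial_i n \times \partial_j n)$$
$$= 6i\pi\mathcal{N}\int dx_1 dx_2\; n\cdot(\partial_1 n \times \partial_2 n) = 24i\pi^2\mathcal{N} \times (\text{integer}),$$

where the integer is the number of times the unit vector n wraps around the unit sphere.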

(d) By analogy with the calculation performed in (c), we obtain F + [in • cr] = F + = 
6i7rW / dx 1 dx 2 n • (<9in x <9 2 n) = 7r5t op [n]. 
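The ingredients of the calculation in (c) lend themselves to a numerical check: the Pauli-matrix trace identity, the integral $\int_0^\pi dx_3 \sin^2(x_3) = \pi/2$, and the area $\int dx_1 dx_2\; n\cdot(\partial_1 n \times \partial_2 n) = 4\pi$ for a field that wraps the sphere once. The sketch below assumes the standard spherical parameterization of n as the singly wrapping configuration; the grid sizes, step size, and names are illustrative choices.

```python
import numpy as np

SIGMA = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)


def n_dot_sigma(n):
    return np.einsum("a,aij->ij", n, SIGMA)


# (1) tr(n1.sigma n2.sigma n3.sigma) = 2i n1.(n2 x n3) for arbitrary vectors.
n1 = np.array([0.2, -1.0, 0.5])
n2 = np.array([1.5, 0.3, -0.7])
n3 = np.array([-0.4, 0.9, 1.1])
lhs = np.trace(n_dot_sigma(n1) @ n_dot_sigma(n2) @ n_dot_sigma(n3))
trace_identity_error = abs(lhs - 2j * np.dot(n1, np.cross(n2, n3)))


def unit_vector(theta, phi):
    """Spherical parameterization of n; wraps the unit sphere exactly once."""
    return np.stack([np.sin(theta) * np.cos(phi),
                     np.sin(theta) * np.sin(phi),
                     np.cos(theta)], axis=-1)


def wrapping_integral(num=400, h=1e-5):
    """Midpoint-rule estimate of the integral of n.(d_1 n x d_2 n) over (theta, phi)."""
    dtheta, dphi = np.pi / num, 2.0 * np.pi / num
    theta = (np.arange(num) + 0.5) * dtheta
    phi = (np.arange(num) + 0.5) * dphi
    T, P = np.meshgrid(theta, phi, indexing="ij")
    n = unit_vector(T, P)
    d1 = (unit_vector(T + h, P) - unit_vector(T - h, P)) / (2 * h)
    d2 = (unit_vector(T, P + h) - unit_vector(T, P - h)) / (2 * h)
    density = np.einsum("...a,...a->...", n, np.cross(d1, d2))
    return density.sum() * dtheta * dphi


# (2) Integral of sin^2(x3) over [0, pi] by the midpoint rule.
num = 400
x3 = (np.arange(num) + 0.5) * np.pi / num
sin2_integral = np.sum(np.sin(x3) ** 2) * np.pi / num

# (3) Coefficient of i*N in Gamma_+ - Gamma_-; the factor 2 comes from eps^{ij}.
area = wrapping_integral()
coefficient = 6 * sin2_integral * 2 * area

print(trace_identity_error, sin2_integral, area, coefficient, 24 * np.pi ** 2)
```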


9.7.3 Renormalization group analysis of the SU(N) Wess-Zumino model 

Before embarking on this problem, it is helpful to recapitulate the RG analysis of the SU(N) nonlinear
$\sigma$-model discussed in Section 8.5. We want to study the RG flow of the SU(N) WZW model, as specified
by Eq. (9.65). As in our previous analyses of nonlinear $\sigma$-models, we can split a general field configuration
$g = g_s g_f$ into a slow and a fast part, and expand the latter as $g_f = 1 + W + W^2/2 + \cdots$, where
$W \in$ su(N), i.e. W lives in the Lie algebra su(N) (the algebra of anti-Hermitian traceless matrices).
To compute the one-loop RG equations, we need to expand the action to quadratic order in W and
compute all contributions to the functional integral that contain (a) one fast momentum integration
and (b) no more than two derivatives acting on a slow field.

(a) Show that the expansion of the action $S[g_s g_f]$ to second order in the generators takes
the form S[gfg s ] = + S[g s , W] + where 

SW[W] = ~ [ d 2 xtx(d li Wd ll W), 

A Js2 (9.88) 

%s, W] = -j J ^ d 2 x tr [d^W, W\ j . 
(b) One-loop corrections to the action are obtained by expanding the functional to second order
in $S[g_s, W]$ and integrating over W: $S[g_s] \to S[g_s] - \frac{1}{2}\langle S[g_s, W]^2\rangle_W$. Use the results of
Section 8.5 (in particular those derived in the exercises on page 463) to confirm that (i) only
the gradient term of the action is renormalized and (ii) the RG equation for its coupling
constant is given by Eq. (9.66).


Answer: 


(a) The first line of Eq. (9.88) and the first term in the second line are obtained by substitution
of $g_s g_f$ into the gradient term of the action (9.57) and expanding to second
order in the generators W. The second term in $S[g_s, W]$, a descendant of the WZ action,
is best derived in the language of differential forms: substitution of $(g_s g_f)^{-1}d(g_s g_f) =
g_f^{-1}(g_s^{-1}dg_s + (dg_f)g_f^{-1})g_f$ into the pullback $(g_s g_f)^*\omega = \mathrm{tr}(g^{-1}dg \wedge g^{-1}dg \wedge g^{-1}dg)|_{g = g_s g_f}$
gives

$$(g_s g_f)^*\omega = g_s^*\omega - \tfrac{3}{2}\,\mathrm{tr}(dg_s^{-1} \wedge dg_s \wedge [W, dW]) + 3\,\mathrm{tr}(dW \wedge dW \wedge g_s^{-1}dg_s) + O(W^3)$$
$$= g_s^*\omega + \tfrac{3}{2}\, d\left[\mathrm{tr}([W, dW] \wedge g_s^{-1}dg_s)\right] + O(W^3).$$

Application of Stokes’ theorem thus leads to

$$\Gamma[g_s g_f] = -\frac{i}{12\pi}\int_{B^3} (g_s g_f)^*\omega = \Gamma[g_s] - \frac{i}{8\pi}\int_{B^3} d\left[\mathrm{tr}([W, dW] \wedge g_s^{-1}dg_s)\right] + O(W^3)$$
$$= \Gamma[g_s] - \frac{i}{8\pi}\int_{S^2} \mathrm{tr}([W, dW] \wedge g_s^{-1}dg_s) + O(W^3)$$
$$= \Gamma[g_s] - \frac{i}{8\pi}\int_{S^2} d^2x\, \epsilon^{\mu\nu}\,\mathrm{tr}([W, \partial_\mu W]\, g_s^{-1}\partial_\nu g_s) + O(W^3),$$

where the $O(W^2)$ term appears as the second contribution to the action $S[g_s, W]$.

(b) Defining = g~ 1 {d tl -(- | ^e^, v d v )g s , the action S[g s ,lE] assumes the form Sjgjj, W] = 
§lg- Y7, pq Pp^ T {^^i7)- But for the difference in the definition of the field <3>, this is equal 
to the fast-slow action of the standard SU(N) model. Using the results derived in the
exercises on page 463, we thus obtain

-\(Sb.,W]*)w = -^jd\ tr(4yf„) 

= ( x - (b ) ) I 

This result confirms that only the gradient term in the action is renormalized. By 
proceeding in direct analogy to the discussion of Section 8.5, it is a straightforward 
matter to derive the corresponding RG equation. The result is given by Eq. (9.66).

Figure 9.21 On the formation of surface wave excitations in an FQHE droplet. For an explanation,
see the main text.

9.7.4 Fractional quantum Hall effect: physics at the edge 

In Section 9.3.7 above, we have seen that the bulk physics of an integer quantum Hall system is intimately 
connected to that of its boundaries. These connections could be disclosed from both direct physical reasoning 
(the essence of the Laughlin-Halperin semi-phenomenological approach) and an analysis of the behavior of 
the microscopic theory under gauge transformations. Following the seminal work of Wen, 120 here we show 
that an equally close connection between bulk and boundaries exists for the FQHE. Again, we will be able to 
deduce the boundary physics by phenomenological, and by field-theory-oriented, reasoning. 

Consider a finite FQHE system. For simplicity, we assume the system to be disc-shaped, 
although the details of the geometry will not be of relevance throughout. At the system 
boundaries, the two-dimensional electron gas is confined by a boundary potential V which we 
assume to be linear (see Fig. 9.21). In the first part of this problem, we want to describe the 
dynamics of edge excitations on phenomenological grounds. An important first observation is 
that the bulk system - presumed to sit at an FQHE plateau value - is in an “incompressible 
state.” This follows from the fact that, for an integer number of composite fermion Landau 
levels, the system does not support gapless excitations: $\partial\mu/\partial N \to \infty$. Now, 121 $\partial\mu/\partial N \sim
\kappa^{-1} = -V(\partial P/\partial V)_N$ is proportional to the inverse compressibility, i.e. the lack of low-
energy excitations implies an incompressible state. We should, therefore, think of our system
as a rigid “liquid” rather than as a gas. 

Given that the state is incompressible, the lowest-energy excitations of the system will 
be deformations of the boundary (similar to boundary distortions of a puddle of water). We 
may characterize these distortions by a surface density profile $\rho(x)$, where x parameterizes
the boundary. 122 

(a) To derive the boundary action on phenomenological grounds, proceed in two steps. First
derive the energy of a boundary distortion $\rho$. Second, argue why a distortion profile
$\rho(x, t)$ propagates along the boundary at some constant velocity v. Derive an effective
equation of motion for $\rho$ and use it to determine the canonical momentum associated

120 For a review, see X. G. Wen, Theory of the edge states in fractional quantum Hall effects, Int. J. Mod. Phys. B6
(1992), 1711-62.

121 G. Mahan, Many Particle Physics (Plenum Press, 1981).

122 More precisely, the geometric shape of the system is described by some height function h(x) (see Fig. 9.21).
Using the fact that the constant charge density of the system (cf. Eq. (9.17)) is given by dx dx j_, where 
x± is the coordinate perpendicular to the system boundary, p(x)dx = Jq^ dx± dx = %£h(x)dx, i.e. the 
height profile is proportional to a density profile. 



with the “displacement field” $\phi$ defined by $\partial_x\phi = 2\pi\rho$. Show that the Hamiltonian
action of the displacement field is given by

$$S[\phi] = \frac{1}{4\pi\nu}\int dx\, d\tau \left(v(\partial_x\phi)^2 - i\,\partial_x\phi\,\partial_\tau\phi\right), \tag{9.89}$$

the action of a “chiral Luttinger liquid” - the terminology follows from the fact that
Eq. (9.89) describes the uni-directional propagation of a density field. (The connection
to the Luttinger liquid is made explicit by decomposing the coordinate-momentum pair
describing one-dimensional charge density modes, $(\partial_x\phi, \theta)$, into left- and right-moving
components, $\phi = \phi_L + \phi_R$, $\theta = \phi_L - \phi_R$. Substituting this representation into the
action (4.49), the latter decomposes into a left- and a right-moving chiral action.) (Hint:
A boundary distortion costs energy because of the presence of a voltage gradient. The
presence of a constant drift follows, e.g. from the finiteness of the Hall conductivity.
Also remember Eq. (9.17) for the density of a quantum Hall system.)

(b) We next derive the action from a complementary, bulk-oriented perspective. To this
end, consider the CS action of the bulk system, Eq. (9.75). Throughout, it will be
convenient (although not strictly necessary as one may translate all expressions back to
the traditional representation) to use the language of differential forms. Interpreting $a_\mu$
as the coefficients of a differential 1-form, $a = a_\mu dx^\mu$, Eq. (9.75) assumes the compact
form

$$S_{CS}[a] = -\int_{D\times S^1} a \wedge da,$$

where the integral extends over the Cartesian product of the bulk of the system, D,
and imaginary time $S^1$. Show that $S_{CS}$ is gauge invariant up to a boundary term. To
“cure” the gauge deficiency of the action, we adopt a gauge-fixing condition $a_0|_{\partial D} =
0$. Use this condition to integrate over the component $a_0$ in the entire system. Show
that this integration leads to the global constraint $f_{ij} = 0$, where $f_{ij}$ are the real-
space components of the field strength tensor $f = da = \partial_\mu a_\nu\, dx^\mu \wedge dx^\nu$. This condition
implies that the real-space component of the vector potential a can be represented as
a pure gauge, $a = d_s\phi = \partial_i\phi\, dx^i$, where we use the symbol $d_s$ to denote the real space
contribution to the exterior derivative. Plug this ansatz into the residual contribution
to the action (after $a_0$ has been integrated out) to reduce the field strength tensor to
the boundary action $S[\phi] = \int_{\partial D} dx \int d\tau\, \partial_x\phi\,\partial_\tau\phi$. Recalling that the CS action enters
the theory as $\exp(\frac{i}{4\pi\nu}S_{CS})$, 123 we conclude that the effective boundary action induced
by the gauge non-invariance of $S_{CS}$ reads

$$S_{eff}[\phi] = -\frac{i}{4\pi\nu}\int_{\partial D} dx \int d\tau\, \partial_x\phi\,\partial_\tau\phi.$$


123 To be precise, the coupling constant $1/4\pi\nu$ appeared after (a) the system had been coupled to an external
vector potential, (b) the coupling between the statistical vector potential and the matter degrees of freedom
had been taken into account, and (c) the statistical vector potential had been integrated out (see Section
9.5.3). Here we assume that all these steps are implied, i.e. we should think of a as an external electromagnetic
field and $\frac{i}{4\pi\nu}S_{CS}$ as the dominant (first-order derivative) contribution to its action.
(c) One may recognize $S_{eff}[\phi]$ as the second contribution to the action (9.89). The obvious
next question to ask is why we did not obtain the full action of the chiral Luttinger liquid.
Indeed, it takes only a moment’s thought to realize that a first principles derivation
of the full boundary action from the bulk theory is out of the question. The point
is that the missing contribution $\sim v(\partial_x\phi)^2$ depends on the boundary velocity, which
in turn is determined by the steepness of the boundary potential. However, the bulk
action does not know of the structure of the boundary. We can, however, employ a
trick to infuse the required information on the system boundaries into the bulk theory.
(Although this manipulation is fairly ad hoc, it shows, at least, that the bulk and the
boundary theory are not inconsistent.) Indeed, the gauge-fixing condition employed in
the construction above involves a lot of arbitrariness. For example, instead of $a_0 = 0$
the linear combination $a_0 + v a_x = 0$ would have been just as good.

To explore the consequences of the new gauge-fixing condition, let us introduce a new
set of coordinates, $\tilde x = x - vt$, $\tilde t = t$, and $\tilde x_\perp = x_\perp$. Recalling the transformation behavior
of differential forms (see page 537) under coordinate changes, compute the components
of the vector potential $a = \tilde a_\mu d\tilde x^\mu$. Show that the gauge fixing condition translates to
$\tilde a_0 = 0$. We now benefit from the fact that we have expressed the CS term in a coordinate-
invariant manner, i.e. it keeps the same form, no matter whether we express it in terms
of the old or the new coordinates (a point on which to reflect!). However, in the new
coordinates, the gauge-fixing condition assumes the same form $\tilde a_0 = 0$ as that considered in
(b). This means that $S_{CS}$ can be reduced to a boundary contribution, which, however,
is expressed in new coordinates. Finally, translate back to the old coordinates to obtain
the full chiral Luttinger action. The message to be taken home from this construction is
that a boundary-gauge-fixing condition can be used to establish the equivalence between
the boundary Luttinger and the bulk CS description of the system.

(d) Notwithstanding its innocuous appearance, the action (9.89) describes astonishingly
rich physics. Referring for an in-depth discussion to the literature (see Wen’s review
article 120 ), we here mention only a few of the characteristic features of the system
described by Eq. (9.89). To establish contact with physically observable quantities, we
first need to derive an expression for the electron operator $c^\dagger$ (the chiral analog of the
fermion relation (4.46)). Arguing as in Section 4.3 above, we start from the observation
that the creation of an electron goes along with the creation of a unit charge. Building
on the connection between the displacement field and the charge operator, show that
the chiral bosonic representation of the fermion operator will contain a piece

$$c^\dagger(x) \sim e^{i\nu^{-1}\phi(x)}.$$

To complete the manufacturing of a fermion operator, we need to ensure that $c^\dagger$ obeys
fermionic exchange statistics. Use the commutator relations of the field $\phi$ to show that

$$e^{i\nu^{-1}\phi(x)}\, e^{i\nu^{-1}\phi(x')} = e^{i\nu^{-1}\phi(x')}\, e^{i\nu^{-1}\phi(x)} \times e^{i\pi\nu^{-1}\mathrm{sgn}(x - x')}.$$

This means that $\exp(i\nu^{-1}\phi)$ describes a fermion provided that $\nu = 1/m$ belongs to the
principal sequence.
INFO But what will happen for a general $\nu = n/m$? It is, in fact, not clear how to repair the
ansatz above so as to generate fermion statistics for general filling fractions. Indeed,
it turns out that the entire edge construction above is too narrow to describe the general
case. We have seen in our discussion of the IQHE above that, in cases where p > 1 Landau
levels are occupied, the edge hosts p edge channels. Similarly, an FQHE system with filling
fraction $\nu = p/(2sp + 1)$ (i.e. a system in which the composite fermions experience an effective
$\nu = 1/p$ IQHE) will have p chiral edge channels circulating at its boundaries. However, the
construction of this edge channel hierarchy is beyond the scope of the present text and we
refer to the literature.

Notice that we may interpret $e^{i\nu^{-1}\phi} = [e^{i\phi}]^{\nu^{-1}}$ as the $\nu^{-1}$th power of the more elementary
object $e^{i\phi}$. According to the construction above, $e^{i\phi}$ creates an entity of fractional
charge $e \times \nu$. Also, the states created by $e^{i\phi}$ obey fractional exchange statistics.
Comparing with our discussion in the main text, we identify $\exp(i\phi)$ as the creator of the
boundary variant of the fractionally charged Laughlin quasi-particles.


Answer: 


(a) Owing to the presence of an (approximately linear) confining potential V = Ex _l, 
a boundary distortion costs an energy (notation taken from footnote 122 ) H = 
fdxj™ dx±aEx± = aMMfdx h 2 (x) = ^ f dx ( d x cf>) 2 . 

To understand why the density profile propagates along the boundary, notice that the 
(confining) electric field E perpendicular to the boundary will generate a Hall current 
density j ~ (J^E = <jB~ 1 E tangential to the boundary. The total boundary current is 
obtained by integrating the current density from 0 to h(x), i.e. I ~ pE/B = d x </>E /2 t:B. 
Integrating the continuity equation $\partial_t\rho = \partial_x j$ over $x$ and substituting the identification
of the current above, we obtain the equation of motion $\partial_t\phi = v\,\partial_x\phi$, where the velocity
$v \simeq E/B$ depends in a non-universal way on the boundary potential. This equation is
solved by $\phi = \phi(x + vt)$, i.e. a uniformly propagating density distribution. Switching to
momentum space, we may interpret $\dot\phi_k = \delta H/\delta\pi_k = ivk\,\phi_k$ as a Hamiltonian equation
of motion determining the momentum $\pi$ conjugate to $\phi$. Comparison$^{124}$ with
$H = \frac{v}{4\pi\nu}\sum_k k^2\,\phi_k\phi_{-k}$ leads to the identification $\pi_k = -\frac{ik}{2\pi\nu}\phi_{-k}$, i.e. the strange-looking
prediction that the variable $\phi_k$ is canonically conjugate to its own derivative,

$$ [\partial_x\phi(x), \phi(x')] = 2\pi i\nu\,\delta(x - x'). \qquad (9.90) $$

Remembering that the Hamiltonian action of a system is given by $\int d\tau\,(H - i\pi\,\partial_\tau\phi)$,
we arrive at the required result.


$^{124}$ Deviating from our standard conventions, we define the Fourier transform by
$\phi_k = L^{-1/2}\int dx\, e^{ikx}\phi(x)$.
(b) Under a change of gauge, $a \to a + dg$, where $g$ is a function, the CS action transforms
as

$$ S_{\rm CS}[a + dg] = -\int_{D\times S^1} (a + dg)\wedge d(a + dg) = S_{\rm CS}[a] - \int_{D\times S^1} dg\wedge da = S_{\rm CS}[a] - \int_{\partial D\times S^1} g\,da, $$

where we noticed that $dg\wedge da = d(g\,da)$ and applied Stokes' theorem. To remove the
boundary gauge ambiguity, we need to fix a gauge at $\partial D\times S^1$, e.g. by setting $a_0|_{\partial D} = 0$.
Decomposing $a = a_0 + \tilde a$ and $d = \partial_\tau\,d\tau + \partial_{x_i}dx_i \equiv d_0 + \tilde d$ into temporal and real-space
contributions, the action assumes the form

$$ S_{\rm CS}[a] = -\int (a_0 + \tilde a)\wedge(d_0 + \tilde d)(a_0 + \tilde a) $$
$$ \qquad = -\int \left(a_0\wedge\tilde d\tilde a + \tilde a\wedge d_0\tilde a + \tilde a\wedge\tilde d a_0 + \tilde a\wedge\tilde d\tilde a\right) $$
$$ \qquad = -\int \left(2a_0\wedge\tilde d\tilde a + \tilde a\wedge d_0\tilde a + \tilde a\wedge\tilde d\tilde a\right), $$

where we have used the skew-symmetry of the exterior product and in the crucial second
equality integrated by parts.$^{125}$ The boundary term corresponding to this integration
vanishes due to the gauge-fixing condition $a_0|_{\partial D} = 0$. Since it enters the action linearly,
the temporal component $a_0$ can be integrated out. As a result, we obtain a functional
$\delta$-distribution globally enforcing the constraint $\tilde d\tilde a = \tilde f = 0$, i.e. a vanishing of the spatial
components of the field strength tensor. This in turn implies that $\tilde a$ is a pure gauge,
$\tilde a = \tilde d\phi$, where $\phi$ is some function. Substituting this representation into the action, we
obtain

$$ S[\phi] = -\int_{D\times S^1} \tilde d\phi\wedge d_0\tilde d\phi = -\int_{D\times S^1} d(\tilde d\phi\wedge d_0\phi) = -\int_{\partial D\times S^1} \tilde d\phi\wedge d_0\phi = \int_{\partial D} dx\int d\tau\, \partial_x\phi\,\partial_\tau\phi, $$

where in the last step we switched back to conventional notation.
Using the fact that $a = a_\mu dx_\mu = a_\mu (\partial x_\mu/\partial x'_\nu)\,dx'_\nu \equiv a'_\nu dx'_\nu$ and comparing coefficients,
we obtain the identifications $a'_x = a_x$, $a'_{x_\perp} = a_{x_\perp}$, $a'_0 = a_0 + v a_x$. “Form invariance” of
the CS-action means that it does not matter whether we express it in terms of the old
or the new coordinates:

$$ S_{\rm CS}[a] = -\int (a_\mu dx_\mu)\wedge(\partial_{x_\nu} dx_\nu)\wedge(a_\lambda dx_\lambda) = -\int (a'_\mu dx'_\mu)\wedge(\partial_{x'_\nu} dx'_\nu)\wedge(a'_\lambda dx'_\lambda). $$

Focusing on the second representation (wherein $a'_0|_{\partial D} = 0$), and repeating the analysis
of (b), we obtain the real-time representation of the action $S_{\rm CS} = \int_{\partial D} dx'\int dt'\, \partial_{x'}\phi\,\partial_{t'}\phi$.
Finally, using the fact that $\partial_{x'} = \partial_x$ and $\partial_{t'} = v\,\partial_x + \partial_t$, we arrive at
$S_{\rm CS}[\phi] = \int_{\partial D} dx\int dt\, [v(\partial_x\phi)^2 + \partial_x\phi\,\partial_t\phi]$. Switching back to imaginary time and attaching the
coupling constant, we obtain Eq. (9.89).


(d) Since $\rho = \frac{1}{2\pi}\partial_x\phi$, the presence of a unit charge localized at $x_0$, $\rho(x) = \delta(x - x_0)$,
amounts to the presence of a “kink” of height $2\pi$ in $\phi$, i.e. $\phi(x) = 2\pi\,\Theta(x - x_0)$. Now,
according to Eq. (9.90), $\phi$ and $\partial_x\phi/2\pi\nu$ form a canonically conjugate pair. This implies
that $U_d(x) \equiv \exp(-i d\,\partial_x\phi/2\pi\nu)$ acts like a unit translation operator on $\phi$: $U_d(x)\phi(x') =
\phi(x') + d\,\delta(x - x')$. The unit-charge kink is generated by the action of $\int_{-\infty}^{x} dx'\, U_{2\pi}(x') =
\exp(-i\phi/\nu)$ on the field $\phi$. Reformulating Eq. (9.90) as $[\phi(x), \phi(x')] = i\pi\nu\,\mathrm{sgn}(x - x')$,
we find that
This chapter provides an introduction to nonequilibrium statistical (field) theory. In the following, we
introduce a spectrum of concepts central to the description of many particle systems out of statistical 
equilibrium. We will see that key elements of the theory - Langevin theory and the formalism of the 
Fokker-Planck equation - can be developed from the coherent framework of a path integral formalism. 
Applications discussed below include metastability, macroscopic quantum tunneling, driven diffusive 
lattices, and directed percolation. While the emphasis in this chapter is on classical nonequilibrium 
phenomena, the quantum theory of nonequilibrium systems will be discussed in the next chapter. 


The world is a place full of nonequilibrium phenomena: an avalanche sliding down a
sandpile, a traffic jam forming at rush hour, the dynamics of electrons inside a strongly 
voltage-biased electronic device, the turmoil of markets following an economic instability, 
the diffusion-limited reaction of chemical constituents and many others are examples of sit- 
uations where a large number of correlated constituents evolves under “out-of-equilibrium” 
conditions. Statistical nonequilibrium physics is concerned with the identification and under- 
standing of universal structures that characterise these phenomena. Notwithstanding the 
existence of powerful principles of unification, it is clear that a theory addressing the dazzling 
multitude of nonequilibrium phenomena must be multi-faceted. And indeed, nonequilibrium 
statistical physics is a strongly diversified field comprising many independent sub-disciplines. 
But this is not the only remarkable feature of this branch of physics. Considering the devel- 
opment of physics in, say, the last two decades, one may observe that nonequilibrium physics 
is a field of disproportionate growth. Occasionally one may get the impression that many 
particle theory as a whole is drifting towards the nonequilibrium! 

INFO What does the attribute “nonequilibrium” actually define? Perhaps the most straightforward
definition identifies it as the opposite of thermodynamic equilibrium, where the latter is
defined by the following two conditions:$^1$

An equilibrium system is characterized by a unique set of extensive and intensive variables 
which do not change in time. 

After isolation of the system from its environment, all the variables remain unchanged. 

The latter condition is necessary to distinguish equilibrium from stationary nonequilibrium
states. For example, the particle distribution function of an electronic conductor subject to a
strong time-independent voltage bias will be time-independent (first condition fulfilled), yet
different from the equilibrium Gibbs distribution - a stationary nonequilibrium situation. However,
after removal of the voltage-carrying leads, or “environment,” it will relax back to the Gibbs
form (second condition violated).

$^1$ W. Ebeling and I. M. Sokolov, Statistical Thermodynamics and Stochastic Theory of Nonequilibrium Systems
(World Scientific, 2005).

The restrictiveness of the two conditions above vividly illustrates that thermodynamic equi- 
librium is a rare exception, a Platonic ideal scarcely realised in “real life.” 

What are the driving forces behind this development? For one thing, nonequilibrium statis- 
tical physics is a field of unusually high application potential. Many of the rapidly evolving 
interdisciplinary activities between physics and biology, or physics and the socio-economical 
sciences, expressly address nonequilibrium problems. But there is, of course, also intrinsic 
interest in the physics of nonequilibrium. For example, the response of an electronic device 
to shake-up by a sudden electromagnetic pulse evidently contains more structured informa- 
tion on its intrinsic dynamics than the (near equilibrium) linear response. The wealth of 
information encoded in the dynamics of strongly-driven many particle systems motivates 
the extension of existing (equilibrium) theories into the realm of nonequilibrium. Also, var- 
ious many particle systems of interest are observed under nonequilibrium conditions (e.g. 
cold atom condensates), or intrinsically in a state of nonequilibrium (e.g. lasers). To under- 
stand the rich physical behavior of such systems, we need to develop a quantum theory of 
nonequilibrium phenomena. 

From what has been said above, it should be clear that we will not be able to adequately 
introduce the full body of nonequilibrium statistical physics in this textbook; we really have 
to restrict ourselves to an absolute minimum of essentials. In the following we focus on 
areas of nonequilibrium statistical physics that are of relevance specifically to condensed 
matter physics. Interesting developments relevant to, say, the physics of biological or socio- 
economic systems are not covered. In line with the general philosophy of the book, the 
material is organized according to the conceptual structure of the theory (rather than by 
fields of application). We start by introducing elements of probability theory and stochastic
processes, Langevin theory, the theory of noise, and the theory of the Fokker-Planck
equation. It will then be seen that various connections between these elements can be understood
from the unifying field theoretical framework of the Martin-Siggia-Rose-Janssen-de
Dominicis (MSRJD) field integral. We apply the MSRJD formalism to introduce elements 
of the theory of dynamic phase transitions. In the next chapter we then identify the MSRJD 
field integral as the classical limit of a more general theory, the Keldysh field integral of 
quantum nonequilibrium statistical mechanics. 

INFO In this chapter, there is a lot of discussion about different types of “probability.” For 
the convenience of the reader, a few elements of probability theory are summarized below. 
(This material need not be studied in a coherent fashion, and can be used as a glossary whenever 
necessary.) 

Although we shall not enter a discussion of the foundations of probability theory, it may be 
worthwhile to note that the notion of “probability” is difficult to grasp. There exist different 
and, in fact, controversial definitions of what is meant by “the probability” of certain events. 
Here, we will sidestep these complications by assuming that the probabilities of certain basic 
events are a priori known (e.g. the probability of individual faces when throwing dice). The 
task of probability theory then is to describe the probability of more complex, compound events
(e.g. the probability of winning a certain game of dice). The probability $P$ that a basic variable
$X$ takes values $x_i$, $i = 1, \dots, N$ (or continuous values $x \in [a,b]$) is described by the probability
distribution,

    discrete: $p_i \equiv P(X = x_i)$, $p_i \geq 0$, $\sum_{i=1}^{N} p_i = 1$,

    continuous: $p(x)\,dx \equiv P(x < X < x + dx)$, $p(x) \geq 0$, $\int_a^b p(x)\,dx = 1$.

A variable whose defining properties are encapsulated in a probability distribution is called a
random variable. To avoid discrete/continuous case distinctions, we may express the distribution
of a discrete random variable in continuum form, $p(x)\,dx = \sum_i p_i\,\delta(x - x_i)\,dx$. Furthermore, we
shall formulate our discussion for scalar random variables. The extension to multi-dimensional
variables $X \in \mathbb{R}^n$ should be obvious.

Expectation values of functions $f(X)$ depending on the random variable are defined as

$$ \langle f(X)\rangle = \int dx\, p(x)\, f(x). $$

Below we list a number of important examples of expectation values:

The mean value $\langle X\rangle = \int dx\, p(x)\, x$ of the distribution and its moments $X_n \equiv \langle X^n\rangle =
\int dx\, p(x)\, x^n$. Notice that the moments of distributions need not necessarily exist. For example,
the Lorentzian distribution $p(x) = \frac{1}{\pi}\frac{1}{1+x^2}$ does not have moments at all.

The cumulants

    $\mu_1 = X_1$,
    $\mu_2 = X_2 - X_1^2$,
    $\mu_3 = X_3 - 3X_1X_2 + 2X_1^3$,
    $\mu_4 = \dots$                                                    (10.1)

contain information about the degree of “correlation” in a probability distribution. They
describe the way in which high moments differ from products of moments of lesser degree.
A systematic way to define cumulants is by expansion of the cumulant generating function:
$\ln\langle\exp(itX)\rangle = \sum_{n=1}^{\infty} \frac{(it)^n}{n!}\mu_n$. Exercise: Think about connections between cumulants and the
connectedness of diagrams resulting from the expansion of free energies (symbolic notation)
$F = -T\ln\left(\int D\phi\,\exp(-S[\phi])\right)$.
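The relations (10.1) are easily checked numerically. The following Python sketch (the function names and the fair-die example are illustrative assumptions of ours, not part of the text) computes the first three cumulants of a discrete distribution from its raw moments, using exact rational arithmetic:

```python
from fractions import Fraction


def raw_moments(values, probs, orders=(1, 2, 3)):
    """Raw moments X_n = <X^n> of a discrete distribution."""
    return tuple(sum(p * v**n for v, p in zip(values, probs)) for n in orders)


def cumulants_from_moments(x1, x2, x3):
    """First three cumulants expressed through the raw moments, cf. Eq. (10.1)."""
    mu1 = x1
    mu2 = x2 - x1**2
    mu3 = x3 - 3 * x1 * x2 + 2 * x1**3
    return mu1, mu2, mu3


# Illustrative example: a fair six-sided die
values = [Fraction(k) for k in range(1, 7)]
probs = [Fraction(1, 6)] * 6
die_cumulants = cumulants_from_moments(*raw_moments(values, probs))
print(die_cumulants)
```

For the die, the third cumulant vanishes because the distribution is symmetric about its mean.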

The probability distribution may be formally represented as the expectation value of a
$\delta$-distribution,

$$ p(x) = \langle\delta(x - X)\rangle. $$

Far from being a useless tautology, this relation represents a vehicle to generate derived
probability distributions: let $F(X)$ be some function and define $Y \equiv F(X)$. This is a new
random variable which inherits its probability distribution $p(y)$ from $p(x)$. The connection
between the two can be obtained as

$$ p(y) = \langle\delta(y - Y)\rangle = \int dx\, p(x)\,\delta(y - F(x)) = p(x(y))\left|\frac{dx}{dy}\right|, $$

where we assumed a unique functional relation $y = F(x)$. This relation may also be obtained
by direct transformation of the probability measure, $p(x)\,dx = p(x(y))\,|dx/dy|\,dy = p(y)\,dy$. In
practice, however, the $\delta$-representation is often more convenient to use.
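The transformation rule $p(y) = p(x(y))\,|dx/dy|$ can be illustrated with a simple example. The sketch below assumes $X$ uniformly distributed on $(0,1)$ and $Y = X^2$ (our choice of example, not taken from the text), for which the rule gives $p(y) = 1/(2\sqrt{y})$. It compares the probability of $Y \leq y_{\max}$ predicted from this density with a Monte Carlo estimate; all names are hypothetical.

```python
import math
import random


def transformed_density(p_x, x_of_y, dx_dy):
    """Return p(y) = p(x(y)) |dx/dy| for a monotonic map y = F(x)."""
    return lambda y: p_x(x_of_y(y)) * abs(dx_dy(y))


# X uniform on (0, 1); Y = X**2, hence x(y) = sqrt(y) and dx/dy = 1 / (2 sqrt(y))
p_x = lambda x: 1.0 if 0.0 < x < 1.0 else 0.0
p_y = transformed_density(p_x, math.sqrt, lambda y: 0.5 / math.sqrt(y))


def predicted_fraction(y_max, n_steps=100_000):
    """Midpoint-rule integral of p(y) from 0 to y_max."""
    h = y_max / n_steps
    return sum(p_y((k + 0.5) * h) for k in range(n_steps)) * h


def empirical_fraction(y_max, n_samples=200_000, seed=0):
    """Fraction of sampled values Y = X**2 that do not exceed y_max."""
    rng = random.Random(seed)
    hits = sum(1 for _ in range(n_samples) if rng.random() ** 2 <= y_max)
    return hits / n_samples


print(predicted_fraction(0.25), empirical_fraction(0.25))
```

Both numbers should lie close to $\sqrt{y_{\max}}$, the exact cumulative probability for this example.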

The full set of information on a random variable is contained in the expectation value

$$ g(t) = \langle\exp(itX)\rangle. \qquad (10.2) $$

From this generating function, all moments can be obtained by power series expansion and
all cumulants by expansion of $\ln g(t)$:

$$ g(t) = \sum_{n=0}^{\infty} \frac{(it)^n}{n!} X_n, \qquad \ln g(t) = \sum_{n=1}^{\infty} \frac{(it)^n}{n!} \mu_n. $$

The full probability distribution can be obtained (exercise) from the generating function by
Fourier transformation,

$$ p(x) = \int \frac{dt}{2\pi}\, e^{-itx} g(t). $$

For concreteness and later reference let us discuss a few distributions of outstanding importance.
The Gaussian distribution is given by the familiar expression

$$ p(x)\,dx = \frac{1}{\sqrt{2\pi\mu_2}} \exp\left(-\frac{(x - \mu_1)^2}{2\mu_2}\right) dx. \qquad (10.3) $$

Typically, it describes the probability of variables $x = \sum_{n=1}^{N} x_n$ that are obtained as the sum of
a large number of elementary random variables, $x_n$, distributed according to some distribution $\tilde p$
with first and second cumulant $\tilde\mu_1$ and $\tilde\mu_2$, respectively. In this case - the statement of the central
limit theorem$^2$ - the probability of $x$ is given by Eq. (10.3), with first and second cumulant
$\mu_1 = N\tilde\mu_1$ and $\mu_2 = N\tilde\mu_2$, respectively. (All higher cumulants of the Gaussian distribution
vanish. Exercise: verify this statement by showing that the generating function of the Gaussian
distribution is again a Gaussian.) The ubiquity of additive random variables in nature explains
the importance of Gaussians in natural sciences.
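The statement of the central limit theorem lends itself to a quick numerical experiment. The following sketch assumes elementary variables uniformly distributed on $(0,1)$, with cumulants $1/2$ and $1/12$ (an illustrative choice of ours), and checks that the sum of $N$ of them has mean and variance close to $N/2$ and $N/12$, and that the fraction of samples within one standard deviation of the mean is close to the Gaussian value. Function names and sample sizes are assumptions.

```python
import math
import random


def sample_sums(n_terms, n_samples, seed=1):
    """Draw n_samples realizations of x = x_1 + ... + x_N, x_n uniform on (0, 1)."""
    rng = random.Random(seed)
    return [sum(rng.random() for _ in range(n_terms)) for _ in range(n_samples)]


def mean_and_variance(data):
    """Sample estimates of the first and second cumulant."""
    mean = sum(data) / len(data)
    var = sum((x - mean) ** 2 for x in data) / len(data)
    return mean, var


N = 50
sums = sample_sums(N, 20_000)
mu1, mu2 = mean_and_variance(sums)

# Prediction of the central limit theorem: cumulants add up
expected_mu1, expected_mu2 = N * 0.5, N / 12.0

# Fraction of samples within one standard deviation of the mean
inside = sum(
    1 for x in sums if abs(x - expected_mu1) <= math.sqrt(expected_mu2)
) / len(sums)
gaussian_value = math.erf(1.0 / math.sqrt(2.0))

print(mu1, expected_mu1, mu2, expected_mu2, inside, gaussian_value)
```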

The Poisson distribution $p(m)$ describes the probability to observe $m$ events in $n \gg 1$ trials
when the events are mutually uncorrelated$^3$ and occur with low individual probability $p$. (As
an example, consider the probability to count $m$ cars passing a counter at a cross country road
late at night in a time window of 100 min when the passage probability per minute is $p = 0.05$.)


$^2$ The proof of the central limit theorem amounts to an instructive miniature version of the large $N$ expansion
methods introduced in chapter 5.3:

$$ p(x) = \Big\langle \delta\Big(x - \sum_n x_n\Big) \Big\rangle = \int \prod_n dx_n\, \tilde p(x_n) \int \frac{dk}{2\pi}\, e^{ik(\sum_n x_n - x)} $$
$$ \qquad = \int \frac{dk}{2\pi} \exp\Big( N \ln \int dx\, \tilde p(x) e^{ikx} - ixk \Big) = \int \frac{dk}{2\pi} \exp\big( N \ln g(k) - ixk \big) $$
$$ \qquad = \int \frac{dk}{2\pi} \exp\Big( N \big( ik(\tilde\mu_1 - x/N) - k^2 \tilde\mu_2/2 + O(k^3) \big) \Big) $$
$$ \qquad \simeq \frac{1}{\sqrt{2\pi N \tilde\mu_2}} \exp\Big( -\frac{(x - N\tilde\mu_1)^2}{2N\tilde\mu_2} \Big), $$

where in the first line we used the Fourier representation of the $\delta$-function, $g$ in the second line is the generating
function of the distribution $\tilde p$, and in the crucial last equality we used the fact that anharmonic corrections to
the quadratic $k$-exponent vanish in the large $N$ limit.

$^3$ Two random variables $X$ and $Y$ are said to be uncorrelated if $\langle XY\rangle = \langle X\rangle\langle Y\rangle$. The correlation obviously
vanishes if $X$ and $Y$ are independent random variables, i.e. if $p(x, y) = p_x(x)\,p_y(y)$. (The opposite conclusion,
“lack of correlation $\Rightarrow$ independence,” is not valid in general.)
Assuming that the probability to observe the event in question at any individual trial is $p \ll 1$
while $n \gg m$, the Poisson distribution asymptotes to$^4$

$$ p(m) \simeq \frac{(np)^m}{m!} \exp(-np). \qquad (10.4) $$

The Poisson distribution has constant cumulants $\mu_l = np$. (Exercise: compute the generating
function to prove this result.)
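To get a feeling for the quality of the Poisson approximation, one may compare it with the exact binomial probabilities. The sketch below does so for the car-counting example quoted above ($n = 100$ one-minute windows, $p = 0.05$); the function names are our own, and only the standard library is used.

```python
import math


def binomial_pmf(m, n, p):
    """Exact probability of m successes in n independent trials."""
    return math.comb(n, m) * p**m * (1 - p) ** (n - m)


def poisson_pmf(m, mean):
    """Poisson probability (np)^m exp(-np) / m! with mean = n p."""
    return mean**m / math.factorial(m) * math.exp(-mean)


n, p = 100, 0.05
max_deviation = max(
    abs(binomial_pmf(m, n, p) - poisson_pmf(m, n * p)) for m in range(n + 1)
)
print(max_deviation)
```

The largest deviation occurs near the maximum of the distribution, $m \approx np$, and shrinks as $n$ grows at fixed product $np$.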

The Lorentzian distribution, a.k.a. Breit-Wigner distribution or Cauchy distribution,
is defined by

$$ p(x) = \frac{1}{\pi} \frac{a}{(x - x_0)^2 + a^2}. \qquad (10.5) $$

In physics, the Lorentzian distribution describes the energy dependence of scattering resonances,
(relatedly) the broadening of many particle spectral functions by interactions, the line shape
distribution of damped electromagnetic modes, and many other phenomena governed by an interplay
of driving/oscillation/damping. As mentioned above, its moments are undefined. Loosely
speaking, the distribution is centered around $x_0$, has width $a$, and is exceptionally broad.

In probability theory one is often interested in computing the composite probability $p(x_2, x_1)$
from information about the probabilities $p_1(x_1)$ and $p_2(x_2)$ of elementary random variables $X_1$
and $X_2$. In this context, a natural question to ask is “what is the probability to obtain $x_2$ provided
$x_1$ has been observed?” By definition, the answer is given by the conditional probability,
$p(x_2|x_1)$. It is implicitly defined by

$$ p(x_2, x_1) = p(x_2|x_1)\, p_1(x_1). \qquad (10.6) $$

Summing over all possible realizations of the random variable $X_1$, we get the raw distribution
of $X_2$, $\int dx_1\, p(x_2, x_1) = p(x_2)$, or

$$ p(x_2) = \int dx_1\, p(x_2|x_1)\, p_1(x_1). $$

Considering the case where $x_1$ by itself is a composite random variable, $x_2 \to x_n$ and $x_1 \to
(x_{n-1}, \dots, x_1)$, we obtain an important generalization of Eq. (10.6). Iteration of the definition
then leads to

$$ p(x_n, \dots, x_1) = p(x_n|x_{n-1}, \dots, x_1)\, p(x_{n-1}|x_{n-2}, \dots, x_1) \cdots p(x_2|x_1)\, p_1(x_1). \qquad (10.7) $$
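The bookkeeping behind the definition of conditional probabilities is perhaps most transparent in a discrete setting. The following sketch uses a hypothetical two-state example (the state labels and numerical values are assumptions made purely for illustration) to build the joint distribution from $p_1(x_1)$ and $p(x_2|x_1)$, and to recover the distribution of $x_2$ by summation over $x_1$:

```python
from fractions import Fraction as F

# p1(x1): distribution of the first variable (hypothetical values)
p1 = {"a": F(1, 4), "b": F(3, 4)}

# p(x2 | x1): conditional distribution of the second variable (hypothetical values)
cond = {
    "a": {"u": F(1, 2), "d": F(1, 2)},
    "b": {"u": F(1, 3), "d": F(2, 3)},
}

# Joint distribution p(x2, x1) = p(x2 | x1) p1(x1)
joint = {(x2, x1): cond[x1][x2] * p1[x1] for x1 in p1 for x2 in cond[x1]}

# Distribution of x2, obtained by summing the joint distribution over x1
p2 = {}
for (x2, x1), weight in joint.items():
    p2[x2] = p2.get(x2, 0) + weight

print(p2)
```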


$^4$ The probability to observe $m$ successes is given by $P(m) = \binom{n}{m} (1 - p)^{n-m} p^m$. For $n \gg 1$ and $p \ll 1$,
$\binom{n}{m} \simeq n^m/m!$ and $(1 - p)^{n-m} \simeq (1 - p)^n = (1 - (np)/n)^n \simeq \exp(-np)$. Combining these estimates, we obtain
the Poisson distribution. (Exercise: show that the derivation becomes exact in the limit $n \to \infty$, $p \to 0$ at fixed
product $np$. Show that for $n \to \infty$ at fixed $p$ the Gaussian distribution is obtained.)

EXERCISE The characteristic function of a probability distribution $p(x)$ is defined as
$g(t) = \langle e^{itX}\rangle = \int dx\, p(x)\, e^{itx}$.
Compute the characteristic function for (a) the Gaussian distribution, Eq. (10.3),
(b) the Lorentzian distribution, Eq. (10.5), and (c) the Poisson distribution, Eq. (10.4).
Consider the distribution $p(x) = \frac{1}{\pi\sqrt{x(1-x)}}$ for $x \in (0, 1)$ and zero otherwise. Show
that its characteristic function has the limiting behavior

$$ g(t) \simeq \frac{(1 - i)(i + e^{it})}{\sqrt{2\pi t}} $$

for $t \to \infty$.
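Before attempting the analytical calculation, the asymptotic formula can be tested numerically. The sketch below assumes that the distribution in question is $p(x) = 1/(\pi\sqrt{x(1-x)})$ on $(0,1)$; the substitution $x = (1 - \cos\theta)/2$ (our choice) turns $p(x)\,dx$ into $d\theta/\pi$ and removes the integrable endpoint singularities, so that a simple midpoint rule suffices. Function names and the number of grid points are assumptions.

```python
import cmath
import math


def characteristic_function(t, n_points=4000):
    """g(t) for p(x) = 1/(pi sqrt(x(1-x))) on (0, 1), using x = (1 - cos(theta))/2."""
    h = math.pi / n_points
    total = sum(
        cmath.exp(1j * t * 0.5 * (1.0 - math.cos((k + 0.5) * h)))
        for k in range(n_points)
    )
    return total * h / math.pi


def asymptotic_form(t):
    """Large-t behavior (1 - i)(i + e^{it}) / sqrt(2 pi t) quoted in the exercise."""
    return (1 - 1j) * (1j + cmath.exp(1j * t)) / math.sqrt(2 * math.pi * t)


t = 200.0
print(characteristic_function(t), asymptotic_form(t))
```

A numerical agreement of this kind does not replace the derivation, but it is a useful check on signs and prefactors.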


10.1 Fundamental questions of (nonequilibrium) statistical mechanics 

From the discussion above, it should be obvious that thermodynamic equilibrium represents
an exception, rather than a rule. The main reason why many particle theory has focused
historically on the description of (near-)equilibrium states is that the latter are so much
easier to describe. In thermal equilibrium we know that the state of an $N$-particle system is
governed by the thermal density operator $Z^{-1}\exp(-\beta H)$. “All” that remains to be done is
to compute physical observables from this distribution. However, in thermal nonequilibrium
the distribution of the system in phase or Hilbert space is a priori unknown, a lack of
knowledge that bears important consequences:

> Getting the relevant many particle distribution functions under control is an essential 
part of the theoretical challenge (and usually the first one to be addressed). 

> Concepts which we tend to take for granted (the existence of a uniquely defined temperature,
homogeneity of thermodynamic variables, etc.) need to be critically re-examined.

> There is no reason to hope that nonequilibrium statistical physics as a whole (whatever 
that is) will be nearly as universal as the equilibrium theory. 

Imagine, then, a situation where no a priori information about the state of a many particle
system is available. How can we compute the distribution of its states (classical or quantum)?
Indeed, even the formulation of the problem that needs to be solved is not entirely obvious.

In principle, the state of a classical $d$-dimensional $N$-particle system can be represented
by a point $X \in \mathbb{R}^{2Nd}$ in $2Nd$-dimensional phase space. The full information about its dynamics,
and all derived physical properties, is then contained in some high-dimensional Hamiltonian
equation of motion.

The above view is formally exact, but largely useless in practice. However, in the late
nineteenth century, Boltzmann introduced an alternative picture which turned out to be
much more powerful: Boltzmann suggested describing the system in terms of a “swarm” of
$N$ points in $2d$-dimensional phase space, rather than by a single point in an incredibly
high-dimensional space. This idea paved the way to a statistical formulation of the problem and
may be considered as the starting point of statistical mechanics as an independent branch
of physics. Indeed, we may trade much of the excessive fine structure information encoded
in the full coordinate data of the “swarm” by introducing a measure

$$ f(x, t)\, d^dx, $$

where $x = (q, p)$ is a point in phase space and the dimensionless function $f(x, t)$ is
the (distribution of the) number of particles found at time $t$ in the volume element
$d^dx = \prod_{i=1}^{d} dq_i\, dp_i$ (see the figure), i.e. $f$ is normalized according to

$$ \int_\Gamma d^dx\, f(x) = N, $$

where $\int_\Gamma$ is an integral over classical phase space. The function $f$ is the prime information
carrier of the theory. Average values of physical observables $X$ can be calculated according
to the relation,

$$ \langle X\rangle = \int d^dx\, f(x, t)\, X(x), $$

where $X(x)$ is the phase space function representing the observable. For example, the energy
carried by the system is obtained as $E = \langle H\rangle = \int d^dx\, f(x, t)\, H(x)$, where $H(x)$ is the
Hamiltonian, etc.

[Figure: a swarm of points in the (q, p) phase plane, with a volume element dq dp marked.]

Ludwig Boltzmann 1844-1906
Austrian physicist famous for his pioneering contributions to statistical mechanics. Concepts
such as Maxwell-Boltzmann statistics, the Boltzmann distribution, or the logarithmic connection
between entropy and probability remain foundations of this field. Boltzmann was one of the
most important supporters of early atomic theory, at a time when the reality of atoms was
still controversial.

Since the state of the system is essentially described by its distribution function, we can
reformulate the questions raised above as follows: How can we compute $f$ for a given system
if the Hamiltonian and an initial state $f(\,\cdot\,, t = 0)$ are specified? However, before addressing
this problem in full, we should be able to answer two less ambitious questions: what is the
equilibrium structure of $f$, and how does the system approach that equilibrium configuration
if it is allowed to evolve unperturbed according to its own dynamics?

To begin, let us try to understand the (time-independent) equilibrium form of the distribution
function, $f(x)$. Imagine a system prepared in some initial distribution. Think of
the long-time density function $f(x, t \to \infty)$ as a probabilistic object where averaging is over
different initial conditions. We assume that the system is weakly interacting, or gaseous, in
the sense that the dynamics is essentially of single particle type. Collisions between particles,
$(x_1, x_2) \to (x_1', x_2')$, will serve as a mechanism of relaxation towards an equilibrium state by
exchange of energy and momentum between individual particles. Under these conditions,
it is safe to assume that the distribution of particles will be independent in the sense that
the joint probability to observe particles at $x_1$ and $x_2$, respectively, $N^2 p(x_1, x_2)$, factorizes
into independent particle distributions, $N^2 p(x_1, x_2) = (N p(x_1)) \times (N p(x_2)) \equiv f(x_1) f(x_2)$.
By virtue of this assumption, the function $f$ acquires the status of a statistical distribution,
fully describing the state of the system.

EXERCISE Discuss: in what sense may this description become problematic once interactions
become strong?
The fact that coordinate configurations $(x_1, x_2) \leftrightarrow (x_1', x_2')$ are coupled by an elementary
scattering event implies conservation of probability, $f(x_1, x_2) = f(x_1) f(x_2) = f(x_1') f(x_2') =
f(x_1', x_2')$. Assuming that $f(x) = f(H(x)) \equiv f(\epsilon)$ is a function of energy, we conclude
that probability conservation is compatible with energy conservation, $\epsilon_1 + \epsilon_2 = \epsilon_1' + \epsilon_2'$,
if $\ln f(\epsilon) = a\epsilon + b$ is linear in energy. In this case, $f(x_1) f(x_2) = \exp(a(\epsilon_1 + \epsilon_2) + 2b) =
\exp(a(\epsilon_1' + \epsilon_2') + 2b) = f(x_1') f(x_2')$ is indeed satisfied. To fix the constants $a$ and $b$ we
require normalization of the partition function and employ the equipartition theorem, i.e.
we use the fact that in thermodynamic equilibrium, the expectation value of the energy
of each of the $2Nd$ degrees of freedom must equal $T/2$.$^5$ Normalization requires (exercise)
$b = \ln N - \ln \int_\Gamma d^dx\, \exp(a\epsilon)$ and from the equipartition theorem we obtain

$$ NdT = N\, \frac{\int_\Gamma d^dx\, e^{aH(x)} H(x)}{\int_\Gamma d^dx\, e^{aH(x)}} = -\frac{Nd}{a} \quad\Rightarrow\quad a = -\frac{1}{T}, $$

where, again, a quadratic Hamiltonian was assumed. We thus arrive at the conclusion:

$$ f(x) = N\, \frac{e^{-\beta H(x)}}{\int_\Gamma d^dx\, e^{-\beta H(x)}}, \qquad (10.8) $$

which is the famous Maxwell-Boltzmann distribution. Exercise: in what sense is
Eq. (10.8) a direct descendant of the many particle Gibbs distribution?
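The equipartition statement used above is readily verified by direct numerical integration. The following sketch assumes a single quadratic degree of freedom, $H = p^2/2m$, with illustrative parameter values of our choosing, and evaluates the Boltzmann-weighted average of the energy on a uniform grid; the result should be close to $T/2$. All names are hypothetical.

```python
import math


def boltzmann_average(observable, energy, temperature, x_max=50.0, n_points=200_001):
    """<O> = sum O(x) exp(-H(x)/T) / sum exp(-H(x)/T) on a uniform grid in [-x_max, x_max]."""
    h = 2 * x_max / (n_points - 1)
    numerator = denominator = 0.0
    for k in range(n_points):
        x = -x_max + k * h
        weight = math.exp(-energy(x) / temperature)
        numerator += observable(x) * weight
        denominator += weight
    return numerator / denominator


T, m = 1.7, 2.0                          # illustrative temperature and mass
kinetic = lambda p: p * p / (2 * m)      # quadratic Hamiltonian of one degree of freedom
mean_kinetic = boltzmann_average(kinetic, kinetic, T)
print(mean_kinetic, T / 2)
```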

The derivation above can tell us the form of the equilibrium distribution, but it is much 
too indirect to describe how equilibrium is actually approached. For that purpose we need 
to develop far more explicit ways to describe the dynamics of the distribution function. The 
construction of such theories is the subject of the next two sections. 


10.2 Langevin theory 

Imagine a situation where, at time t = 0, some initial distribution /(x, 0) has been prepared. 
We wish to understand its dynamics. Specifically, we wish to understand how the Maxwell- 
Boltzmann distribution is approached if the system is kept in isolation. It will also be of 
interest to explore deviations from equilibrium if the system is externally perturbed. 

A purist might argue that phase space points $x \to x(t) \equiv \exp(t\{H,\,\cdot\,\})x$ develop according
to the deterministic time evolution of Hamiltonian dynamics. This view will lead to
$f(x, t) = f(x(-t), 0)$, or $\partial_t f(x, t) = \partial_s|_{s=0} f(x, t + s) = -\partial_s|_{s=0} f(x(s), t) = \{H, f(x, t)\}$.
Deterministic time evolution is thus described by the equation,

$$ (\partial_t - \{H,\,\cdot\,\}) f(x, t) = 0. \qquad (10.9) $$

Now, this equation clearly does not capture the irreversible approach to equilibrium. What 
it neglects is that we are actually concerned with a many particle system. The presence of 
other particles will affect the dynamics in two different ways. 


$^5$ Here we assume that, close to thermal equilibrium, the dynamics of each particle will be approximately
oscillator-like, and energy comprises kinetic energy ($d$ momentum degrees of freedom) and potential energy
($d$ spatial degrees of freedom).
First, it will hinder the ballistic motion of individual particles and, in a coarse-grained
perspective, will lead to friction. Second, repeated collisions with other particles will
effectively act as a fluctuating force. As we shall see, these two mechanisms, dissipative
friction and fluctuations, are not independent. In thermal equilibrium they do, in fact,
completely determine each other, a correlation we will identify as a manifestation of the
fluctuation-dissipation theorem.

In a seminal work$^6$ the French physicist Paul Langevin proposed to describe the concerted
action of dissipation and fluctuation in terms of a stochastic generalization of Newton’s
equation, the celebrated Langevin equation,


Paul Langevin 1872-1946

French physicist who developed the concept of Langevin dynamics. He is also known for his
modern interpretation of para- and diamagnetism in terms of the electron spin. Langevin
was a devoted anti-fascist and lost his academic position under the Vichy regime. (Shortly
before his death he was rehabilitated.)



$$ m\, d_t v + m\gamma v - F = \xi(t), \qquad (10.10) $$

where $v$ is the single particle velocity, $F = -\partial_q H$ a macroscopic force acting on the particles,
$\gamma$ a phenomenological friction coefficient, and $\xi$ a randomly fluctuating force describing
the effect of erratic pair collisions. Langevin proposed to model the force in terms of a
short-range correlated Gaussian random variable with zero mean, $\langle\xi_i(t)\rangle = 0$, and variance

$$ \langle\xi_i(t)\,\xi_{i'}(t')\rangle = A\,\delta_{ii'}\,\delta(t - t'), \qquad (10.11) $$

where $A > 0$ is a constant.

INFO The development of Langevin’s theory was motivated by the phenomenon of Brown- 
ian motion. In 1827 the Scottish botanist, Robert Brown, observed 7 unceasing erratic motion 
of pollen particles in aqueous immersion. First qualitative explanations of the phenomenon in 
terms of random particle collisions appeared in the late nineteenth century. Langevin suggested 
Eq. (10.10) as an effective equation of motion controlling the dynamics of “mesoscopic” particles, 
subject to inter-particle collisions. The erratic nature of solutions of the Langevin equation - 
qualitatively consistent with Brownian motion - is illustrated in Fig. 10.1. 

Before turning to the analysis of this equation, let us make a few general remarks: 

> One of the important predictions of the theory is that, in thermal equilibrium, the friction
coefficient, $\gamma$, and the fluctuation strength, $A$, are mutually dependent parameters:
fluctuations and dissipative damping are twin effects - again, a manifestation of the
fluctuation-dissipation theorem.


$^6$ P. Langevin, Sur la théorie du mouvement Brownien, CR Hebd. Acad. Sci. 146, 530-3 (1908).

$^7$ The phenomenon had, in fact, been observed earlier by the Dutch physician Jan Ingenhousz in 1785.
Figure 10.1 A few two-dimensional solutions of Eq. (10.10) for different fluctuation force
configurations. The paths start at time $t = 0$ at $x = y = 0$. Qualitatively, they resemble the paths traced
out by fictitious “Brownian particles”.


> We may expect multiple particle collisions to cause effectively stochastic, or diffusive
motion of individual particles. Indeed, we will identify the inverse $A^{-1}$ as an effective
measure of the diffusivity of the medium.

> A specific solution $q(t)$ of the Langevin equation defines a realization of what is called a
stochastic process. A stochastic process (cf. the more comprehensive discussion in Section
10.4 below) is described by a time-dependent random variable $X(t)$. Here, “random”
means that no deterministic statements about the time dependence of $X$ can be made.
The best we can obtain is results on the probability distribution $P(X(t))$ of realizations
of $X$. In the present context there are two alternatives: we may discuss the Langevin
equation in terms of $v[\xi]$, i.e. specific realizations of the velocity, randomly dependent on
the realization $\xi$. Suitable averages over $\xi$ will then lead to a statistical description of
velocity distributions, etc. This is the strategy originally proposed by Langevin. Alternatively,
we may aim to describe the velocity distribution $P[v]$ directly in terms of certain
statistical propositions. This latter strategy was proposed by Einstein in his famous work
on Brownian dynamics (see below).


10.2.1 Fluctuation-Dissipation Theorem (FDT) 

To discuss the connection between dissipation and fluctuation in an unperturbed system,
we consider the Langevin equation in the absence of driving forces, $\mathbf F = 0$. Temporal Fourier
transformation, $\mathbf v(\omega) = \int dt\, e^{i\omega t}\,\mathbf v(t)$, yields the formal solution
$\mathbf v(\omega) = \frac{1}{m(-i\omega+\gamma)}\,\boldsymbol\xi(\omega)$.

According to this relation, the “induced” random variable $\mathbf v$ has zero mean, $\langle v_i(t)\rangle = 0$,
and second moment (exercise)

$\langle v_i(t)^2\rangle = \frac{A}{m^2}\int_{-\infty}^{\infty}\frac{d\omega}{2\pi}\,\frac{1}{\omega^2+\gamma^2} = \frac{A}{2m^2\gamma}$.

However, in thermal equilibrium, the equipartition theorem establishes a connection between
the average kinetic energy per degree of freedom and temperature, $m\langle v_i^2\rangle/2 = T/2$.
Comparison with the result above then leads to the Einstein relation,

$A = 2m\gamma T$, (10.12)

i.e. in thermal equilibrium, the variance of environmental fluctuating forces is proportional
to the strength of dissipative (frictional) forces, and to temperature.
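
The balance expressed by Eq. (10.12) can be probed numerically. The following is a minimal sketch, not part of the original derivation: it integrates a single velocity component of the force-free Langevin equation with a simple Euler-Maruyama step (our choice of scheme), fixes the noise strength by $A = 2m\gamma T$, and compares the time-averaged $m\langle v^2\rangle$ with $T$. The function name and all parameter values are illustrative assumptions.

```python
import math
import random

def simulate_velocity_variance(m=1.0, gamma=1.0, T=0.5, dt=0.01,
                               n_steps=400_000, seed=0):
    """Time-averaged <v^2> for m dv/dt + m*gamma*v = xi (one component).

    The noise strength A is fixed by the Einstein relation A = 2*m*gamma*T.
    Discretization: v -> v - gamma*v*dt + sqrt(A*dt)/m * N(0, 1).
    """
    rng = random.Random(seed)
    A = 2.0 * m * gamma * T
    kick = math.sqrt(A * dt) / m       # standard deviation of one velocity kick
    v = 0.0
    burn_in = n_steps // 10            # discard the initial transient
    acc = 0.0
    count = 0
    for step in range(n_steps):
        v += -gamma * v * dt + kick * rng.gauss(0.0, 1.0)
        if step >= burn_in:
            acc += v * v
            count += 1
    return acc / count

m, T = 1.0, 0.5
v2 = simulate_velocity_variance(m=m, T=T)
print(f"m<v^2> = {m * v2:.3f}   (equipartition predicts T = {T})")
```

For the assumed parameters the estimate should agree with $T$ up to statistical and time-step errors of a few percent; changing $\gamma$ while keeping $A = 2m\gamma T$ should leave the result unchanged.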


INFO Let us briefly review Einstein's derivation of Eq. (10.12). Einstein's approach is
phenomenological but contains additional information inasmuch as it establishes a connection
between the strength of fluctuation forces and the concept of diffusion. The basis of the argument
is that, in a medium governed by frequent inter-particle collisions, the dynamics of individual
particles will be diffusive. Similarly, externally applied forces will cause drift motion, rather
than free ballistic acceleration. The first postulate implies that a density gradient, $\nabla f$, in the
medium will lead to a diffusion current,

$\mathbf j_d = -D\,\nabla f$, (10.13)

i.e. a current acting to restore a uniform density profile. Equation (10.13) is generally known
as Fick's (first) law. Application of the continuity equation $\partial_t f = -\nabla\cdot\mathbf j_d$ then shows that
$\partial_t f = D\,\Delta f$ (Fick's second law), implying that the dynamics is indeed diffusive.

The second postulate states that $\mathbf j_{\rm ext} = \gamma^{-1} m^{-1} f\,\mathbf F_{\rm ext}$, i.e. an external force generates a
drift current proportional to the force, the density, and the inverse of the friction coefficient.
(This formula may be obtained by dimensional analysis, or by consideration of the stationary
configuration, $d_t\langle\mathbf v\rangle = 0$, obtained by averaging the Langevin equation (10.10).) In thermal
equilibrium, diffusion current and external current must compensate each other, i.e. $\mathbf j_d = -\mathbf j_{\rm ext}$,
or $D\,\nabla f = -f\,\nabla V/m\gamma$, where we assumed that the force is generated by some potential, $V$. This
equation for the density profile is solved by $f \sim \exp(-V/Dm\gamma)$. However, compatibility with
the Maxwell-Boltzmann distribution (10.8) requires that $f \sim \exp(-V/T)$, or

$D = \frac{T}{m\gamma}$. (10.14)

Equation (10.14) is the celebrated Einstein relation. Comparison with Eq. (10.12) finally leads
to the identification,

$A = \frac{2T^2}{D}$, (10.15)

i.e. the variance of the (microscopic) fluctuation forces in the medium is inversely proportional
to its (macroscopic) diffusion constant.

Figure 10.2 Three realizations of resonator modes subject to the interplay of fluctuations and
dissipation. Left: RLC-electronic circuit; Middle: a resonator mode subject to radiation losses;
Right: damped mechanical oscillator.

10.2.2 A brief compendium on noise 

Our discussion above conveys the important message that dissipation or friction is intimately
linked to the presence of fluctuating forces. However, unlike the everyday phenomenon
of friction, the corresponding fluctuation forces are usually less noticeable. The
reason is that friction acts in a directed way (cf. the action of a brake), while the response
caused by fluctuations tends to average out. Yet, there are important exceptions to this rule.
For example, high amplification levels render the fluctuations, or noise, caused by resistive
elements in electronic circuits a quite noticeable (and usually unwelcome) side effect.

Figure 10.2 shows three prominent setups where the interplay of fluctuations and dissipation
is operational: an electronic RLC-circuit, an electromagnetic cavity mode experiencing
radiative losses, and a damped oscillator. All of these systems are characterised by idealized
non-dissipative dynamics that is harmonic. The presence of dissipation (resistor/radiative
losses/mechanical damping) generates fluctuations. In the case of macroscopic mechanical
oscillators, these fluctuations will be largely inconsequential. However, in the first two
systems they cause noticeable effects.


Johnson-Nyquist noise

Let us explore the situation in the example of the RLC-circuit. In this case, we are concerned
with the charge $Q$ transmitted through the circuit (from which current and voltage drop
across the resistive element are obtained as $I = \dot Q$ and $U = R\dot Q$, respectively). In elementary
courses, we learn that the dynamics of charge is governed by the equation,

$L\,d_t^2 Q + \frac{Q}{C} + R\,d_t Q = U_{\rm ext}$, (10.16)

where $U_{\rm ext}$ is the externally applied voltage. This is the equation of motion of a dissipatively
damped oscillator. However, in view of our discussion above, we know that Eq. (10.16)
neglects the effect of the fluctuations accompanying dissipation. A correct way to think
about Eq. (10.16) is as an equation for the coordinate $\langle Q\rangle$, averaged over realizations of
those fluctuations. A realization-specific generalization of Eq. (10.16) compatible with the
fluctuation-dissipation theorem is given by

$L\,d_t^2 Q + \frac{Q}{C} + R\,d_t Q = U_{\rm ext} + U_{\rm jn}(t)$, (10.17)

where the correlator of the noise term is given by

$\langle U_{\rm jn}(t)\rangle = 0$, $\langle U_{\rm jn}(t)\,U_{\rm jn}(t')\rangle = 2TR\,\delta(t-t')$. (10.18)

Figure 10.3 Left panel: Noise strength vs. resistance for wires made of different materials. Right
panel: Noise strength vs. temperature for the “advanced wire”. The two data sets display
near-perfect linearity, in agreement with the theory of Johnson-Nyquist noise. Data taken from
J. B. Johnson, Phys. Rev. 32, 97-109 (1928).

The noise term in Eq. (10.17) can be interpreted as a time-dependent fluctuating voltage, 
additional to the external voltage. It is important to realize that this type of noise is an 
inevitable consequence of the presence of resistive elements in electronic circuits; it cannot 
be removed by “improving the quality” of the device. 

The presence of a fluctuating voltage of strength $\langle|U_{\rm jn}(\omega)|^2\rangle = 2TR$ was experimentally
discovered by Johnson 8 (cf. Fig. 10.3). Nyquist 9 explained the phenomenon in terms of the
thermal equipartition of energy of electromagnetic oscillator modes. Reflecting this background,
$U_{\rm jn}$ is alternatively denoted Johnson noise, Nyquist noise, Johnson-Nyquist
noise, or just thermal noise.

8 J. B. Johnson, Thermal agitation of electricity in conductors, Phys. Rev. 32, 97-109 (1928).

9 H. Nyquist, Thermal agitation of electric charge in conductors, Phys. Rev. 32, 110-13 (1928).

INFO The strength of the noise correlator is fixed by slight modification of the arguments
used in Section 10.2.1 above: Think of a non-resistive LC-circuit as an electronic realization of
the harmonic oscillator. The $R = 0$, $U = 0$ variant of Eq. (10.17) is obtained by variation of the
“Lagrangian” action

$S[Q] = \int dt\,\Big(\frac{L}{2}\dot Q^2 - \frac{1}{2C}Q^2\Big)$,

where $L\dot Q^2/2$ and $Q^2/2C$ represent the kinetic (inductive) and potential (capacitive) energy,
respectively. We expect an interplay of fluctuation and dissipation to establish an energy balance
compatible with the equipartition theorem,

$\frac{L}{2}\langle\dot Q^2\rangle = \frac{1}{2C}\langle Q^2\rangle = \frac{T}{2}$,

i.e. $T/2$ for both kinetic and potential energy. This energy balance should be established no
matter how strong the dissipation ($R$) is. In the absence of external biasing, $U_{\rm ext} = 0$, the
temporal Fourier transform of the charge is given by

$Q(\omega) = \frac{U_{\rm jn}(\omega)}{i\omega R - \omega^2 L + C^{-1}}$.

Using Eqs. (10.17) and (10.18), it is then straightforward to verify that

$\frac{1}{2C}\langle Q^2\rangle = \frac{T}{2}$.

Similar reasoning shows that $L\langle\dot Q^2\rangle = T$.
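
The claim that the capacitive energy equals $T/2$ irrespective of the value of $R$ can be checked by direct numerical integration. The sketch below assumes that the charge variance follows from the noise strength $\langle|U_{\rm jn}(\omega)|^2\rangle = 2TR$ and the response function of the circuit as $\langle Q^2\rangle = \int\frac{d\omega}{2\pi}\,2TR\,|i\omega R - \omega^2 L + C^{-1}|^{-2}$. The function name, the frequency cutoff and the parameter values are illustrative choices.

```python
import math

def charge_variance(R, L, C, T, omega_max=200.0, n=200_000):
    """Trapezoidal estimate of <Q^2> = int dw/(2 pi) 2TR / |i w R - w^2 L + 1/C|^2."""
    def integrand(w):
        return 2.0 * T * R / ((1.0 / C - w * w * L) ** 2 + (w * R) ** 2)

    h = 2.0 * omega_max / n
    total = 0.5 * (integrand(-omega_max) + integrand(omega_max))
    for k in range(1, n):
        total += integrand(-omega_max + k * h)
    return total * h / (2.0 * math.pi)

L_ind, C_cap, T = 1.0, 1.0, 1.0
for R in (0.5, 5.0):                       # weak and strong damping
    q2 = charge_variance(R, L_ind, C_cap, T)
    print(f"R = {R}:  <Q^2>/(2C) = {q2 / (2.0 * C_cap):.4f}   (expected T/2 = {T / 2})")
```

Both the underdamped and the overdamped circuit should return the same capacitive energy, which illustrates that the resistance drops out of the equilibrium energy balance.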


Although the voltage correlation Eq. (10.18) is consistent with the classical equipartition
theorem, this very consistency generates an annoying problem: the second moment
$\langle U_{\rm jn}(t)^2\rangle \propto \delta(0)$ diverges, i.e. our analysis makes the unphysical prediction that a voltmeter
measuring noise amplitudes will detect voltage spikes of arbitrary strength. Equivalently,
we are making the unphysical prediction that the noise fluctuates with constant intensity
at arbitrarily high frequencies. (Expressed more formally, the noise power


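$S(U_{\rm jn},\omega) \equiv \lim_{\tau\to\infty}\frac{1}{\tau}\Big\langle\Big|\int dt\,U_{\rm jn}(t)\,e^{i\omega t}\Big|^2\Big\rangle = RT$, (10.19)

which is a measure of the fluctuation intensity at characteristic frequency $\omega$, remains constant.)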

This problem disappears if we replace the classical fluctuation energy of the harmonic
oscillator by the energy of a quantum oscillator, $2\times\frac{T}{2} = T \to \omega(e^{\omega/T}-1)^{-1}$. (Exercise:
explore this point.) For frequencies much larger than temperature, $\omega \gg T$, the mode does
not store energy and the problem with the diverging noise amplitude disappears. 10
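
To illustrate the cutoff, the following short sketch evaluates the quantum oscillator energy $\omega(e^{\omega/T}-1)^{-1}$ (units $\hbar = k_B = 1$, zero-point energy excluded) for a few ratios $\omega/T$. The function name and the sample frequencies are illustrative assumptions.

```python
import math

def mode_energy(omega, T):
    """Thermal energy of a quantum oscillator mode, omega / (exp(omega/T) - 1)."""
    return omega / math.expm1(omega / T)

T = 1.0
for omega in (0.01, 1.0, 10.0, 50.0):
    ratio = mode_energy(omega, T) / T
    print(f"omega/T = {omega / T:6.2f}   energy/T = {ratio:.3e}")
```

For $\omega \ll T$ the classical value $T$ is recovered, while for $\omega \gg T$ the stored energy is exponentially suppressed, so that the integrated noise power remains finite.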


Shot noise 

Johnson noise is not the only “inevitable” source of noise. In electronic devices, the discreteness
of charge quanta generates additional noise. A train of uncorrelated electrons running
down a wire is an example of a Poisson process described in Section 10.4.2. The average
number of electrons, $\langle n\rangle$, passing through the wire in a time interval $\Delta t$ (large in comparison
to the mean passage time) defines the mean current,

$\langle I\rangle = \frac{\langle n\rangle}{\Delta t}$.


10 Notice that the expression $\omega(e^{\omega/T}-1)^{-1}$ excludes the vacuum or zero-point energy, $\omega/2$. If the vacuum energy
is kept in the energy balance, the problem with the divergence in the integrated noise power reappears. One may
reason that the exclusion of the vacuum energy is justified because only physical transitions can participate in
dissipative processes. For a more satisfactory picture, see below.

However, it is a characteristic feature of Poisson processes that the variance in the number
of transmitted electrons is $\mathrm{var}(n) = \langle n\rangle$, which means that the variance in the current is

$\mathrm{var}(I) = \frac{1}{\Delta t}\langle I\rangle$.

Although the signal-to-noise ratio $\langle I\rangle^2/\mathrm{var}(I) = \Delta t\,\langle I\rangle$ increases with increasing observation
times and increasing mean current strength, the discreteness of charge remains a principal
source of noise. In a continuum model of charge transport this shot noise 11 can be modeled
by addition of a suitable noise term to the right-hand side of Eq. (10.16), in addition to
the Johnson noise term. (In Section 11.6 of the next chapter we explore how this addition
emerges in a microscopic construction.) Unlike Johnson noise, shot noise is independent of
temperature and resistivity of the circuit. Both have in common that the noise power is
largely independent of frequency.
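
The Poissonian statistics underlying shot noise can be made explicit in a small simulation. The sketch below is an illustration under stated assumptions: electrons are modeled as uncorrelated arrivals with exponentially distributed waiting times, charge is measured in units of the electron charge, and the rate, window length and function name are hypothetical choices.

```python
import random

def count_statistics(rate=20.0, window=1.0, n_windows=20_000, seed=1):
    """Mean and variance of the number of uncorrelated arrivals per time window."""
    rng = random.Random(seed)
    counts = []
    for _ in range(n_windows):
        t = rng.expovariate(rate)      # waiting time until the first arrival
        n = 0
        while t < window:
            n += 1
            t += rng.expovariate(rate)
        counts.append(n)
    mean = sum(counts) / n_windows
    var = sum((n - mean) ** 2 for n in counts) / (n_windows - 1)
    return mean, var

window = 1.0
mean_n, var_n = count_statistics(window=window)
mean_I = mean_n / window               # mean current <I> = <n> / Delta t
var_I = var_n / window ** 2            # var(I) = var(n) / Delta t^2
print(f"<n> = {mean_n:.2f}, var(n) = {var_n:.2f}")
print(f"var(I) = {var_I:.2f}, <I>/Delta t = {mean_I / window:.2f}")
```

Within sampling error the two numbers in each line should coincide, which is the content of $\mathrm{var}(n) = \langle n\rangle$ and $\mathrm{var}(I) = \langle I\rangle/\Delta t$.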


Other sources of noise 

A variety of physical processes produce noise. Empirically one finds that these sources of
noise can add to an overall noise signal that is (a) non-universal in that it depends on all
sorts of system-specific details and (b) strongly frequency dependent (unlike the “universal”
sources of Johnson and shot noise). It is customary to denote noise signals with a power
spectrum

$S(\omega) \sim \omega^{-\alpha}$, $0.5 < \alpha < 2$,

as 1/f noise (“f,” because in engineering it is customary to use $f$ instead of $\omega$ for frequency).
For further discussion of 1/f noise we refer to the literature. 12


10.2.3 Fokker-Planck equation I 

The network of connections revealed above and in Fig. 10.4 is of fundamental importance to
statistical physics and deserves further investigation. To this end, we will adopt the second
interpretation of the stochastic process $\mathbf v[\boldsymbol\xi](t)$ advocated above and work in terms of a
probability distribution $p(\mathbf v,t)$. Analysis of the effective equations controlling the probability
distribution leads us straight to the concept of the Fokker-Planck equation, a dynamical
equation whose importance to statistical physics can be compared to that of the Schrödinger
equation to quantum mechanics.

EXERCISE Carefully think about the steps in the derivation below; it is exemplary of a general 
scheme to derive Fokker-Planck evolution equations in the theory of stochastic processes. 

11 Shot noise was first observed in vacuum tubes, i.e. devices where electrons are “shot” from some cathode through
empty space.

12 For instance, see P. Dutta and P. M. Horn, Low-frequency fluctuations in solids: 1/f noise, Rev. Mod. Phys.
53, 497-516 (1981).

[Diagram linking the four concepts fluctuations, diffusion, dissipation, and drift.]

Figure 10.4 Different physical concepts revolving around the physics of the Langevin equation.

We wish to derive an equation controlling the evolution of the (conditional) probability
$p(\mathbf v,t|\mathbf v_0,t_0)$ to observe a particle velocity $\mathbf v$ at time $t$ provided we started out with velocity
$\mathbf v_0$ at time $t_0 < t$. The evolution of the variable $\mathbf v$ is controlled by the Langevin equation (10.10),
a first order differential equation in time. This means that the evolution $\mathbf v' \xrightarrow{\boldsymbol\xi} \mathbf v$
from some initial configuration $\mathbf v'$ to $\mathbf v$ solely depends on the initial configuration $\mathbf v'$ (and,
of course, on the specific realization of the random force, $\boldsymbol\xi$). However, it does not depend
on the pre-history of the particle, i.e. the specific way in which it made it to the initial
configuration $\mathbf v'$. (Exercise: consider why this is a specific feature of a first order differential
equation.) This feature implies the “convolution” property (again a statement to reflect
upon),


$p(\mathbf v,t|\mathbf v_0,t_0) = \int d^dv'\,p(\mathbf v,t|\mathbf v',t')\,p(\mathbf v',t'|\mathbf v_0,t_0)$, (10.20)

where $t' \in [t_0,t]$ is arbitrary and the second factor under the integral is the probability to
find $\mathbf v'$ at $t'$ for initial data $(\mathbf v_0,t_0)$. Weighting by the conditional probability to move on
to $\mathbf v$ and summing over all intermediate configurations, $\mathbf v'$, we obtain the total probability
of the transition $(\mathbf v_0,t_0) \to (\mathbf v,t)$. Equation (10.20) is the defining property of a Markovian process,
i.e. a process which is fully determined by one-step probabilities. (For a more systematic
discussion, see Section 10.4.2 below.)
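
The convolution property (10.20) can be verified numerically for a concrete kernel. The sketch below assumes the standard Gaussian transition density of the one-dimensional velocity process $d_tv = -\gamma v + \xi/m$ (mean $v'e^{-\gamma\tau}$, variance $(D_v/\gamma)(1-e^{-2\gamma\tau})$ with $D_v = A/2m^2$), a result that is quoted here and not derived in the text above. Function names and parameter values are illustrative.

```python
import math

def ou_kernel(v, v_prev, tau, gamma=1.0, D_v=0.5):
    """Assumed transition density p(v, t + tau | v_prev, t) of the velocity process."""
    mean = v_prev * math.exp(-gamma * tau)
    var = D_v / gamma * (1.0 - math.exp(-2.0 * gamma * tau))
    return math.exp(-(v - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

def composed(v, v0, tau1, tau2, v_max=10.0, n=4000):
    """Right-hand side of Eq. (10.20): trapezoidal integral over the intermediate v'."""
    h = 2.0 * v_max / n
    total = 0.0
    for k in range(n + 1):
        vp = -v_max + k * h
        weight = 0.5 if k in (0, n) else 1.0
        total += weight * ou_kernel(v, vp, tau2) * ou_kernel(vp, v0, tau1)
    return total * h

direct = ou_kernel(0.3, 1.0, 0.7 + 0.4)           # one step of duration tau1 + tau2
via_intermediate = composed(0.3, 1.0, 0.7, 0.4)   # two steps, summed over v'
print(direct, via_intermediate)
```

The two numbers should agree to numerical precision for any choice of the intermediate time, reflecting the fact that the process has no memory of how the intermediate velocity was reached.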

Evaluating Eq. (10.20) for the specific choice of time arguments $t \to t+\delta t$ and $t' \to t$,
and suppressing the initial value argument $(\mathbf v_0,t_0)$ for notational simplicity, we obtain an
equation for the incremental evolution of probability,

$p(\mathbf v,t+\delta t) = \int d^du\,w(\mathbf v-\mathbf u,t;\mathbf u,\delta t)\,p(\mathbf v-\mathbf u,t)$,

where we have introduced the notation

$w(\mathbf v,t;\Delta\mathbf v,\Delta t) = p(\mathbf v+\Delta\mathbf v,t+\Delta t|\mathbf v,t)$,

for the probability of “transitions” $(\mathbf v,t) \to (\mathbf v+\Delta\mathbf v,t+\Delta t)$. We next make use of the fact
that, for diffusive motion, the “step width” $\mathbf u$ in a sufficiently small time window, $\delta t$, will be
small in comparison to the length scales over which the probability distribution $p$ changes
considerably. It then makes sense to Taylor expand in the argument $\mathbf v-\mathbf u$ around $\mathbf v$, i.e.
to expand in the underlined arguments in $w(\underline{\mathbf v-\mathbf u},t;\mathbf u,\delta t)\,p(\underline{\mathbf v-\mathbf u},t)$. It is important to
appreciate that the legitimacy of this so-called Kramers-Moyal expansion hinges on the
kinematics of the process. In cases where one-step transitions are over large distances (as is
the case in, say, ballistic collisions in dilute gases) other procedures have to be applied. One
of these alternative approaches, Boltzmann kinetic theory, is discussed in Section 10.3
below.

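Performing the expansion, we obtain the infinite order differential equation,

$p(\mathbf v,t+\delta t) = (a^{(0)}p)(\mathbf v,t) - \partial_{v_i}(a^{(1)}_i p)(\mathbf v,t) + \frac{1}{2}\partial^2_{v_iv_j}(a^{(2)}_{ij}p)(\mathbf v,t) + \dots$, (10.21)

where

$a^{(0)}(\mathbf v,t) = \int d^du\,w(\mathbf v,t;\mathbf u,\delta t)$,
$a^{(1)}_i(\mathbf v,t) = \int d^du\,w(\mathbf v,t;\mathbf u,\delta t)\,u_i$,
$a^{(2)}_{ij}(\mathbf v,t) = \int d^du\,w(\mathbf v,t;\mathbf u,\delta t)\,u_iu_j$.

To compute the coefficients $a^{(n)}$, we use the fact that, for sufficiently small $\delta t$, the Langevin
equation assumes the approximate form

$\mathbf v(t+\delta t) = \mathbf v - \delta t\,\gamma\mathbf v + \frac{\delta t}{m}\,\boldsymbol\xi(t)$.

The second and third term on the right-hand side tell us how the random variable $\mathbf v(t+\delta t)$
relates to the random variable $\boldsymbol\xi$, provided $\mathbf v(t) = \mathbf v$ was realized. This means that the
transition probability is obtained as

$w(\mathbf v,t;\mathbf u,\delta t) = p(\mathbf v+\mathbf u,t+\delta t|\mathbf v,t) = \langle\delta(\mathbf v+\mathbf u-\mathbf v(t+\delta t))\rangle_\xi = \langle\delta(\mathbf u+\delta t\,\gamma\mathbf v-\delta t\,\boldsymbol\xi(t)/m)\rangle_\xi$.

Now, the functional distribution function controlling the fluctuation force is given by

$p[\boldsymbol\xi]\,D\xi = \mathcal N\,D\xi\,\exp\Big(-\frac{1}{2A}\int dt\,|\boldsymbol\xi(t)|^2\Big) \simeq \mathcal N\,D\xi\,\exp\Big(-\frac{\delta t}{2A}\sum_t|\boldsymbol\xi(t)|^2\Big)$ (cf. Eq. (10.11)),

where $\mathcal N$ ensures normalization and in the second representation we used $\delta t$ as our fundamental
time step discretization. The distribution factorizes, and the probability to find $\boldsymbol\xi(t)$
at time $t$ is given by

$p(\boldsymbol\xi(t)) = \mathcal N\exp\Big(-\frac{\delta t}{2A}|\boldsymbol\xi(t)|^2\Big)$.

Using this result, we get

$w(\mathbf v,t;\mathbf u,\delta t) = \int d^d\xi\,p(\boldsymbol\xi)\,\delta(\mathbf u+\delta t\,\gamma\mathbf v-\delta t\,\boldsymbol\xi/m) = \mathcal N\exp\Big(-\frac{m^2}{2A\,\delta t}\,|\mathbf u+\gamma\,\delta t\,\mathbf v|^2\Big)$,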


where $\mathcal N \to \mathcal N m/\delta t$ is a renormalized constant ensuring normalization of the probability,
$\int d^du\,w(\mathbf v,t;\mathbf u,\delta t) = 1$. With these results, it is straightforward to obtain the coefficients
as

$a^{(0)}(\mathbf v,t) = 1$,
$a^{(1)}_i(\mathbf v,t) = -\gamma\,\delta t\,v_i$,
$a^{(2)}_{ij}(\mathbf v,t) = \frac{A\,\delta t}{m^2}\,\delta_{ij} + (\gamma\,\delta t)^2\,v_iv_j$.

Coefficients $a^{(n>2)}(\mathbf v,t) = \mathcal O(\delta t^2)$ are vanishingly small in the limit $\delta t \to 0$. Substitution of
these results into Eq. (10.21), followed by first order expansion in $\delta t$ leads to the
Fokker-Planck equation, 13

$(\partial_t - \partial_{v_i}\gamma v_i - \partial^2_{v_iv_i}D_v)\,p(\mathbf v,t) = 0$, (10.22)

where

$D_v = \frac{A}{2m^2}$

is the diffusion constant of velocity. In equilibrium, $D_v = T\gamma/m$ (cf. Eq. (10.15)), which is
related to the particle diffusion constant (10.14) through the relation $D_v = \gamma^2D$.
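
For orientation, the substitution step leading from Eq. (10.21) to Eq. (10.22) may be spelled out as follows. This is a sketch of the intermediate algebra using only the coefficients $a^{(0)}$, $a^{(1)}_i$, $a^{(2)}_{ij}$ quoted above; summation over repeated indices is implied and terms of $\mathcal O(\delta t^2)$ are dropped.

```latex
\begin{align*}
p(\mathbf v,t+\delta t)
 &= p - \partial_{v_i}\big(a^{(1)}_i\,p\big)
      + \tfrac12\,\partial^2_{v_iv_j}\big(a^{(2)}_{ij}\,p\big) + \dots \\
 &= p + \delta t\,\partial_{v_i}\big(\gamma v_i\,p\big)
      + \delta t\,\frac{A}{2m^2}\,\partial^2_{v_iv_i}\,p + \mathcal O(\delta t^2), \\
\frac{p(\mathbf v,t+\delta t)-p(\mathbf v,t)}{\delta t}
 &\;\xrightarrow{\;\delta t\to0\;}\;
   \partial_t p = \partial_{v_i}\big(\gamma v_i\,p\big) + D_v\,\partial^2_{v_iv_i}\,p,
   \qquad D_v = \frac{A}{2m^2}.
\end{align*}
```

The contribution $(\gamma\,\delta t)^2v_iv_j$ to $a^{(2)}_{ij}$ is of second order in $\delta t$ and therefore does not survive the limit.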


Eq. (10.22) is a second order partial differential equation for the probability distribution.
It has to be solved with initial condition $p(\mathbf v,t=t_0) = \delta(\mathbf v-\mathbf v_0)$, i.e. Eq. (10.22)
defines an initial value problem (much like the time-dependent Schrödinger equation in
quantum mechanics). The second and third contribution to the Fokker-Planck operator
describe the competition of drift and diffusion, respectively. The Fokker-Planck equation,
thus, represents the “theory” behind the chain of connections shown in Fig. 10.4.
Derived from the Langevin equation with its interplay of dissipation and fluctuation, it
predicts diffusion and drift as the dominant transport mechanisms at large times. By way
of example, the figure above shows a set of trajectories with a constant drift contribution
but subject to different noise contributions leading to diffusion.

In the following, we discuss the dynamics described by Eq. (10.22) from a number of
different perspectives. Specifically, we wish to understand the meaning of the diffusion and
the drift term, the competition between the two, and at last, the approach to thermal
equilibrium within the framework of Langevin theory.

> To understand the meaning of the drift term, consider the Langevin equation in the presence
of a constant external driving force $\mathbf F$. Browsing through the steps in the derivation of
the Fokker-Planck equation, one verifies that the essential 14 effect of this generalization is
a shift $a^{(1)}_i \to \delta t\,(-\gamma v_i + m^{-1}F_i)$ of the drift coefficient. The Fokker-Planck equation thus
generalizes to $(\partial_t - \partial_{v_i}(\gamma v_i - m^{-1}F_i) - \partial^2_{v_iv_i}D_v)\,p(\mathbf v,t) = 0$. Multiplying this equation by
$v_i$, and integrating over velocity we obtain,

$d_t\langle\mathbf v\rangle + \gamma\langle\mathbf v\rangle - m^{-1}\mathbf F = 0$,

a result that can equally be obtained by averaging the Langevin equation over noise. This
equation describes the relaxation of the velocity expectation value to a stationary drift
configuration $\langle\mathbf v\rangle \to \mathbf F/m\gamma$. Identifying that expectation value with the force-induced drift
current, we obtain $\mathbf j_d = \mathbf F/m\gamma$, in agreement with the phenomenological reasoning above.

> The third term in Eq. (10.22) describes diffusion. The influence of the diffusion term on
the dynamics is visualized in the figure above for the case of particles in two dimensions
subject to a uniform driving force $\mathbf F = \mathrm{const.}\times(2\mathbf e_x + \mathbf e_y)$. Diffusion leads to a stochastic
spreading around the (deterministic) drift trajectory. The corresponding distribution $p(\mathbf v)$
is predicted by the stationary long time limit of the Fokker-Planck equation,
$(\partial_{v_i}(\gamma v_i - m^{-1}F_i) + D_v\,\partial^2_{v_iv_i})\,p(\mathbf v) = 0$, or

$(\gamma v_i - m^{-1}F_i + D_v\,\partial_{v_i})\,p(\mathbf v) = 0$.

This is solved by $p(\mathbf v) \sim \exp\big(-\frac{\gamma}{2D_v}\,|\mathbf v - \frac{\mathbf F}{m\gamma}|^2\big)$, i.e. a “diffusion cloud” centered around the drift
trajectory. At $\mathbf F = 0$, the diffusion constant $D_v = T\gamma/m$ assumes its equilibrium value
and $p(\mathbf v) \sim \exp\big(-\frac{m}{2T}|\mathbf v|^2\big)$ reduces to the Maxwell-Boltzmann distribution. We thus draw
the important conclusion that


in the absence of external driving, a conspiracy of diffusion and
drift sends the system into a state of thermal equilibrium.


13 Anticipating later generalizations to the case of $\mathbf v$-dependent coefficients, $\gamma$, $D_v$, we place the latter to the right
of the differential operators $\partial_{v_i}$.

14 However, we also have to bear in mind that the presence of a constant external force drives the system out
of equilibrium. (Switching off the force will change the state of the system, a criterion for a nonequilibrium
situation.) This means that dissipation and fluctuation are no longer related to each other by the
fluctuation-dissipation theorem.


EXERCISE Our discussion above focused on the diffusive character of the velocity distribution,
$p(\mathbf v,t)$. To describe the diffusive dynamics of the particle coordinates, $\mathbf q$, let us consider
a force-free ($\mathbf F = 0$) Langevin equation in the overdamped limit, $m\gamma\,d_t\mathbf q = \boldsymbol\xi$, i.e. a limit void of
external forces where ballistic acceleration $\sim m\,d_t^2\mathbf q$ is negligible. Show that the Fokker-Planck
equation for $p(\mathbf q,t)$ takes the form

$(\partial_t - D\,\partial_{\mathbf q}^2)\,p(\mathbf q,t) = 0$, (10.23)

where $D$ is given by Eq. (10.14). This confirms the expectation that in thermal equilibrium
the coordinate will perform drift-less diffusive motion whose level of agitation increases with
temperature.
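
As a numerical companion to this exercise (not a solution of it), one may simulate the overdamped equation $m\gamma\,d_tq = \xi$ for one coordinate and compare the mean square displacement with the diffusive law $\langle q^2\rangle = 2Dt$ implied by Eq. (10.23), with $D = T/m\gamma$ and $A = 2m\gamma T$. The discretization, the function name and all parameter values are illustrative assumptions.

```python
import math
import random

def mean_square_displacement(T=1.0, m=1.0, gamma=2.0, t_final=1.0,
                             n_steps=100, n_walkers=5000, seed=2):
    """Ensemble average <q(t_final)^2> for m*gamma*dq/dt = xi, starting at q = 0."""
    rng = random.Random(seed)
    A = 2.0 * m * gamma * T                   # noise strength, Eq. (10.12)
    dt = t_final / n_steps
    step = math.sqrt(A * dt) / (m * gamma)    # standard deviation of one displacement
    total = 0.0
    for _ in range(n_walkers):
        q = 0.0
        for _ in range(n_steps):
            q += step * rng.gauss(0.0, 1.0)
        total += q * q
    return total / n_walkers

T, m, gamma, t_final = 1.0, 1.0, 2.0, 1.0
D = T / (m * gamma)                           # Einstein relation, Eq. (10.14)
msd = mean_square_displacement(T, m, gamma, t_final)
print(f"<q^2> = {msd:.3f}   (2 D t = {2.0 * D * t_final:.3f})")
```

Raising the temperature at fixed friction increases the spread proportionally, in line with the statement that the level of agitation grows with $T$.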


EXERCISE Imagine that the single particle dynamics is governed by some Hamiltonian $H(\mathbf x)$
(as usual, $\mathbf x = (\mathbf q,\mathbf p)$). Again, we model the influence of the environment by a combination
of dissipation and fluctuation. This leads to the generalized phase space Langevin equation,
$d_t\mathbf q = \partial_{\mathbf p}H$, $d_t\mathbf p = -\partial_{\mathbf q}H - \gamma\mathbf p + \boldsymbol\xi$. Show that the probability distribution $p(\mathbf x,t)$ is governed by
the phase space Fokker-Planck equation,

$\partial_t p(\mathbf x,t) = -\{H,p(\mathbf x,t)\} + \frac{\partial}{\partial\mathbf p}\cdot\Big(\gamma\mathbf p + D_p\frac{\partial}{\partial\mathbf p}\Big)\,p(\mathbf x,t)$, (10.24)

where $D_p = m^2D_v = T\gamma m$ is the diffusion constant of momentum. The second operator on
the right-hand side describes the dissipation/fluctuation generalization of deterministic single
particle dynamics (cf. Eq. (10.9)).

Figure 10.5 Schematic showing the “daily routine” of an amoeba. Left: energy intake at a rate
$q$. Middle: dissipative energy losses due to basic metabolic activity, rate $c\,e(t)$, where $e(t)$ is the
instantaneous energy stored in the organism. Right: motion at a velocity $\mathbf v$ leads to an energy loss
proportional to both $e(t)$ and a velocity dependent loss function, $f(\mathbf v)$.


10.2.4 Beyond equilibrium 

We have seen how the Langevin equation describes the approach to equilibrium: dissipative 
energy losses and the intake of energy through fluctuations balance each other in a way 
to form an equilibrium configuration. However, the descriptive power of Langevin theory
extends well beyond the physics of equilibrium. Effective Langevin equations with
dissipation/fluctuation imbalances can be employed to describe rather complex out-of-equilibrium
phenomena in physics, life sciences, and socio-economics.


Active Brownian motion 

By way of example, we here discuss the construction of a minimalistic model to describe
the phenomenon of collective biological motion (“swarming”). Of course, we do not aim to
describe the actual behavior of a concrete species. Rather, our objective is to formulate an
as-simple-as-possible model of collective intake of energy and its conversion into kinematic
action. For the formulation of more powerful models and a survey of the recent literature,
we refer to Ebeling and Sokolov. 1

Our strategy is to endow the Brownian “particles” (you may now think of them as amoebae
or another not too complex animal) with a mechanism of self-propulsion. To this end,
we imagine the particles endowed with an energy storage. The time-dependent energy level,
$e(t)$, will (a) increase due to food intake at some rate, $q$, (b) decrease at rate $-c\,e(t)$ due
to metabolic activity (notice the assumption that the energy loss is larger the larger the
energy stored in the organism), and (c) decrease due to kinematic activity (see Fig. 10.5).
We assume that motion leads to a loss at rate $-e(t)f(\mathbf v)$, where $f(\mathbf v) = \kappa v^2$. The net change
of the particle’s energy is thus given by,

$d_te = q - ce - \kappa v^2e$.

We next assume that the kinematic energy loss is translated to a propelling force, $\mathbf F$. The
form of that force, $\mathbf F = e\kappa\mathbf v$, follows from the condition that the work per time interval done
by the force, $\mathbf F\cdot d\mathbf r/dt = \mathbf F\cdot\mathbf v$, must equal the kinematic energy loss, $\kappa v^2e$, and the assumption
that $\mathbf F \parallel \mathbf v$.

The particle dynamics is then described by the generalized Langevin equation,

$m\,d_t\mathbf v + m\gamma\mathbf v = -\nabla U + e\kappa\mathbf v + \boldsymbol\xi$,

where $U$ represents an external potential and $\gamma$ the background friction coefficient. Further
progress with this equation can be made if we assume that the population is in a balanced
state in that the energy “depots” are kept at equal filling levels, on average: $d_te = 0$, or

$e = \frac{q}{c + \kappa v^2}$.

This condition can be used to replace $e$ in the Langevin equation and to obtain an effective
Langevin equation governed by a mechanism of activated friction

$m\,d_t\mathbf v + m\gamma_{\rm eff}(\mathbf v)\,\mathbf v = -\nabla U + \boldsymbol\xi$, $\gamma_{\rm eff}(\mathbf v) = \gamma - \frac{\kappa q}{m(c + \kappa v^2)}$. (10.25)

An interesting situation arises if the background friction is small in that $\gamma < \kappa q/mc$. Under
this condition, the effective friction vanishes for finite velocities, 15

$v^2 = v_0^2 \equiv \frac{\kappa q - mc\gamma}{m\kappa\gamma}$.

The condition $|\mathbf v|^2 = v_0^2$ defines a sub-manifold in velocity space on which the dynamics
becomes effectively Liouvillian, i.e. unhindered by friction. To understand how the particles
arrange themselves around that manifold, we need to compute the velocity distribution
function. Inspection of the derivation of the Fokker-Planck equation shows that the
generalization to activated friction amounts to a mere replacement $\gamma \to \gamma_{\rm eff}(\mathbf v)$ in Eq. (10.22).
This means that the steady state probability distribution of the particles is determined by
the equation, $(D_v\,\partial_{v_i} + \gamma_{\rm eff}(\mathbf v)\,v_i)\,p(\mathbf v) = 0$. Integration of this equation leads to

$p(\mathbf v) = \mathcal N\exp\Big(-\frac{1}{2D_v}\Big(\gamma|\mathbf v|^2 - \frac{q}{m}\ln(c + \kappa|\mathbf v|^2)\Big)\Big)$,

where $\mathcal N$ ensures normalization. In the absence of energy supply, $q = 0$, and the distribution
reduces to the Maxwell-Boltzmann form. However, in general, $p(\mathbf v)$ is far from equilibrium.
The center of the distribution is determined by the condition $\partial_{|\mathbf v|^2}(\text{exponent}) = 0$, which
is solved by $|\mathbf v|^2 = v_0^2$. We conclude that the frictionless manifold $|\mathbf v|^2 = v_0^2$ is attractive
in that it defines the center of the distribution function. The sharpness of the distribution
is determined by the inverse of the diffusion constant, i.e. the more noisy the Langevin
equation, the less rigid the fixation of the condition $|\mathbf v|^2 = v_0^2$.
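
The statements about the frictionless manifold can be checked with a few lines of code. The sketch below evaluates $v_0^2$, the effective friction $\gamma_{\rm eff}$ and the exponent of the stationary distribution for one set of parameters satisfying $\gamma < \kappa q/mc$, and locates the maximum of the exponent on a grid of squared speeds. All function names and numerical values are illustrative assumptions.

```python
import math

def v0_squared(q, c, kappa, gamma, m):
    """Squared speed at which the effective friction vanishes."""
    return (kappa * q - m * c * gamma) / (m * kappa * gamma)

def gamma_eff(v2, q, c, kappa, gamma, m):
    """Effective friction coefficient as a function of the squared speed v2 = |v|^2."""
    return gamma - kappa * q / (m * (c + kappa * v2))

def log_density(v2, q, c, kappa, gamma, m, D_v):
    """Exponent of the stationary distribution (normalization constant omitted)."""
    return -(gamma * v2 - (q / m) * math.log(c + kappa * v2)) / (2.0 * D_v)

params = dict(q=2.0, c=1.0, kappa=1.0, gamma=0.5, m=1.0)   # gamma < kappa*q/(m*c) = 2
v2_star = v0_squared(**params)

grid = [0.01 * k for k in range(1001)]                     # squared speeds in [0, 10]
v2_peak = max(grid, key=lambda v2: log_density(v2, D_v=0.2, **params))

print(f"v0^2 = {v2_star:.3f}, gamma_eff(v0^2) = {gamma_eff(v2_star, **params):.3e}")
print(f"maximum of the exponent at |v|^2 = {v2_peak:.2f}")
```

The position of the maximum does not depend on the assumed value of $D_v$; a smaller $D_v$ only sharpens the distribution around $|\mathbf v|^2 = v_0^2$.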


Swarms 

The model above fixes the modulus of the velocity of individual particles. However, to
describe more complex patterns of collective biological motion, we need to (a) think about
mechanisms controlling the direction of velocity, and (b) correlate the motion of many
particles to form collective entities. Crudely speaking, three prototypes of collective motion
are frequently observed in nature: translational motion (a wandering herd of elephants),
rotational motion (a school of fish), and uncorrelated motion (bacteria, etc.). Theorists are
trying to mimic such types of collective dynamics by extended effective friction models.
Referring to chapter 12 of Ebeling and Sokolov 1 for a more comprehensive discussion, we
here restrict ourselves to a brief review of a few concepts relevant to this field.

15 In cases where the velocity $v^2 \ll c\kappa^{-1}$, the effective friction function $\gamma_{\rm eff}(\mathbf v) \simeq c + c'v^2$ may be effectively
linearized in the squared velocity. Activated friction profiles were employed by Rayleigh to describe the complex
energy feedback of musical instruments. (J. W. S. Rayleigh, On the resultant of a large number of vibrations
of the same pitch and of arbitrary phases, Phil. Mag. 10, 73-8 (1880).)

One aim is to characterize a swarm of $N$ particles in terms of a center coordinate, $\mathbf Q \equiv \frac{1}{N}\sum_i\mathbf q_i$,
and relative motion. Often, the dynamics of the center is of limited concern. One may, then,
decide to impose an external mean velocity $\mathbf V(t)$. The more interesting relative dynamics
of the particles is governed by a combination of a one- and a two-particle potential

$U(\mathbf q_1,\dots,\mathbf q_N) = \sum_{i=1}^{N}U^{(1)}(\mathbf q_i) + \sum_{i,j=1}^{N}U^{(2)}(\mathbf q_i,\mathbf q_j)$.

The interactions may be attractive (e.g. when one seeks to model motion of a school of
fish in open water), or repulsive (insects in a confined geometry). Collective and relative
dynamics are described by the set of equations

$m\,d_t\mathbf v_i + m\gamma_{\rm eff}(\mathbf v_i)\,\mathbf v_i + c(\mathbf v_i - \mathbf V(t)) = -\partial_{\mathbf q_i}U(\mathbf q_1,\dots,\mathbf q_N) + \boldsymbol\xi_i(t)$, (10.26)

where the term $c(\mathbf v_i - \mathbf V(t))$, $c > 0$, drives the particle velocity towards the mean velocity
of the swarm.

This is about as much as can be said in general. Approximation schemes employed to
describe specific types of motion include approximate decoupling of the equations into an
equation for the center coordinate and $N - 1$ equations for the relative coordinates, and
mean field treatment of the pair interaction. For example, assuming that the mean field
interaction potential is of the form, $U^{(2)} \simeq \frac{c}{2}(\mathbf q_i - \mathbf Q)^2$, where $\mathbf Q$ is the center of mass
coordinate, the equations for the relative coordinates decouple into the form

$m\,d_t\mathbf v'_i + m\gamma_{\rm eff}(\mathbf v'_i)\,\mathbf v'_i = -c\,\mathbf q'_i + \boldsymbol\xi_i(t)$,

where the primes indicate relative coordinates, and we assumed that $|\mathbf v'_i| \gg V$, so that the
friction force depends dominantly on the relative velocity. The four-dimensional dynamics
predicted by these equations converges to a limit cycle on the frictionless manifold $|\mathbf v'|^2 = v_0^2$
and with radius specified by $c|\mathbf q'|^2/2 = mv_0^2/2$ (equipartition of potential and kinetic
energy). The dynamics of the many particle system then resembles the rotational motion
of a swarm. For the discussion of other types of collective dynamics we refer to Ebeling and
Sokolov. 1


10.3 Boltzmann kinetic theory 

Previously, we have explored the ways in which a sequence “Langevin dynamics → Fokker-Planck
equation → distribution functions” leads to predictions on the macroscopic state of
many particle systems, both in and out of equilibrium. A central step in the derivation of the
Fokker-Planck equation was the Kramers-Moyal expansion, an expansion of the probability
of short time transitions $\mathbf x \to \mathbf x'$ in the difference $\mathbf x - \mathbf x'$. The Kramers-Moyal expansion
makes sense if, for sufficiently short $\delta t$, the hopping steps in phase space become small in
comparison to the typical phase space coordinates themselves. More precisely, we assumed
that the moments $\langle(\mathbf x' - \mathbf x)^n\rangle$ of the conditional probability $p(\mathbf x',t+\delta t|\mathbf x,t)$ exist and scale
with positive powers $\delta t^{p(n)}$. Only terms with $p(n) \le 1$ (in the specific case of Gaussian
distributed noise, the first two) enter the Fokker-Planck derivative operator. Put differently,
the evolution of the probability distribution can be described in terms of a low order linear
differential equation only if the $n$th moments of the transition probability are well behaved
in that $p(n) > 1$ for all but the lowest $n$. There are plenty of instances in many particle
physics where this “locality principle” is violated. Below we derive an effective evolution
equation that describes the dynamics of distribution functions under these circumstances.


10.3.1 Derivation of the Boltzmann equation 

By way of an example, consider a gaseous system of particles in a container. In the infinitely 
dilute limit (where particle collisions can be neglected), the phase space coordinates of 
individual particles change according to Newtonian dynamics (cf. Eq. (10.9)). Specifically, 
the single particle energy $\epsilon$ is conserved, and (assuming the absence of smooth potential
gradients) momentum is constrained to an energy shell $|\mathbf{p}|^2/2m = \epsilon$.

However, once interactions are taken into account, the rate at which the distribution
$f(\mathbf{x}_1, t)$ changes is influenced by particle collisions: losses occur when particles at phase
space point $\mathbf{x}_2$ scatter off particles at the reference point $\mathbf{x}_1$ into final states $\mathbf{x}'_1$ and
$\mathbf{x}'_2$ (cf. the figure). We denote the corresponding two-particle transition rate by
$w(\mathbf{x}'_1, \mathbf{x}'_2; \mathbf{x}_1, \mathbf{x}_2)\, d^dx'_1\, d^dx'_2\, d^dx_1\, d^dx_2$, where the function $w$ encapsulates kinematic
constraints (energy and momentum conservation) of the collision. The rate of change
$\partial_t f_1$ due to “out” processes then reads

$$ \partial_t f_1|_{\rm out} = \int d2\, d1'\, d2'\; w(1', 2'; 1, 2)\, f_1 f_2, $$

where we take into account the fact that the total transition rate depends linearly on the
number of available collision partners in the two initial states, and introduce a (standard)
notation wherein coordinate dependencies are indicated by numbers, $d^dx_1 \to d1$,
$f(\mathbf{x}_1) \to f_1$, etc.

Gain occurs when particles get scattered into the reference state in collisions
$(1', 2') \to (1, 2)$:

$$ \partial_t f_1|_{\rm in} = \int d2\, d1'\, d2'\; w(1, 2; 1', 2')\, f_{1'} f_{2'}. $$

The total occupation change due to collisions, $I[f] = \partial_t f_1|_{\rm in} - \partial_t f_1|_{\rm out}$, is summed up in
what is commonly called the collision integral,

$$ I[f] = \int d2\, d1'\, d2'\, \big( w(1, 2; 1', 2')\, f_{1'} f_{2'} - w(1', 2'; 1, 2)\, f_1 f_2 \big). \tag{10.27} $$

Notice that the collision integral is a nonlinear functional of the distribution functions, i.e. 
we are facing up to a nonlinear theory. 

The transition rates $w$ are subject to certain symmetry relations which reflect time-reversal
invariance and unitarity of the microscopic laws of physics. For example, microreversibility
implies that a transition $(1, 2) \to (1', 2')$ must be as probable as the time-reversed
process $(1'^T, 2'^T) \to (1^T, 2^T)$, where $1^T = (\mathbf{q}_1, \mathbf{p}_1)^T = (\mathbf{q}_1, -\mathbf{p}_1)$ is the time-reverse
of a phase space point, i.e.

$$ w(1', 2'; 1, 2) = w(1^T, 2^T; 1'^T, 2'^T). $$

The consequences of unitarity are best exposed if we interpret the coefficients $w$ as classical
limits of quantum transition probabilities. Identifying $(1, 2) \leftrightarrow |i\rangle$ as a classical approximation
to a (coherent) state in two particle Hilbert space,$^{16}$ and $(1', 2') \leftrightarrow |f\rangle$ as a final state,
we have the identification

$$ w(1', 2'; 1, 2) \leftrightarrow |S_{fi}|^2, $$

where $S_{fi}$ is the scattering matrix. Unitarity means that $\sum_f |S_{fi}|^2 = \sum_f |S_{if}|^2 = 1$, or

$$ \int d1'\, d2'\; w(1', 2'; 1, 2) = \int d1'\, d2'\; w(1, 2; 1', 2'). $$

Using this relation in the out-process, the collision integral can be transformed to

$$ I[f] = \int d2\, d1'\, d2'\; w(1, 2; 1', 2') \left( f_{1'} f_{2'} - f_1 f_2 \right). \tag{10.28} $$


Adding $I[f] = \partial_t f_1|_{\rm in} - \partial_t f_1|_{\rm out}$ as the many particle contribution to the changes in the
distribution function to the right-hand side of Eq. (10.9), we obtain the celebrated Boltzmann
transport equation,$^{17}$

$$ \left( \partial_t - \{H, \,\cdot\,\} \right) f(\mathbf{x}, t) = I[f]. \tag{10.29} $$


10.3.2 Discussion of the Boltzmann equation 

What is the Boltzmann equation (10.29) good for? First, it has been, and still is, a very
powerful tool in applied many particle physics. Before the advent of the more powerful
techniques reviewed in previous chapters of this book, it used to be the principal tool to
calculate physical observables in interacting many particle systems.$^{18}$ Even today, Boltzmann
equation-based approaches often represent the most economic and straightforward route to
understanding the physics of interacting many particle systems.

16 Here, we consider the particles as distinguishable, i.e. no (anti)symmetrization is implied.

17 In the Russian literature, the Boltzmann equation is usually called the “kinetic equation.”

Second, the Boltzmann equation is of great conceptual value. Many of the principal
questions raised in the beginning of the chapter have a relatively straightforward answer in
terms of this equation. We first note (cf. the more schematic discussion before Eq. (10.8))
that the collision term vanishes in thermal equilibrium. In equilibrium, the distribution
function is given by the Maxwell-Boltzmann distribution Eq. (10.8). Energy conservation in
elastic collisions, $H(\mathbf{x}_1) + H(\mathbf{x}_2) = H(\mathbf{x}'_1) + H(\mathbf{x}'_2)$, means that $f_1 f_2 = f_{1'} f_{2'}$, i.e. vanishing
of $I[f]$. In equilibrium the losses and gains due to many particle collisions compensate each
other and $I[f]$ does not change the distribution. (Since $f(\mathbf{x}) = f(H(\mathbf{x}))$, the single particle
dynamics conserves $f$ as well, $\{H, f\} = 0$, i.e. the Maxwell-Boltzmann distribution is truly
stationary under (10.29).)
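
The cancellation of gain and loss in equilibrium is easy to check numerically. The following is a minimal sketch, assuming free particles of equal mass with $H = |\mathbf{p}|^2/2m$ and units in which $k_B = 1$. The function names and the way a random elastic collision is generated (rotation of the relative momentum at fixed total momentum) are illustrative choices, not taken from the text.

```python
import math
import random


def maxwell_boltzmann(p, m=1.0, T=1.0, norm=1.0):
    """f(p) = norm * exp(-|p|^2 / (2 m T)); units with k_B = 1 (assumption)."""
    energy = sum(c * c for c in p) / (2.0 * m)
    return norm * math.exp(-energy / T)


def random_elastic_collision(p1, p2, rng):
    """Return outgoing momenta (p1', p2') of an equal-mass elastic collision.

    The total momentum is kept fixed and the relative momentum is rotated
    to a random direction, so energy and momentum are both conserved.
    """
    total = [a + b for a, b in zip(p1, p2)]
    rel = [a - b for a, b in zip(p1, p2)]
    rel_norm = math.sqrt(sum(c * c for c in rel))
    # Random direction obtained from an isotropic Gaussian vector.
    n = [rng.gauss(0.0, 1.0) for _ in range(3)]
    n_norm = math.sqrt(sum(c * c for c in n))
    new_rel = [rel_norm * c / n_norm for c in n]
    p1_out = [(t + r) / 2.0 for t, r in zip(total, new_rel)]
    p2_out = [(t - r) / 2.0 for t, r in zip(total, new_rel)]
    return p1_out, p2_out


def gain_minus_loss(p1, p2, p1_out, p2_out):
    """Factor f_1' f_2' - f_1 f_2 appearing in the collision integral."""
    return (maxwell_boltzmann(p1_out) * maxwell_boltzmann(p2_out)
            - maxwell_boltzmann(p1) * maxwell_boltzmann(p2))


rng = random.Random(0)
p1 = [rng.gauss(0.0, 1.0) for _ in range(3)]
p2 = [rng.gauss(0.0, 1.0) for _ in range(3)]
p1_out, p2_out = random_elastic_collision(p1, p2, rng)
print(gain_minus_loss(p1, p2, p1_out, p2_out))  # zero up to rounding errors
```

Replacing the Maxwell-Boltzmann form by a distribution that is not exponential in energy makes the printed difference non-zero for generic collisions, in line with the statement that only the equilibrium form annihilates the collision term.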


The Boltzmann H-theorem

However, it is not quite as easy to show that the collision term actually drives the distribu- 
tion towards the thermal equilibrium configuration. The defining property of the thermal 
equilibrium is that it maximizes entropy, S. In equilibrium statistical mechanics, it is usually 
taken for granted (the second law of thermodynamics) that many body relaxation processes
increase entropy, $d_t S > 0$, before the maximum of the equilibrium configuration is
reached. However, a more general conceptual framework such as Boltzmann kinetic theory 
should be able to actually demonstrate the ways in which a microreversible theory leads to 
macroscopic irreversibility and entropy increase. 

INFO In view of the comparative complexity of this problem, the collision term is sometimes
linearized in what is called the relaxation time approximation. One effects the following
replacement of the complicated nonlinear collision integral,

$$ I[f] \longrightarrow -\frac{1}{\tau}\,(f - f_0), \tag{10.30} $$

where $f_0$ is the Maxwell-Boltzmann distribution. The linearized approximation then describes
a tendency to approach equilibrium at a rate set by $\tau$, a time scale interpreted as the mean
collision time of interaction processes.
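
To see what the relaxation time approximation implies in the simplest setting, consider a spatially homogeneous system without external forces, so that the Liouville term drops out and $\partial_t f = -(f - f_0)/\tau$ holds for each momentum separately. The sketch below integrates this equation on a hypothetical one-dimensional momentum grid (units $k_B = m = 1$; grid, temperatures and step size are illustrative assumptions) and compares with the exponential solution $f(t) = f_0 + (f(0) - f_0)\,e^{-t/\tau}$.

```python
import math


def relax(f_init, f_eq, tau, t_final, n_steps=10000):
    """Euler integration of d_t f = -(f - f_eq) / tau for each momentum bin."""
    dt = t_final / n_steps
    f = list(f_init)
    for _ in range(n_steps):
        f = [fi - dt * (fi - fe) / tau for fi, fe in zip(f, f_eq)]
    return f


# Hypothetical momentum grid; target distribution at T = 1, initial one at T = 2.
momenta = [0.25 * k for k in range(-16, 17)]
f_eq = [math.exp(-p * p / 2.0) for p in momenta]
f_init = [math.exp(-p * p / 4.0) / math.sqrt(2.0) for p in momenta]

tau, t_final = 0.5, 1.0
f_num = relax(f_init, f_eq, tau, t_final)
f_exact = [fe + (fi - fe) * math.exp(-t_final / tau)
           for fi, fe in zip(f_init, f_eq)]
max_error = max(abs(a - b) for a, b in zip(f_num, f_exact))
print(max_error)  # small discretization error of the Euler scheme
```

Every momentum component relaxes with the same rate $1/\tau$, which is precisely the simplification bought by replacing the nonlinear collision integral.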

The manner in which interactions increase entropy was demonstrated by Boltzmann in a
famous construction known as the $H$-theorem. (It is known as the $H$-theorem rather than
the $S$-theorem because Boltzmann called entropy $H$.) Consider the information-theoretic
definition of entropy,$^{19}$ according to which $S = -\langle \ln(f/e) \rangle$, or

$$ S = -\int d^dx\, f(\mathbf{x}, t) \ln\big(f(\mathbf{x}, t)/e\big). $$

18 For a detailed account of various applications and computational schemes revolving around the Boltzmann
equation, see L. D. Landau and E. M. Lifshitz, Course of Theoretical Physics, Vol. 10 - Physical Kinetics
(Butterworth-Heinemann, 1981).

The temporal change of $S$ is then given by

$$ d_t S = -\int d^dx\, \ln f\, \partial_t f = -\int d^dx\, \ln f\, \big[ \{H, f\} + I[f] \big]. $$

Since the full phase space integral of any function is invariant under Hamiltonian flow
(exercise: why?), we have $0 = \int d^dx\, \{H, f \ln(f/e)\} = \int d^dx\, \ln f\, \{H, f\}$, i.e. the change in
entropy is entirely due to interactions.

To explore how interactions do the job, consider the collision integral in its prototypical
variant Eq. (10.27). For an arbitrary function $\phi(\mathbf{x})$, and using the abbreviation
$d\Gamma = d1\, d2\, d1'\, d2'$, we have

$$ \begin{aligned}
\int d1\, \phi(1)\, I[f(1)] &= \int d\Gamma\, \phi(1)\, \big( w(1, 2; 1', 2')\, f_{1'} f_{2'} - w(1', 2'; 1, 2)\, f_1 f_2 \big) \\
&= \int d\Gamma\, \big( \phi(1) - \phi(1') \big)\, w(1, 2; 1', 2')\, f_{1'} f_{2'} \\
&= \frac{1}{2} \int d\Gamma\, \big( \phi(1) + \phi(2) - \phi(1') - \phi(2') \big)\, w(1, 2; 1', 2')\, f_{1'} f_{2'},
\end{aligned} \tag{10.31} $$

where in the second line we relabeled coordinates $(1, 2) \leftrightarrow (1', 2')$ in the loss term, and the
third line is based on the symmetry $1 \leftrightarrow 2$, $1' \leftrightarrow 2'$ under exchange of the collision partners.

From this result, we can derive a few auxiliary identities. Setting $\phi = 1$, we conclude that
the collision term vanishes upon integration,

$$ 0 = \int d1\, I[f] = \int d\Gamma\; w(1, 2; 1', 2')\, \big( f_{1'} f_{2'} - f_1 f_2 \big), $$

where the second representation Eq. (10.28) was used. Introducing the abbreviations

$$ \Lambda \equiv \frac{f_{1'} f_{2'}}{f_1 f_2}, \qquad X \equiv w(1, 2; 1', 2')\, f_1 f_2, $$

this can be written as $0 = \int d\Gamma\, X (\Lambda - 1)$. Setting $\phi = \ln f$, Eq. (10.31) yields
$d_t S = \frac{1}{2} \int d\Gamma\, X \Lambda \ln \Lambda$. Combination of these results finally leads to

$$ d_t S = \frac{1}{2} \int d\Gamma\, X\, (\Lambda \ln \Lambda - \Lambda + 1). $$



19 If you are not familiar with this definition, consult any advanced textbook on equilibrium statistical mechanics.
Exercise: show that in thermal equilibrium, the above definition reduces to the standard definition of
thermodynamics.

Now, the functions $X$ and $\Lambda$ are manifestly positive. For positive $\Lambda$, the combination
$\Lambda \ln \Lambda - \Lambda + 1$ is non-negative as well, vanishing only at $\Lambda = 1$ (see the figure or,
alternatively, try to prove it). This demonstrates
that elastic particle collisions indeed increase the entropy of the system.
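
The sign of the integrand can be made plausible with a quick numerical scan. With $g(\Lambda) = \Lambda \ln \Lambda - \Lambda + 1$ one has $g'(\Lambda) = \ln \Lambda$ and $g''(\Lambda) = 1/\Lambda > 0$, so $g$ is convex with its single minimum $g(1) = 0$. The short sketch below (function name and sampling grid are arbitrary choices) evaluates $g$ over twelve decades of $\Lambda$.

```python
import math


def h_integrand(lam):
    """g(Lambda) = Lambda ln(Lambda) - Lambda + 1, defined for Lambda > 0."""
    return lam * math.log(lam) - lam + 1.0


# Logarithmically spaced sample points between 1e-6 and 1e6.
samples = [10.0 ** (k / 10.0) for k in range(-60, 61)]
values = [h_integrand(lam) for lam in samples]
print(min(values))       # smallest sampled value, attained at Lambda = 1
print(h_integrand(1.0))  # the minimum itself
```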


Mesoscopic evolution laws 

The arguments used in the derivation of the $H$-theorem are useful to derive “mesoscopic”
dynamical equations for observables of physical interest - densities, currents, and the like
- i.e. equations that describe their behavior at length scales much larger than the collision
mean free path $l = \tau v$ (where $v$ is a typical particle velocity) yet smaller than macroscopic
scales.

These equations must reflect the fundamental conservation laws of the system, the conservation
of energy, particle number, momentum, etc. Now, interactions may change the
energy or momentum of individual particles, but they will not change the total energy and
momentum of a sufficiently large assembly of particles. To see this in quantitative terms, let
$\delta\Gamma = \delta V \times \mathbb{R}^3$ be a thin slice in phase space containing phase space points in the small
volume element $\delta V$ at coordinate $\mathbf{q}$ and of arbitrary momentum $\mathbf{p} \in \mathbb{R}^3$. The average particle,
energy, and momentum density, respectively, are then given by

$$ \rho = \frac{1}{\delta V} \int_{\delta\Gamma} d^dx\, f, \qquad \epsilon = \frac{1}{\delta V} \int_{\delta\Gamma} d^dx\, f\, H, \qquad \boldsymbol{\pi} = \frac{1}{\delta V} \int_{\delta\Gamma} d^dx\, f\, \mathbf{p}. $$

It is now straightforward to prove that interactions do not change these values. To this end,
let us assume that the coefficients $w$ describe an elastic point interaction, i.e. an interaction
local in space that respects energy and momentum conservation. In this case (exercise: think
why), a restriction of Eq. (10.31) to the volume restricted phase space element $\delta\Gamma$ holds,
i.e.

$$ \int_{\delta\Gamma} d1\, \phi(1)\, I[f(1)] = \frac{1}{2} \int_{1 \in \delta\Gamma} d\Gamma\, \big( \phi(1) + \phi(2) - \phi(1') - \phi(2') \big)\, w(1, 2; 1', 2')\, f_{1'} f_{2'}. $$

Now, particle, energy, and momentum conservation mean that, in the particular cases
$\phi = 1, \epsilon, \mathbf{p}$, the linear combinations $\phi(1) + \phi(2) - \phi(1') - \phi(2')$ on the right-hand side of the
equation vanish. We thus conclude

$$ \int_{\delta\Gamma} d1\, I[f(1)] \begin{Bmatrix} 1 \\ H(\mathbf{x}) \\ \mathbf{p} \end{Bmatrix} = 0. $$

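This identity is the key to the formal derivation of conserving transport equations. For
example, changes in the particle density are obtained as

$$ \begin{aligned}
\delta V\, \partial_t \rho &= \int_{\delta\Gamma} d^dx\, \partial_t f = \int_{\delta\Gamma} d^dx\, \{H, f\} = \int_{\delta\Gamma} d^dx\, \big( \partial_{q_\alpha} H\, \partial_{p_\alpha} f - \partial_{p_\alpha} H\, \partial_{q_\alpha} f \big) \\
&= \partial_{q_\alpha} H \int_{\delta\Gamma} d^dx\, \partial_{p_\alpha} f - \int_{\delta\Gamma} d^dx\, v_\alpha\, \partial_{q_\alpha} f = 0 - \delta V\, \partial_{q_\alpha} j_\alpha,
\end{aligned} $$

where the collision term has dropped out by the identity above, and
$j_\alpha = \delta V^{-1} \int_{\delta\Gamma} d^dx\, v_\alpha f$ is the $\alpha$-component of the particle current density. We thus
obtain the particle density continuity equation,

$$ \partial_t \rho + \nabla \cdot \mathbf{j} = 0. $$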

If you find this result trivial, keep in mind that it has been derived for arbitrary distribu- 
tion functions, and under specific (if reasonably general) assumptions on the nature of the 
microscopic interactions. Particle number conservation as well as the energy and momentum 
conservation laws discussed in the exercise below impose important consistency constraints 
on the validity of the theory. 

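EXERCISE Show that, for a translationally invariant system, energy and momentum density
obey the conservation laws

$$ \partial_t \epsilon + \nabla \cdot \mathbf{j}_\epsilon = 0, \qquad \partial_t \pi_\alpha + \partial_{q_\beta} \sigma_{\alpha\beta} = 0, \tag{10.32} $$

where $\mathbf{j}_\epsilon = \delta V^{-1} \int_{\delta\Gamma} d^dx\, f H \mathbf{v}$ and $\sigma_{\alpha\beta} = \delta V^{-1} \int_{\delta\Gamma} d^dx\, f\, m v_\alpha v_\beta$ are the energy current density
and the momentum current tensor, respectively.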

Beyond equilibrium: zero modes of the collision integral 
As with Langevin theory, the full potential of the formalism becomes visible once we go
beyond the level of thermal equilibrium. We have seen that the collision integral favours a
Maxwell-Boltzmann distribution, $f(\mathbf{x}) = \mathcal{N} \exp(-\epsilon/T)$. Importantly, however, it is
indiscriminate as to the value of temperature, or the normalization factor $\mathcal{N}$ (which determines
the particle number). In fact, if it were left to the collision integral alone, local equilibrium
configurations described by spatially varying profiles $T \to T(\mathbf{q})$, $\mathcal{N} \to \mathcal{N}(\mathbf{q})$ are perfectly
balanced; all the collision integral requires is a distribution whose logarithm is linear in energy,
$\sim \exp(\mathrm{const.} \times \epsilon)$. What prevents strong local variations of temperature and normalization
is the left-hand side of the Boltzmann equation, where Liouville flow builds up correlations
across the system. Once the system is perturbed out of equilibrium (by application
of a strong electric field, say, or by connection to two thermal reservoirs kept at different
temperature), we may then expect a scenario wherein the collision integral aims to establish
local thermal equilibrium, while the left-hand side will try to compromise between the
“cost” of spatial variations and the need to adjust to the conditions imposed by the external
perturbation.

The quantitative description of such competitions is the subject of Boltzmann transport
theory, a substantial field by itself (again, we refer to Landau and Lifshitz$^{18}$ for a very
detailed exposition). In the following, we discuss a few basic elements of this theory and
apply them to an example. To this end, let us imagine a system weakly perturbed out
of equilibrium. We may then assume that its effective distribution function is of the form
$f = f^0 + \dots$, where $f^0$ is a Maxwell-Boltzmann distribution with uniform parameters
and the ellipses represent a small deviation. The fact that $f^0$ depends on the phase space
coordinates only through energy, $\epsilon(\mathbf{x}) = H(\mathbf{x})$, suggests a representation

$$ f(\mathbf{x}) = f^0(\epsilon) + \frac{f^0(\epsilon)}{T}\, \chi(\mathbf{x}) = f^0(\epsilon) - \partial_\epsilon f^0(\epsilon)\, \chi(\mathbf{x}), $$

where $\chi$ describes deviations from equilibrium. Substituting this ansatz into the collision
integral (10.28), and using the fact that $f^0_{1'} f^0_{2'} = f^0_1 f^0_2$, we obtain the linear integral operator

$$ I[\chi] = \frac{f^0_1}{T} \int d2\, d1'\, d2'\; w(1, 2; 1', 2')\, f^0_2\, \big( \chi_{1'} + \chi_{2'} - \chi_1 - \chi_2 \big). \tag{10.33} $$

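This operator has a number of manifest “zero modes”: for

$$ \chi(\mathbf{x}) = c(\mathbf{q}), \qquad \chi(\mathbf{x}) = c(\mathbf{q})\, \epsilon(\mathbf{p}), \qquad \chi(\mathbf{x}) = \mathbf{v}(\mathbf{q}) \cdot \mathbf{p}, \tag{10.34} $$

with space-dependent $c(\mathbf{q})$, $\mathbf{v}(\mathbf{q})$,$^{20}$ we have $I[\chi] = 0$. These zero modes reflect the invariance
of the collision integral under local variations of the equilibrium parameters. For example,
a local change in the particle density can be described by $\mathcal{N} \to \mathcal{N}^0 + \delta\mathcal{N}(\mathbf{q})$. Linearity of
the Maxwell-Boltzmann distribution function in $\mathcal{N}$ then gives

$$ f \to f^0 + \frac{\delta\mathcal{N}}{\mathcal{N}^0}\, f^0 = f^0 + \frac{f^0}{T}\, \frac{T\, \delta\mathcal{N}}{\mathcal{N}^0}, $$

i.e. with the identification $c(\mathbf{q}) = T\, \delta\mathcal{N}(\mathbf{q})/\mathcal{N}^0$, a variation as in the first equality of
Eq. (10.34).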
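
That the three functions are annihilated by the linearized collision integral can be read off from the combination $\chi_{1'} + \chi_{2'} - \chi_1 - \chi_2$ entering it. As a short worked check, assuming point-like collisions (so that all four states share the same position $\mathbf{q}$) that conserve energy and momentum:

```latex
\begin{align*}
\chi = c(\mathbf{q}):
  &\quad \chi_{1'} + \chi_{2'} - \chi_1 - \chi_2
   = c(\mathbf{q})\,(1 + 1 - 1 - 1) = 0, \\
\chi = c(\mathbf{q})\,\epsilon(\mathbf{p}):
  &\quad \chi_{1'} + \chi_{2'} - \chi_1 - \chi_2
   = c(\mathbf{q})\,(\epsilon_{1'} + \epsilon_{2'} - \epsilon_1 - \epsilon_2) = 0, \\
\chi = \mathbf{v}(\mathbf{q}) \cdot \mathbf{p}:
  &\quad \chi_{1'} + \chi_{2'} - \chi_1 - \chi_2
   = \mathbf{v}(\mathbf{q}) \cdot (\mathbf{p}_{1'} + \mathbf{p}_{2'} - \mathbf{p}_1 - \mathbf{p}_2) = 0.
\end{align*}
```

The three lines correspond one-to-one to particle number, energy, and momentum conservation in a single collision.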


EXERCISE Extend this argument to show that the second zero mode corresponds to local
changes in temperature, and the third reflects Galilean invariance, i.e. invariance under Galilean
transformation to an inertial frame moving with velocity $\mathbf{v}(\mathbf{q})$. (Again, the locality of the collision
integral entails that the boost velocity can be spatially varying.)


Example: thermal conductivity of a gas of particles 
Even the linearized Boltzmann equation is a complicated integro-differential equation which 
is difficult to solve in generality. Often, however, it is possible to resort to certain simplifying 
approximations - of which the relaxation time approximation is one of the more popular - 
whereupon the Boltzmann equation becomes a highly efficient computational tool. In the 
following, we illustrate some of these ideas on the example of the thermal conductivity of a 
gas. 

Suppose we have imposed a temperature profile $T(\mathbf{q})$ with non-vanishing gradient on a gas of
interacting particles. The gas will respond by the buildup of a heat current $\mathbf{j}_\epsilon$ (cf. Eq. (10.32)).
For sufficiently weak temperature gradients, the current will depend linearly on $\nabla T$, and we
may define the thermal conductivity, $\kappa$, through the relation

$$ \mathbf{j}_\epsilon = -\kappa \nabla T. \tag{10.35} $$

In the following, we outline how this coefficient may be obtained in a semi-phenomenological
manner from the Boltzmann equation (for a more substantial discussion, cf. §7 of Landau
and Lifshitz$^{18}$).

If the temperature gradient is small, we may assume a distribution of the form

$$ f(\mathbf{x}) = f^0(\mathbf{x}) + \frac{f^0(\mathbf{x})}{T}\, \chi(\mathbf{x}), $$

where $f^0(\mathbf{x}) = \mathcal{N} \exp(-\epsilon(\mathbf{p})/T(\mathbf{q}))$ is a local Maxwell-Boltzmann distribution with spatially
varying temperature, and $\chi$ accounts for the distortion of the system out of the local
equilibrium described by $f^0$. Since $\chi$ describes a mere redistribution of particles that leaves
their concentration unchanged, we have $\int d^dx\, f^0 \chi = 0$. The heat current density carried by
the system is given by (cf. Eq. (10.32))

$$ \mathbf{j}_\epsilon = \frac{1}{T\, \delta V} \int_{\delta\Gamma} d^dx\, f^0\, \epsilon\, \mathbf{v}\, \chi, $$

where we used the fact that the distribution $f^0$ does not support a current.

20 Here we assume that the energy conserved in elastic collisions is a function of momentum only, at least on the
microscopic length scales relevant to the interaction processes.

We next substitute the above ansatz into the Boltzmann equation and linearize: noting
that deviations out of (local) equilibrium are driven by $\partial_{\mathbf{q}} T$, we anticipate that, to leading
order, $\chi = \mathcal{O}(\partial_{\mathbf{q}} T)$. Terms of higher order in $\partial_{\mathbf{q}} T$ will be neglected, which means that, on
the left-hand side of the Boltzmann equation, the zeroth order approximation $f \to f^0$ can be
used. On the right-hand side, we meet with the linearized collision integral $I[\chi]$. Assuming
stationarity of the distribution, we thus obtain

$$ -\{H, f^0\} = \mathbf{v} \cdot \partial_{\mathbf{q}} f^0 = \partial_T f^0\; \mathbf{v} \cdot \partial_{\mathbf{q}} T = \frac{\epsilon f^0}{T^2}\; \mathbf{v} \cdot \partial_{\mathbf{q}} T = I[\chi]. $$


The linearity of $I[\chi]$ in $\chi$ suggests an ansatz $\chi = (\mathbf{v} \cdot \partial_{\mathbf{q}} T)\, g(\mathbf{p})$, where $g$ is a scalar function.
While the solution of the linear integral equation for $g$ remains a formidable task, things get
a lot simpler if we resort to the phenomenological relaxation time approximation Eq. (10.30),

$$ I[\chi] = -\frac{f^0}{T \tau}\, (\mathbf{v} \cdot \partial_{\mathbf{q}} T)\, g(\mathbf{p}), $$

where $\tau$ is the characteristic time of relaxation processes in the medium. Substitution of
this ansatz into the Boltzmann equation then readily leads to the solution

$$ g(\mathbf{p}) = -\frac{\tau \epsilon}{T}. $$

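Substitution into the formula for the current then gives

$$ \mathbf{j}_\epsilon = -\delta V^{-1} \int d^dx\, f^0 \left( \frac{\epsilon}{T} \right)^2 \tau\, (\mathbf{v} \cdot \partial_{\mathbf{q}} T)\, \mathbf{v} = -\frac{\tau}{3\, \delta V} \int d^dx\, f^0 \left( \frac{\epsilon}{T} \right)^2 v^2\, \partial_{\mathbf{q}} T, $$

which leads to the result

$$ \kappa = \frac{\tau}{3\, \delta V} \int d^dx\, f^0 \left( \frac{\epsilon}{T} \right)^2 v^2 \sim \rho\, \tau\, v^2 \sim \rho\, l \left( \frac{T}{m} \right)^{1/2} $$

for the thermal conductivity. Here, $l = \tau v$ is the collision mean free path, and we have
made use of the fact that the mean velocity of particles scales as $v \sim (T/m)^{1/2}$. (Typically,
the mean free path depends only weakly on temperature.)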
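
The integral expression $\kappa = (\tau / 3\delta V) \int d^dx\, f^0 (\epsilon/T)^2 v^2$ obtained in the relaxation time approximation can be evaluated explicitly for a simple model. The sketch below assumes a three-dimensional gas with $\epsilon = m v^2/2$, uniform density $\rho$ inside $\delta V$, a momentum-independent $\tau$, and units with $k_B = 1$, so that $\kappa = (\rho \tau / 3)\, \langle (\epsilon/T)^2 v^2 \rangle$ with a Maxwell-Boltzmann average. Using the Gaussian moment $\langle v^6 \rangle = 105\, (T/m)^3$, this evaluates to $\kappa = (35/4)\, \rho \tau T / m$; the function names are illustrative.

```python
import math


def mb_average(g, m=1.0, T=1.0, v_max_factor=12.0, n=20000):
    """Average of g(v) over the 3D Maxwell-Boltzmann speed distribution.

    Midpoint rule on [0, v_max_factor * sqrt(T/m)]; units with k_B = 1.
    """
    v_max = v_max_factor * math.sqrt(T / m)
    dv = v_max / n
    norm = (m / (2.0 * math.pi * T)) ** 1.5
    total = 0.0
    for k in range(n):
        v = (k + 0.5) * dv
        weight = 4.0 * math.pi * v * v * norm * math.exp(-m * v * v / (2.0 * T))
        total += weight * g(v) * dv
    return total


def thermal_conductivity(rho, tau, m=1.0, T=1.0):
    """kappa = (rho * tau / 3) * <(eps / T)^2 v^2> with eps = m v^2 / 2."""
    def integrand(v):
        eps = 0.5 * m * v * v
        return (eps / T) ** 2 * v * v
    return rho * tau / 3.0 * mb_average(integrand, m, T)


rho, tau, m, T = 1.0, 1.0, 1.0, 1.0
kappa_numeric = thermal_conductivity(rho, tau, m, T)
kappa_closed = 35.0 / 4.0 * rho * tau * T / m
print(kappa_numeric, kappa_closed)
```

With $\tau = l / v$ and $v \sim (T/m)^{1/2}$, the closed form reproduces the scaling $\kappa \sim \rho\, l\, (T/m)^{1/2}$ up to a numerical factor of order one.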

Although our discussion above focused on a specific example, it reflects a number of
general principles in Boltzmann transport theory:

▷ In systems weakly perturbed out of equilibrium, it is convenient to use a representation
$f = f^0 - \partial_\epsilon f^0\, \chi$ for the distribution function, where $f^0$ is a (local) Maxwell-Boltzmann
distribution.

▷ Typically, the left-hand side of the Boltzmann equation contains the perturbing influences,
while the right-hand side (the collision term) aims to drive the system back to
equilibrium. This suggests...

▷ an expansion of the left-hand side to first order in the perturbations and to zeroth order
in the distribution function. On the right-hand side we linearize the collision integral,
using the fact that $I[f^0] = 0$.

▷ The expansion of the left-hand side then contains the perturbation in a form whose
symmetries suggest a problem-adjusted ansatz for the correction $\chi$. Substitution of this
ansatz produces an effective equation for the strength of the correction which can be
solved, e.g., in a relaxation time approximation.

Programs of this type often provide an efficient route to results otherwise obtained by 
microscopic linear response theory. However, problems far removed from equilibrium call 
for a more tailored approach and no general schemes can be formulated. 


10.4 Stochastic processes 

In the previous sections we have introduced a few strategies to describe the approach to
(as well as departures from) thermal equilibrium. En route we have met with different
realizations of stochastic dynamics, or “stochastic processes.” Stochastic processes play a 
fundamental role in a whole spectrum of sciences, including the life sciences, engineering, 
socio-economic sciences, and many more. Equally important (for us as physicists) the lan- 
guages in which stochastic processes are commonly described - the vocabulary includes 
“probability,” “rate equations,” “Markov approximations,” etc. - are spoken by a large 
community of scientists, which is an ideal basis for interdisciplinary dialogue. 


10.4.1 The notion of a stochastic process

A process describes the temporal evolution of a certain state. It can be described in terms of a
sequence $\{a_i(t_i)\}$, where $t_i$, $i = 1, \dots, n$, are the discrete times at which the states are recorded
and $a_i(t_i)$ is a (generally multi-dimensional) state variable. A stochastic process$^{21}$ is one
where the evolution $a_i(t_i) \to a_{i+1}(t_{i+1})$ involves elements of randomness. In the description
of stochastic processes we then need to focus on the totality of all possible sequences
$\{a_i(t_i)\}$, where each realization gets assigned a certain probability $p(a_n, t_n; \dots; a_1, t_1)$. The
probability function $p$ carries the full information on the process, and this is the object we
need to describe.

Before moving on, it is worthwhile to distinguish between a few general types of stochastic
processes. We speak of a stationary stochastic process if $p$ does not change under
simultaneous translation $t_i \to t_i + t_0$, $i = 1, \dots, n$, of all time arguments, i.e. if the theory
is, on average, translationally invariant in time. Using the notion of conditional probabilities
(cf. Eq. (10.7)), the joint probability function describing a stochastic process can be
iteratively constructed as

$$ \begin{aligned}
p(a_n, t_n; \dots; a_1, t_1) = {}& p(a_n, t_n | a_{n-1}, t_{n-1}; \dots; a_1, t_1) \\
& \times p(a_{n-1}, t_{n-1} | a_{n-2}, t_{n-2}; \dots; a_1, t_1) \times \dots \times p(a_2, t_2 | a_1, t_1) \times p(a_1, t_1).
\end{aligned} $$

21 For a state of the art introduction to theory and application of stochastic processes we refer to N. G. van
Kampen, Stochastic Processes in Physics and Chemistry (Elsevier, 1992).

The process is called purely random if the conditional probabilities
$p(a_j, t_j | a_{j-1}, t_{j-1}; \dots; a_1, t_1) = p(a_j, t_j)$ are independent of the history of events. In this case,

$$ p(a_n, t_n; \dots; a_1, t_1) = \prod_{j=1}^{n} p(a_j, t_j) $$

is the product of $n$ single-event probabilities lacking intrinsic correlation. Evidently, purely random
processes do not display very interesting dynamics. Next in the hierarchy of complexity are
processes whose conditional probabilities obey

$$ p(a_n, t_n | a_{n-1}, t_{n-1}; \dots; a_1, t_1) = p(a_n, t_n | a_{n-1}, t_{n-1}). \tag{10.36} $$

Processes of this type are called Markov processes. Markov processes enjoy an enor- 
mous spectrum of applications in statistical sciences, which is why they deserve a separate 
discussion. 
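
The content of Eq. (10.36) can be illustrated on a toy example. The sketch below assumes a stationary two-state chain with hypothetical transition probabilities. It builds joint probabilities from the iterative product of conditional probabilities and then verifies that the probability of the last event, conditioned on the full history, depends only on the preceding state.

```python
from itertools import product

# Hypothetical model: initial[a] = p(a_1), transition[prev][nxt] = p(nxt | prev).
initial = {"A": 0.3, "B": 0.7}
transition = {"A": {"A": 0.9, "B": 0.1},
              "B": {"A": 0.5, "B": 0.5}}


def path_probability(path):
    """Joint probability p(a_n; ...; a_1) of a sequence a_1 -> ... -> a_n."""
    prob = initial[path[0]]
    for prev, nxt in zip(path, path[1:]):
        prob *= transition[prev][nxt]
    return prob


def conditional_on_history(path):
    """p(a_n | a_{n-1}; ...; a_1), obtained as a ratio of joint probabilities."""
    return path_probability(path) / path_probability(path[:-1])


# Normalization: the joint probabilities of all length-4 sequences sum to one.
total = sum(path_probability(path) for path in product("AB", repeat=4))

# Markov property: the same last step with two different histories.
history_1 = conditional_on_history(("A", "A", "B", "A"))
history_2 = conditional_on_history(("B", "B", "B", "A"))
print(total, history_1, history_2, transition["B"]["A"])
```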


10.4.2 Markov processes 

In a Markov process, the passage to the state $(a_{n+1}, t_{n+1})$ depends on the current state
$(a_n, t_n)$, but not on the history that got us there, $(a_{n-1}, t_{n-1}; \dots)$. Put differently, a
Markov process lacks memory.

Markov processes owe their popularity to the fact that they optimally compromise between
descriptive power and analytical tractability; if possible, one will try to (approximately)
reduce a given process to an effective Markovian process. In doing so, it is important to keep
in mind that$^{22}$ the Markovian property holds only in an approximate sense. In modeling a
process, the update times $\Delta t = t_{n+1} - t_n$ have to be chosen long enough to eradicate the short
time memory of a process, yet short enough not to lose relevant aspects of the dynamics.
(For example, in the process of Brownian motion, $\Delta t$ must be chosen to be much$^{23}$
larger than microscopic collision times.)

Whether or not a process assumes a Markovian form generally depends on the coordinates
used. For example, in a process where a velocity-like variable evolves in a Markovian process,
the corresponding coordinates - the integrals of velocity over time - may store long time
memory and, therefore, be non-Markovian. (To see this, imagine a process of Brownian
motion represented in coordinate space, $q_1 \to q_2 \to \dots \to q_{n-2} \to q_{n-1} \to q_n$, where the
coordinates are recorded at equal time steps. If the spacing $|q_{n-1} - q_{n-2}|$ was exceptionally
large, the particle will have high velocity. Chances then are that the next step $q_{n-1} \to q_n$ will
also be large. In other words, the conditional probability $p(q_n | q_{n-1}, q_{n-2}, \dots) \neq p(q_n | q_{n-1})$
and the process is not Markovian. Exercise: consider why velocity is a variable with less
memory.) For a given process one will naturally try to identify coordinates that make it as
“Markovian” as possible.

22 For a more substantial in-depth discussion, see van Kampen.$^{21}$

23 To understand the meaning of “much larger,” suppose that $\Delta t = \mathcal{O}(\text{collision time})$. If at a given instant, $t_n$,
the real space coordinate difference $|x_n - x_{n-1}|$ happens to be atypically large, we know that the velocity at
$t_n$ must have been high. It is then likely that $|x_{n+1} - x_n|$ will also be large, i.e. the probability to reach $x_{n+1}$
depends not just on the conditional probability $p(x_{n+1}, t_{n+1} | x_n, t_n)$, but also on the history of events before
$(x_n, t_n)$. Averaging over a large number of collisions is necessary to average out this memory effect.


Chapman-Kolmogorov relation and master equation 
The Markovian nature of a process can be expressed in a manner alternative to Eq. (10.36).
Consider the probability to observe $a$ at $t$ given initial data $(a_i, t_i)$ and some intermediate
event $(a', t')$, $p(a, t; a', t' | a_i, t_i)$. From this object, the “transition probability” $p(a, t | a_i, t_i)$
is obtained by integration over all realizations of the intermediate event,

$$ p(a, t | a_i, t_i) = \int da'\, p(a, t; a', t' | a_i, t_i) = \int da'\, p(a, t | a', t'; a_i, t_i)\, p(a', t' | a_i, t_i), $$

where we have used Eq. (10.7). For a Markovian process, $p(a, t | a', t'; a_i, t_i) = p(a, t | a', t')$,
this reduces to the Chapman-Kolmogorov relation

$$ p(a, t | a_i, t_i) = \int da'\, p(a, t | a', t')\, p(a', t' | a_i, t_i). \tag{10.37} $$


The usefulness of Eq. (10.37) becomes evident if we consider the case $|t - t'| \ll |t - t_i|$.
Eq. (10.37) then factorizes the description of the process into its three most relevant
components: (a) our quantity of interest, i.e. the probability to get to $a$ given that we started at
$a_i$, (b) the probability to reach an intermediate stage $a'$ shortly before the final time, $t$,
and (c) the short time transition probability $a' \to a$ (see figure). This
iterative, or “transfer matrix type” description of the process stands as the basis of most
theories of Markovian dynamics.
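
For a discrete state space and discrete time steps, the Chapman-Kolmogorov relation becomes a matrix product, and the “transfer matrix” picture can be made concrete. The following sketch assumes a hypothetical three-state process with a time-independent one-step kernel. It builds the $n$-step transition probability by repeatedly appending one short step, and compares the result with an explicit sum over all intermediate paths.

```python
from itertools import product

# T[a][b] = p(a, t+1 | b, t): one-step probability to go from state b to state a.
# The numbers are hypothetical; each column sums to one.
T = [[0.8, 0.3, 0.0],
     [0.2, 0.5, 0.4],
     [0.0, 0.2, 0.6]]


def compose(left, right):
    """Chapman-Kolmogorov step: sum over the intermediate state a'."""
    size = len(left)
    return [[sum(left[a][c] * right[c][b] for c in range(size))
             for b in range(size)] for a in range(size)]


def n_step_kernel(one_step, n):
    """p(a, t+n | a_i, t), built by appending one short step at a time."""
    size = len(one_step)
    kernel = [[1.0 if a == b else 0.0 for b in range(size)] for a in range(size)]
    for _ in range(n):
        kernel = compose(one_step, kernel)
    return kernel


def brute_force_kernel(one_step, n):
    """The same object from an explicit sum over all intermediate paths."""
    size = len(one_step)
    kernel = [[0.0] * size for _ in range(size)]
    for path in product(range(size), repeat=n + 1):
        weight = 1.0
        for prev, nxt in zip(path, path[1:]):
            weight *= one_step[nxt][prev]
        kernel[path[-1]][path[0]] += weight
    return kernel


k5 = n_step_kernel(T, 5)
k5_paths = brute_force_kernel(T, 5)
k5_split = compose(n_step_kernel(T, 2), n_step_kernel(T, 3))
```

The three matrices `k5`, `k5_paths`, and `k5_split` agree up to rounding: the position of the intermediate time $t'$ is arbitrary, which is the discrete counterpart of Eq. (10.37).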

To push this iterative approach somewhat further, consider what happens if $t - t' = \delta t$
becomes (infinitesimally) small. To a very good approximation, $p(a, t | a', t - \delta t) \simeq \delta(a - a') + \mathcal{O}(\delta t)$
will then be stationary. The term of $\mathcal{O}(\delta t)$ comprises losses out of $a'$ due to
transitions into some state $a''$, and gain at $a$ due to transitions from $a'$. Denoting the probability
of transitions $a_1 \to a_2$ by $W(a_2 | a_1)\, \delta t$, we may thus write

$$ p(a, t | a', t - \delta t) \simeq \Big( 1 - \delta t \int da''\, W(a'' | a') \Big)\, \delta(a - a') + \delta t\, W(a | a'). $$

Figure 10.6 The evolution of Markovian probability, as described by the master equation: probability
gain of states $a_i$ (here considered as discrete entities, while the text uses continuum notation)
due to in-processes is counteracted by loss due to out-processes. The sum over all feeds cancels,
reflecting probability conservation.


Substituting this expression into Eq. (10.37), taking the limit $\delta t \to 0$ and suppressing the
initial time argument for notational simplicity, $p(a, t | a_i, t_i) \equiv p(a, t)$, we then obtain the
so-called master equation$^{24}$ (see Fig. 10.6),

$$ \partial_t p(a, t) = \int da'\, \big[ W(a | a')\, p(a', t) - W(a' | a)\, p(a, t) \big]. \tag{10.38} $$

Notice that the master equation conserves the total probability, i.e. $\partial_t \int da\, p(a, t) = 0$.
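
For a finite number of states the integral in the master equation becomes a sum, and its gain-loss structure can be explored directly. The sketch below assumes three discrete states with hypothetical transition rates and integrates the equation by explicit Euler steps. The total probability stays at unity and the distribution relaxes to a stationary state.

```python
def master_rhs(p, W):
    """Right-hand side of the master equation; W[a][b] is the rate for b -> a."""
    n = len(p)
    return [sum(W[a][b] * p[b] - W[b][a] * p[a] for b in range(n))
            for a in range(n)]


def evolve(p0, W, dt, steps):
    """Explicit Euler integration of d_t p = master_rhs(p, W)."""
    p = list(p0)
    for _ in range(steps):
        rhs = master_rhs(p, W)
        p = [pa + dt * ra for pa, ra in zip(p, rhs)]
    return p


# Hypothetical rates between three states (diagonal entries are unused).
W = [[0.0, 1.0, 0.2],
     [0.5, 0.0, 0.7],
     [0.3, 0.4, 0.0]]
p0 = [1.0, 0.0, 0.0]

p_t = evolve(p0, W, dt=0.01, steps=2000)
total = sum(p_t)                                    # conserved total probability
residual = max(abs(r) for r in master_rhs(p_t, W))  # vanishes in the stationary state
print(p_t, total, residual)
```

Probability conservation holds step by step because the gain of one state is exactly the loss of another: summing the right-hand side over all states gives zero, whatever the rates.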


INFO In general, little can be said about the master equation. An important exception regards
the master equation for physical systems that are closed and isolated in that no matter is
exchanged with the outer world, and energy is conserved. The attribute “physical” means that
we consider systems whose microscopic dynamics is governed by some Hamiltonian. At large
time scales, the solution of the master equation will relax to a time-independent equilibrium
distribution function $p_{\rm eq}(a)$, typically a Maxwell-Boltzmann distribution of the “energy”
corresponding to the state $a$. We thus have $0 = \int da'\, [W(a | a')\, p_{\rm eq}(a') - W(a' | a)\, p_{\rm eq}(a)]$. This equation
should be interpreted as an effective sum rule obeyed by the transition matrix elements $W(a | a')$.
Using the invariance of microscopic Hamiltonian dynamics under time-reversal, it can in fact
be shown$^{24}$ that the above balance equation not only holds in an integral sense, but even for
individual pairs $(a, a')$, i.e. $W(a | a')\, p_{\rm eq}(a') = W(a' | a)\, p_{\rm eq}(a)$, or

$$ \frac{W(a | a')}{W(a' | a)} = \frac{p_{\rm eq}(a)}{p_{\rm eq}(a')}. \tag{10.39} $$

Equation (10.39) is known as the principle of detailed balance.

24 The term “master equation” was coined in A. Nordsieck, W. E. Lamb, and G. E. Uhlenbeck, On the theory
of cosmic-ray showers I: the Furry model and the fluctuation problem, Physica 7, 344-60 (1940), where a rate
equation similar to Eq. (10.38) appeared as a fundamental equation of the theory.

The principle of detailed balance constrains the equilibrium behavior of systems containing
different compounds (as represented by the state variables $a$). For example, it states that, in a
solution containing several chemical agents, the equilibrium reaction rate $A \to B$ between any
one pair of agents $A$ and $B$ exactly compensates for the back
reaction rate $B \to A$.
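
A concrete set of rates obeying detailed balance may help to fix ideas. The sketch below uses Metropolis-type rates, $W(a | a') = \min(1, e^{-(E_a - E_{a'})/T})$, a standard choice from Monte Carlo simulation that is not discussed in the text. The level energies are hypothetical and units with $k_B = 1$ are assumed. The code checks the pairwise balance condition against the Boltzmann weights.

```python
import math

energies = [0.0, 0.7, 1.5, 2.2]   # hypothetical level energies
temperature = 1.0                 # units with k_B = 1


def boltzmann_weights(energies, temperature):
    """Normalized equilibrium distribution p_eq(a) ~ exp(-E_a / T)."""
    weights = [math.exp(-e / temperature) for e in energies]
    z = sum(weights)
    return [w / z for w in weights]


def metropolis_rate(e_to, e_from, temperature):
    """Rate W(to | from) = min(1, exp(-(E_to - E_from) / T))."""
    return min(1.0, math.exp(-(e_to - e_from) / temperature))


p_eq = boltzmann_weights(energies, temperature)
n = len(energies)
W = [[metropolis_rate(energies[a], energies[b], temperature) if a != b else 0.0
      for b in range(n)] for a in range(n)]

# Largest violation of W(a|a') p_eq(a') = W(a'|a) p_eq(a) over all pairs.
violation = max(abs(W[a][b] * p_eq[b] - W[b][a] * p_eq[a])
                for a in range(n) for b in range(n))
print(violation)  # zero up to rounding errors
```

Since every pair of states is balanced individually, the Boltzmann distribution is automatically a stationary solution of the master equation with these rates.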


EXERCISE Kangaroo process: Consider the master equation,

$$ \partial_t p(a, t) = \int da'\, \big[ W(a | a')\, p(a', t) - W(a' | a)\, p(a, t) \big], $$

with a factorizable transition matrix $W(a | a') = u(a)\, v(a')$. What is kangaroo-like for the above
transition matrix? Solve the master equation with the help of the Laplace transform
$p(a, z) = \int_0^\infty dt\, e^{izt}\, p(a, t)$. In a first step, show that the following relation holds

$$ p(a, z) = \frac{p(a, t = 0) + u(a)\, \sigma(z)}{-iz + v(a)/\tau}, $$

where $1/\tau = \int da\, u(a)$ and $\sigma(z) = \int da\, v(a)\, p(a, z)$. Determine $\sigma(z)$ by substituting the expression
for $p(a, z)$ and solving the resulting algebraic equation.

Kubo-Anderson process: Taking $v(a) = v$, simplify the resulting equation for $p(a, z)$ and
compute the inverse Laplace transform to obtain the time evolution of the distribution $p(a, t)$.
Compare with the Ornstein-Uhlenbeck process (Problem 10.9.2).


Example: Gaussian process 

Let us illustrate the general concepts introduced above on two important examples. Consider
a one-dimensional random walk, i.e. a process wherein a particle performs random motion
on a one-dimensional lattice (see figure). Setting the lattice spacing to unity, the state variable
$a = n \in \mathbb{Z}$ takes integer values, and our goal is to compute the probability
$p(n, t | n_0, t_0)$. Defining the probability for an individual left-turn (right-turn) during the time
interval $dt$ as $q\, dt$ ($(1 - q)\, dt$), the master equation assumes the form

$$ \partial_t p(n, t) = q\, p(n + 1, t) + (1 - q)\, p(n - 1, t) - p(n, t). $$



Anticipating smoothness of the probability distribution we may convert this equation into
a differential equation. To this end we assume (without altering the problem too much)
that the process takes place on a lattice of $N$ points subject to periodic boundary conditions.
Defining the scaling variable $x = n/N \in [0,1]$ and a rescaled probability distribution
$\tilde p(x)\,dx = p(Nx)\,dn = p(Nx)(dn/dx)\,dx \Rightarrow \tilde p(x) = N p(Nx)$, and Taylor expanding,
$\tilde p(x \pm 1/N) = \tilde p(x) \pm N^{-1}\partial_x \tilde p(x) + (1/2)N^{-2}\partial_x^2 \tilde p(x)$,
the master equation assumes the form of a generalized diffusion equation (for notational
simplicity, we write $p$ instead of $\tilde p$),

$$\left(\partial_t + \gamma\,\partial_x - D\,\partial_x^2\right) p(x,t) = 0, \qquad p(x,t_0) = \delta(x - x_0),$$

with “drift coefficient” $\gamma = (1-2q)N^{-1}$, diffusion constant $D = 1/(2N^2)$, and initial point
$x_0 = n_0/N$. Setting $t_0 = 0$, this equation is solved by

$$p(x,t) = \int \frac{dk}{2\pi}\, e^{-(i\gamma k + Dk^2)t}\, e^{ik(x-x_0)} = \frac{1}{2(\pi D t)^{1/2}} \exp\left(-\frac{(x - x_0 - \gamma t)^2}{4Dt}\right),$$

or, in terms of the original variables,

$$p(n,t) = \frac{1}{(2\pi t)^{1/2}} \exp\left(-\frac{(n - n_0 - (1-2q)t)^2}{2t}\right).$$




(Exercise: consider how you would solve the structurally similar Schrödinger equation of a
free particle.) General comments on Gaussian processes:


> Gaussian or “normal” distributions typically arise when large numbers of statistically
independent events add to form a composite random variable. (In cases where the composite
variable is obtained by multiplication of elementary random variables, logarithmically
normal distributions result.)

> The Gaussian form of the probability distribution is a manifestation of the central limit
theorem: we may think of $n$ as a random variable obtained by summing $t$ elementary
random variables $\pm 1$ drawn from a bimodal distribution with probability $p(1) = 1 - q$
and $p(-1) = q$. According to the central limit theorem, the result will be a Gaussian
distributed variable centered around the mean $n_0 + (1-2q)t$, with width $\sim \sqrt{t}$
(i.e. variance $\sim t$).

> Random processes whose distributions are Gaussian are generally called Gaussian processes.
The example illustrates that Gaussian processes are typically the results of the
addition of a large number of elementary random variables.
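The drift and the diffusive broadening of the random walk can be checked by integrating the lattice master equation directly. The sketch below uses a plain forward-Euler time step on a periodic lattice; the lattice size, step size, and the value of $q$ are hypothetical choices made for illustration only.

```python
def evolve_random_walk(q, t_final, n_sites=121, n0=60, dt=0.002):
    """Forward-Euler integration of d_t p(n) = q p(n+1) + (1-q) p(n-1) - p(n).

    Periodic boundary conditions are used, as in the text. The lattice is
    chosen large enough that wrap-around is negligible for the times considered.
    """
    p = [0.0] * n_sites
    p[n0] = 1.0  # initial condition p(n, 0) = delta_{n, n0}
    for _ in range(int(round(t_final / dt))):
        p = [
            p[n] + dt * (q * p[(n + 1) % n_sites]
                         + (1.0 - q) * p[(n - 1) % n_sites]
                         - p[n])
            for n in range(n_sites)
        ]
    return p

def mean_and_variance(p):
    """First moment and second cumulant of a lattice distribution."""
    mean = sum(n * w for n, w in enumerate(p))
    var = sum((n - mean) ** 2 * w for n, w in enumerate(p))
    return mean, var

q, t_final, n0 = 0.3, 10.0, 60
p = evolve_random_walk(q, t_final, n0=n0)
mean, var = mean_and_variance(p)
print("mean    :", mean, " expected:", n0 + (1 - 2 * q) * t_final)
print("variance:", var, " expected:", t_final)
```

The mean follows $n_0 + (1-2q)t$ and the variance grows linearly in $t$, so the width grows as $\sqrt{t}$, in line with the Gaussian solution above.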


Example: Poisson process 


Now consider a sequence of elementary events occurring in a manner uncorrelated in time
(e.g. apples falling off a tree on an autumn day, signals of a Geiger counter exposed to weak
radiation, etc.). The random variable of interest is the number, $n$, of events occurring in a
certain time $t$ (see figure). Assuming that the probability of an event to occur in a short time
window $dt$ is $\nu\,dt$, the master equation takes the form

$$\partial_t p(n,t) = \nu\,p(n-1,t) - \nu\,p(n,t). \tag{10.40}$$

For definiteness, we assume that this equation has to be solved with the boundary condition
$p(n,t=0) = \delta_{n,n_0}$.

Again, we may approximately solve this equation by assuming smoothness of the distri- 
bution and Taylor expansion. However, this procedure is a little bit too coarse to resolve 
the specific statistics relating to the discreteness of events. (Exercise: solve the equation by 
Taylor expansion and compare with the results below.) In the following, we employ concepts 
borrowed from quantum mechanics to construct a more accurate solution. Later on, we see 
that specific aspects of stochastic processes - the dynamics of very rare events, to mention 
one example - afford a general description in terms of “quantum” theory. (The quotes are 
used here because, of course, there is no true $\hbar$ involved.)

We start by rewriting the equation as $\partial_t p(n,t) = \nu(\hat E_{-1} - 1)\,p(n,t)$, where $\hat E_m$ is the
translation operator in discrete $n$-space, i.e. $\hat E_m f(n) = f(n+m)$. Now, the translation
operator affords a representation in terms of the “momentum” operator $\hat\phi$ conjugate to the
“number operator” $\hat n$: with $[\hat\phi, \hat n] = -i$, we have $\hat E_m = e^{im\hat\phi}$. In the “$n$-representation,”
the translation operator $\hat E_{-1}$ takes the form $\hat E_{-1} = \exp(-\partial_n)$. Its eigenfunctions are given
by $f_k(n) = (2\pi)^{-1/2}\exp(ikn)$ with eigenvalue $\lambda_k = \exp(-ik)$. Finally, the initial configuration
of the probability distribution affords the spectral representation
$p(n,0) = \delta_{n,n_0} = \int_0^{2\pi} \frac{dk}{2\pi}\, e^{ik(n-n_0)}$. By analogy with the solution of a quantum initial
value problem, we may thus represent $p(n,t)$ as

$$p(n,t) = \int_0^{2\pi} \frac{dk}{2\pi}\, e^{-\nu t\,(1 - e^{-ik})}\, e^{ik(n-n_0)}.$$

Using the identity $\int_0^{2\pi} \frac{dk}{2\pi}\, e^{ik\,\Delta n} = \delta_{\Delta n,0}$, a straightforward Taylor expansion of the
exponent obtains the result (exercise)

$$p(n,t) = \frac{(\nu t)^{n-n_0}}{(n-n_0)!}\, e^{-\nu t}. \tag{10.41}$$


> The process is described by a Poisson distribution (cf. Eq. (10.4), where $\nu t \leftrightarrow np$ represents
the product of “attempts” ($t \leftrightarrow n$) and “success probability” ($\nu \leftrightarrow p$)), hence the name
Poisson process.

> The first and second cumulant of the Poisson distribution are given by, respectively,
$\mu_1 = \nu t$, $\mu_2 = \nu t$, i.e. centered around $\mu_1 = \nu t$, the width of the distribution is given by
$\sim (\nu t)^{1/2}$.

> The variable $t$ is a bookkeeping index, which need not necessarily be physical time. For
example, the statistics of energy levels on the energy axis of integrable quantum systems
is usually described by a Poisson distribution, which reflects the lack of inter-level
correlations.
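The spectral representation of the Poisson process can be verified numerically by discretizing the $k$-integral over $[0, 2\pi)$ and comparing with Eq. (10.41). The following sketch assumes a uniform grid of `n_k` points (a hypothetical choice); for a periodic integrand this simple sum is accurate as long as `n_k` is much larger than the values of $n - n_0$ carrying appreciable weight.

```python
import cmath
import math

def poisson_from_fourier(n, nu_t, n0=0, n_k=256):
    """Evaluate p(n,t) = int_0^{2 pi} dk/(2 pi) exp(nu t (e^{-ik} - 1)) e^{ik(n - n0)}.

    The integral is approximated by a uniform Riemann sum over n_k points.
    """
    total = 0.0 + 0.0j
    for j in range(n_k):
        k = 2.0 * math.pi * j / n_k
        total += cmath.exp(nu_t * (cmath.exp(-1j * k) - 1.0)) * cmath.exp(1j * k * (n - n0))
    return (total / n_k).real

def poisson_exact(n, nu_t, n0=0):
    """Closed form (nu t)^(n - n0) e^{-nu t} / (n - n0)!, evaluated via logarithms."""
    m = n - n0
    if m < 0:
        return 0.0
    return math.exp(m * math.log(nu_t) - nu_t - math.lgamma(m + 1))

nu_t = 5.0
for n in (0, 3, 5, 12):
    print(n, poisson_from_fourier(n, nu_t), poisson_exact(n, nu_t))
```

The discretized integral also returns (numerically) zero for $n < n_0$, reflecting the fact that the process only counts upwards.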

EXERCISE One-step processes: The master equation of a one-step process is given by

$$\dot p_n = r_{n+1}\,p_{n+1} + s_{n-1}\,p_{n-1} - (r_n + s_n)\,p_n.$$





To solve the master equation it is useful to make use of the generating function
$g(z,t) = \sum_{n=-\infty}^{\infty} z^n p_n(t)$, where $g(1,t) = 1$ reflects the conservation of probability.
Show that for a given $g$, the probability follows from the inverse Mellin transform

$$p_n(t) = \frac{1}{2\pi i}\oint dz\, z^{-1-n}\, g(z,t), \tag{10.42}$$

where the integral is taken around a contour enclosing $z = 0$ (but no singularities of $g$).

(a) Symmetric random walk: Consider the special case $r_n = s_n = 1$ with the initial condition
$p_n(0) = \delta_{n,0}$. From the master equation, derive a differential equation for $g(z,t)$ and show
that its solution is $g(z,t) = \exp((1/z + z - 2)t)$. Solve the integral (10.42) in the saddle-point
approximation and show that the resulting behavior for large times $t \to \infty$ is diffusive,
$p_n(t) \simeq e^{-n^2/4t}/\sqrt{4\pi t}$.

(b) Furry process: Consider the case $r_n = 0$ and $s_n = \gamma n$ with the initial condition
$p_n(0) = \delta_{n,1}$. Determine $g(z,t)$. You can solve its partial differential equation with the help of
the method of characteristics. Using Eq. (10.42) show that the probabilities for $n > 0$ are given by
$p_n(t) = e^{-\gamma t}(1 - e^{-\gamma t})^{n-1}$.

(c) Population growth: Consider the case $r_n = \alpha n$ and $s_n = \beta n$ with the initial condition
$p_n(0) = \delta_{n,m}$. Show that the probability that the population dies out at time $t$ is given by
$p_0(t) = \left[\frac{\alpha(1-\epsilon)}{\beta - \alpha\epsilon}\right]^m$, where $\epsilon = e^{(\alpha-\beta)t}$. Discuss the short and long time limits.
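Closed-form results of this kind are easy to cross-check by integrating the one-step master equation numerically. As a minimal sketch, the block below treats the Furry process of part (b) with a standard fourth-order Runge-Kutta step; the truncation `n_max`, the step size, and the parameter values are hypothetical. Since the process only feeds probability upwards, truncating the state space does not affect $p_n$ for $n \le$ `n_max`.

```python
import math

def furry_rhs(p, gamma):
    """Right-hand side of dp_n/dt = gamma (n-1) p_{n-1} - gamma n p_n (r_n = 0, s_n = gamma n)."""
    out = [0.0] * len(p)
    for n in range(1, len(p)):
        out[n] = gamma * (n - 1) * p[n - 1] - gamma * n * p[n]
    return out

def rk4_step(p, dt, gamma):
    """One classical Runge-Kutta step for the vector of probabilities."""
    k1 = furry_rhs(p, gamma)
    k2 = furry_rhs([x + 0.5 * dt * k for x, k in zip(p, k1)], gamma)
    k3 = furry_rhs([x + 0.5 * dt * k for x, k in zip(p, k2)], gamma)
    k4 = furry_rhs([x + dt * k for x, k in zip(p, k3)], gamma)
    return [x + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
            for x, a, b, c, d in zip(p, k1, k2, k3, k4)]

def furry_numeric(gamma, t_final, n_max=40, dt=0.001):
    """Integrate the master equation from p_n(0) = delta_{n,1}."""
    p = [0.0] * (n_max + 1)
    p[1] = 1.0
    for _ in range(int(round(t_final / dt))):
        p = rk4_step(p, dt, gamma)
    return p

def furry_exact(n, gamma, t):
    """Closed form p_n(t) = e^{-gamma t} (1 - e^{-gamma t})^(n-1) for n >= 1."""
    return math.exp(-gamma * t) * (1.0 - math.exp(-gamma * t)) ** (n - 1)

gamma, t_final = 1.0, 1.0
p_num = furry_numeric(gamma, t_final)
max_error = max(abs(p_num[n] - furry_exact(n, gamma, t_final)) for n in range(1, 21))
print("largest deviation for n = 1..20:", max_error)
```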


10.4.3 Fokker-Planck equation II 

The master equation (10.38) is a linear integral equation for the probability density, $p$. It
can be formally solved in terms of a spectral decomposition of the integral kernel $\{W(a|a')\}$
(exercise). However, in cases where $W(a|a')$ exhibits nontrivial dependence on the state
variables $a$ (which may be multi-dimensional!), an explicit eigenmode decomposition of $p$
may not be attainable.²⁵ However, in many cases of practical interest:

> The integral kernel $W(a|a')$ decays rapidly as a function of the “distance” $|a - a'|$, i.e.
for sufficiently short times $\delta t$, transitions between remote states are strongly suppressed.
(Here, we assume that state space $\{a\}$ comes with a meaningful notion of distance, which
usually is the case.)

> We may anticipate smoothness of the distribution $p(a)$ on the scales of variation of $W$.

Under these conditions, a Kramers-Moyal expansion can be applied to reduce the
integral equation to the more benign form of a quasi-local equation (i.e. a differential equation).
Repeating the sequence of steps carried out in Section 10.2.3, we change notation
$W(a|a') = W_b(a')$, $b = a - a'$, where $W_b(a')$ is the transition probability for $a' \to a' + b$. In
this notation,

$$\partial_t p(a) = \int db\, \left[(W_b\, p)(a - b) - (W_{-b}\, p)(a)\right].$$


Although a numerical diagonalization of W may still be the most efficient route to obtain the (long time) profile 





Using the relation $\int db\,(W_b(a) - W_{-b}(a)) = 0$, Taylor expansion of $(W_b\, p)(a - b)$ in $b$ around
$a$ (the index in $W_b$ remains passive!) obtains the series

$$\partial_t p(a,t) = \sum_{n=1}^{\infty} \frac{(-1)^n}{n!}\, \partial_a^n \left(\alpha_n\, p\right)(a,t), \tag{10.43}$$

where the coefficients $\alpha_n$ are the moments of the transition probabilities,

$$\alpha_n(a) = \int db\, W_b(a)\, b^n. \tag{10.44}$$

The Fokker-Planck approximation

$$\partial_t p(a,t) = \left(-\partial_a\, \alpha_1(a) + \frac{1}{2}\,\partial_a^2\, \alpha_2(a)\right) p(a,t), \tag{10.45}$$

with $\alpha_1 = \int db\, W_b(a)\, b$ and $\alpha_2 = \int db\, W_b(a)\, b^2$, assumes that all terms beyond $n = 2$ are
negligible. As exemplified in Section 10.2.3 above:

> The first derivative, or drift term $\sim \partial_a \alpha_1 p$ is “deterministic” in that it describes the
evolution of the averaged variable $\langle a \rangle$. Indeed,

$$\partial_t \langle a \rangle = \int da\, a\, \partial_t p(a,t) = \int da\, a \left(-\partial_a\, \alpha_1(a) + \frac{1}{2}\,\partial_a^2\, \alpha_2(a)\right) p(a,t) = \int da\, \alpha_1(a)\, p(a,t) = \langle \alpha_1 \rangle,$$

where, to prove this identity, we have applied an integration by parts.

> The second derivative, or diffusion term describes the diffusive spread of probability
around the center of the distribution, $\langle a \rangle$.
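For processes with a discrete set of jump sizes, the integral in Eq. (10.44) reduces to a sum, and the Kramers-Moyal coefficients can be tabulated directly. The sketch below does this for the two examples of Section 10.4.2; the helper name and the numerical values of $q$ and $\nu$ are hypothetical.

```python
def kramers_moyal_moments(jump_rates, n_max):
    """Return [alpha_1, ..., alpha_{n_max}] with alpha_n = sum_b W_b * b**n.

    jump_rates maps each jump size b to its rate W_b (taken to be
    independent of the state a, as in the two examples of the text).
    """
    return [sum(rate * b ** n for b, rate in jump_rates.items())
            for n in range(1, n_max + 1)]

q = 0.3    # hypothetical left-turn probability of the random walk
nu = 2.0   # hypothetical event rate of the Poisson process

walk_moments = kramers_moyal_moments({+1: 1.0 - q, -1: q}, 4)
poisson_moments = kramers_moyal_moments({+1: nu}, 4)

print("random walk:", walk_moments)     # alpha_1 = 1 - 2q, alpha_2 = 1, ...
print("Poisson    :", poisson_moments)  # alpha_n = nu for all n
```

In both cases the coefficients beyond $n = 2$ do not vanish. Truncating the series at second order is therefore an approximation whose quality rests on the smoothness of $p$, not on the smallness of the higher moments themselves.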

In many cases, the Fokker-Planck approximation does an excellent job at describing the 
probability distributions of stochastic processes. However, as with any other approximation 
scheme, it is not perfect. Some care must be exercised, e.g. if one is asking questions about 
the tails of probability distributions, “rare event” regions where p has become small. 
In the next section, we discuss an example illustrating success and partial failure of the 
Fokker-Planck approximation in a very simple setting. 


10.4.4 Quality of the Fokker-Planck approximation: an example

The Fokker-Planck approximation is routinely applied as a tool to explore analytically the 
dynamical evolution of probability distributions. It is therefore important to identify those 
situations where the approximation goes qualitatively wrong. 

Empirically, large deviations between the predictions of the Fokker-Planck equation and 
results obtained by (numerical) iteration of the Master equation are frequently observed in 
the tails of the probability distribution. Often these tails carry information about “rare” 
events far from the typical behavior of the system. Such rare events can be of profound 
importance. For example, in cases where the master equation describes the epidemic proliferation
of a virus, one will likely be interested in the probability of exceptionally persistent
infections. Equally, in a master equation modeling a nuclear chain reaction, the eventuality
of an uncontrolled chain reaction, no matter how unlikely, will be of definite interest, etc.

In the following, we explore the shortcomings of the Fokker-Planck approximation for the 
familiar example of the Poisson process. In Section 10.4.2 we introduced the master equation 
of the Poisson process and obtained Eq. (10.41) for its rigorous solution. Now, let us see 
what we would have got instead had we subjected the master equation to a Kramers-Moyal 
expansion. 

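Introducing a scaled variable, $x = n/N$, $N \gg 1$, with associated continuum probability
distribution $\tilde p(x,t)\,dx = p(n,t)\,dn \Rightarrow \tilde p(x,t) = p(xN,t)\,N$, we may subject the equation to
a second order Kramers-Moyal expansion. For later reference, we note that this expansion
amounts to second order expansion of the translation operator
$\hat E_{-1} - 1 = e^{-\partial_x/N} - 1 \simeq -\frac{1}{N}\partial_x + \frac{1}{2N^2}\partial_x^2$. As a result, we obtain the Fokker-Planck equation
(writing $p$ instead of $\tilde p$)

$$\left(\partial_t + \frac{\nu}{N}\,\partial_x - \frac{\nu}{2N^2}\,\partial_x^2\right) p(x,t) = 0, \tag{10.46}$$

with $p(x,0) = \delta(x)$. This equation is solved by
$p(x,t) = N \left(\frac{1}{2\pi\nu t}\right)^{1/2} \exp\left(-\frac{1}{2\nu t}(Nx - t\nu)^2\right)$
or, upon translating back to the original variable $n$,

$$p(n,t) = \left(\frac{1}{2\pi\nu t}\right)^{1/2} \exp\left(-\frac{(n - \nu t)^2}{2\nu t}\right).$$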


This “Gaussian approximation” displays the two principal characteristics of the Poisson
distribution, mean value $\nu t$ and width $\sim (\nu t)^{1/2}$. However, in the tails of the distribution,
the approximation is poor. This is illustrated in (the logarithmic representation of) the
figure: at values of $n$ about four times bigger than the mean value, in the far tails of the
distribution, the Fokker-Planck prediction is by many orders of magnitude off the true result.
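The size of the discrepancy is easily quantified by comparing the logarithms of the exact Poisson distribution, Eq. (10.41) with $n_0 = 0$, and of its Gaussian approximation. The value $\nu t = 10$ used below is a hypothetical choice made for illustration.

```python
import math

def log_poisson(n, nu_t):
    """ln of the exact result (nu t)^n e^{-nu t} / n!."""
    return n * math.log(nu_t) - nu_t - math.lgamma(n + 1)

def log_gaussian(n, nu_t):
    """ln of the Fokker-Planck (Gaussian) approximation with mean and variance nu t."""
    return -0.5 * math.log(2.0 * math.pi * nu_t) - (n - nu_t) ** 2 / (2.0 * nu_t)

nu_t = 10.0
for n in (10, 20, 40):
    exact = log_poisson(n, nu_t)
    approx = log_gaussian(n, nu_t)
    print(f"n = {n:3d}   ln p_exact = {exact:8.3f}   ln p_FP = {approx:8.3f}")
```

Near the mean the two logarithms nearly coincide, while at $n = 4\nu t$ they differ by a large additive amount, i.e. the probabilities themselves differ by many orders of magnitude.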

The origin of this error can be traced to the second order expansion of the operator
$\hat E_{-1} = e^{-\partial_x/N}$. Doing so, and replacing this operator by an effective long-range approximation
wherein the “momentum operator” $-i\partial_x$ is considered small, we forget about the integer
nature of the variable $n$. (Exercise: why?) To obtain a better result in the far tail regions, we
ought to keep the integrity of this operator intact. This makes the theory more complicated.
However, pushing the quantum analogy further, and thinking of $p(n,t) = p(n,t|0,0)$ as
a “transition amplitude” we may turn the smallness of $p$ to an advantage: using a path
integral oriented language, and writing $p \sim \int \exp(-S)$ as the integral over an exponentiated
“action”, we know that $S \gg 1$ in the far tails. This suggests that semiclassical approximation
schemes might be applicable. In the following, we formulate a semiclassical approach to the
Poisson process, and demonstrate that it does an excellent job of describing the tail regions.

Defining the “Hamiltonian operator”, $\hat H = -\nu\left(e^{-i\hat p} - 1\right)$, the master equation assumes
the form of a Schrödinger equation for an imaginary time evolution operator,

$$(\partial_t + \hat H)\, p(n,t) = 0, \qquad p(n,0) = \delta_{n,0}.$$

Here, we are working in the system of original variables $n$, instead of the rescaled coordinate
$x$. This equation has the formal solution

$$p(n,t) = \langle n | e^{-t\hat H} | 0 \rangle = \int_{q(0)=0}^{q(t)=n} D(q,p)\; e^{\int_0^t dt'\,(i p \dot q - H(q,p))},$$

where $\langle n | n' \rangle = \delta_{n,n'}$ is the scalar product in $n$-space, and in the second equality we switched
to a path integral representation. To avoid the mixed appearance of real and imaginary
terms in the exponent, we subject the integration over the momentum variable to a Wick
rotation, $p \to -ip$. In the new variables,

$$p(n,t) = \int_{q(0)=0}^{q(t)=n} D(q,p)\; e^{\int_0^t dt'\,(p \dot q - H(q,p))}, \qquad H(q,p) = -\nu\left(e^{-p} - 1\right).$$

We evaluate this path integral in the crudest form of a stationary phase approximation,

$$p(n,t) \sim e^{-S[q,p]}, \qquad S[q,p] = -\int_0^t dt'\,(p \dot q - H(q,p)),$$


where $(q,p)(t')$ are solutions of the extremal equations with the configuration space boundary
conditions $q(0) = 0$, $q(t) = n$. Varying the action we obtain the equations of motion,

$$\partial_{t'} q = \partial_p H = \nu e^{-p}, \qquad \partial_{t'} p = -\partial_q H = 0.$$

The solution of these equations reads

$$p(t') = -\ln(n/\nu t), \qquad q(t') = n\,(t'/t).$$

The conserved energy is given by $H = -n/t + \nu$. It is instructive to take a brief look at the
“phase portrait” of these solutions shown in the figure above. Conservation of momentum
means that the flow is parallel to the $q$-axis with velocity $\nu e^{-p}$. At $p = 0$, $q(t') = \nu t'$,
which reflects the temporal evolution of the mean value $\langle n(t') \rangle$ of the Poisson process.
The action corresponding to this mean trajectory vanishes, $S(q, p = 0) = 0$. Within the
Hamiltonian approach it is possible to reach final configurations $q(t) = n$ very different
from $t\nu$. However, this comes at the price of a large action: the action of the solutions above
is readily obtained as


$$S[q,p] = n \ln(n/\nu t) - n + \nu t.$$

(Notice that this action assumes its minimum value $S = 0$ at $n = \nu t$.) The stationary phase
approximation to the probability distribution thus reads

$$p(n,t) \sim e^{-(n \ln n - n) + n \ln(\nu t) - \nu t} = e^{-(n \ln n - n)}\, (\nu t)^n\, e^{-\nu t} \simeq \frac{(\nu t)^n}{n!}\, e^{-\nu t},$$

where the last approximate equality is based on an “exponential” approximation to the
Stirling formula, $n! \simeq \sqrt{2\pi n}\; e^{n \ln n - n} \sim e^{n \ln n - n}$. The semiclassical approximation is
shown in the figure on p.641, dashed. In the center of the distribution, where $S \simeq 0$ and
semiclassical approximations are problematic, the Fokker-Planck equation does a better
job at approximating the Poisson distribution. However, in the tail regions, where $S \gg 1$,
semiclassics works well! The quality of the result may be improved by including quadratic
fluctuations around the mean field trajectory, which then leads to an improved variant of
Stirling’s approximation. However, to “exponential accuracy,” the present approximation is
sufficient.
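The statement about “exponential accuracy” can be made quantitative: the semiclassical action $S = n\ln(n/\nu t) - n + \nu t$ differs from the exact $-\ln p(n,t)$ only by the slowly varying Stirling correction $\frac{1}{2}\ln(2\pi n)$. A minimal sketch, with $\nu t = 10$ as a hypothetical illustration value:

```python
import math

def semiclassical_action(n, nu_t):
    """S = n ln(n / nu t) - n + nu t, the action of the extremal trajectory."""
    return n * math.log(n / nu_t) - n + nu_t

def exact_minus_log_p(n, nu_t):
    """-ln of the exact Poisson probability (nu t)^n e^{-nu t} / n!."""
    return math.lgamma(n + 1) - n * math.log(nu_t) + nu_t

nu_t = 10.0
for n in (10, 20, 40, 80):
    action = semiclassical_action(n, nu_t)
    exact = exact_minus_log_p(n, nu_t)
    stirling = 0.5 * math.log(2.0 * math.pi * n)
    print(f"n = {n:3d}   S = {action:8.3f}   -ln p = {exact:8.3f}   "
          f"difference = {exact - action:6.3f}   (1/2) ln(2 pi n) = {stirling:6.3f}")
```

In the far tails, where $S$ is large, the logarithmic correction is negligible in comparison; at the center, where $S = 0$, it is the entire answer, which is why the Gaussian approximation is the better tool there.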

The take home message of this exercise is that variational methods can be a powerful tool
in the study of “rare events” in a stochastic process. Significant deviations from the
Fokker-Planck approach are to be expected if the Hamiltonian of the process is not harmonic in
momenta (which means that the corresponding “Schrödinger equation” differs from the second
order Fokker-Planck equation) due to, e.g., discreteness of the agents participating in
the process. However, even if the Fokker-Planck equation is valid, the semiclassical analysis
can be a useful aid. The point is that the Fokker-Planck equation - mathematically
similar to an imaginary time Schrödinger equation - usually defies rigorous analytical solution.
Much as WKB-type semiclassics is useful in quantum mechanics, the “semiclassical”
approximation to a Fokker-Planck equation can be an economic route to obtain information
on large-action/rare-event statistics.


10.5 Field theory I: zero dimensional theories 

In the previous sections, we have introduced basic elements of classical nonequilibrium 
theory. We have seen that nonequilibrium systems can be described (see Fig. 10.7) on: 

> a microscopic level, i.e. in terms of microscopic Hamiltonians and their interactions, 

> a mesoscopic level, where the microscopic transition rates are lumped into either a 
stochastic differential equation of Langevin type, or a master equation, 

> a continuum level. In our discussion so far, the continuum description was formulated 
in terms of the Fokker-Planck equation, a second order partial differential equation in 
the state variables, and first order in time. 

Both from a formal and a conceptual point of view, the description of nonequilibrium phenomena
in terms of differential (Langevin, Fokker-Planck) or integral (master) equations
bears similarities to the Schrödinger approach to quantum mechanics. It requires comparatively
little theoretical machinery, affords a lean and reasonably intuitive description of
many physical phenomena, but also suffers from limitations. We had a first glimpse of the
deficiencies of the Fokker-Planck approach when we explored the tail regions of the Poisson
process: to stay with the analogy above, the tails turned out to be described better in terms
of a “path integral approach.”

Figure 10.7 Showing the three levels of description of nonequilibrium systems. [Diagram labels:
Langevin equation, master equation, Fokker-Planck equation.]

Generally speaking, much of what has been said in the introductory chapters of this 
text by way of motivation of path integral and field theory oriented approaches to genuine 
quantum systems carries over to the present context. Specifically: 


> In the description of rare events and large fluctuations, variational approaches and 
the analysis of extremal action configurations may be a powerful alternative to the solution 
of the Fokker-Planck equation. 

> So far, our discussion has been limited to statistical problems described in terms of only
a few random variables - the analog of “zero dimensional” quantum problems. However, 
many applications of interest are described in terms of many spatially resolved random 
variables - think of a field {n(r, t)} describing the local population size in a problem of 
evolutionary ecology. Much as in quantum field theory, field integration techniques are 
the method of choice to explore the ensuing phenomena of collective dynamics, criticality, 
etc. 

> The Langevin equations explored above have been linear equations, on account of the oscil- 
lator dynamics of the underlying non-dissipative dynamics. Extremal principles derived 
from effective path integral descriptions are an efficient method to explore the more inter- 
esting behavior shown by nonlinear stochastic differential equations. 


In this section we introduce a number of field theoretical tools that have been designed 
to study the behavior of classical nonequilibrium systems. For the sake of clarity, (a) this 
section is devoted exclusively to the discussion of formalism. Applications are addressed 
in the follow-up section. Furthermore, (b) we introduce the field theory approaches in the 
simple setting of zero-dimensional problems, i.e. as (0 + 1) dimensional path integrals. The 
extension to $(d+1)$-dimensional field theories is discussed in Section 10.6.


10.5.1 Martin-Siggia-Rose-Janssen-de Dominicis approach

Let us begin our discussion by constructing a path integral reformulation of stochastic
differential equations. A (first order) stochastic differential equation is an equation of
the structure

$$\partial_t n + f(n) = \xi, \tag{10.47}$$

where $n = n(t)$ is a time-dependent random variable, $f = f(n(t))$ a function, and $\xi = \xi(t)$
a noise term. In some applications, $\xi = \xi(n,t)$ depends on $n$. However, here we restrict
ourselves to the case of linear noise, where $\xi$ does not depend on $n$.²⁶ For simplicity, we
assume that the noise is Gaussian with correlator,

$$\langle \xi(t) \rangle_\xi = 0, \qquad \langle \xi(t)\,\xi(t') \rangle_\xi = A\,\delta(t - t'). \tag{10.48}$$

As a first step towards the construction of a path integral representation, we discretize time,
$n(t) \to n_i$, $i = 1, \ldots, N$, according to

$$n_i - n_{i-1} + \Delta t\,\left[f(n_{i-1}) - \xi_{i-1}\right] = 0, \tag{10.49}$$

where $\Delta t$ is the temporal discretization interval. Denoting a solution of the differential
equation by $n[\xi]$, expectation values of observables $\langle O[n] \rangle_\xi$ may then be formally represented
as

$$\langle O[n] \rangle_\xi = \int Dn\; O[n]\, \langle \delta(n - n[\xi]) \rangle_\xi = \int Dn\; O[n]\, \left\langle \left| \frac{\partial X}{\partial n} \right| \delta(X) \right\rangle_\xi,$$

where $Dn = \prod_i dn_i$ is the functional measure and $\delta(n - n[\xi]) = \prod_i \delta(n_i - n[\xi]_i)$. In the second
equality, we have introduced a vector $X = \{X_i\}$, where $X_i = n_i - n_{i-1} + \Delta t\,[f(n_{i-1}) - \xi_{i-1}]$
and $|\partial X/\partial n|$ is the (modulus of the) determinant of $\{\partial X_i/\partial n_j\}$. The advantage of the
discretization chosen above is that $\partial X/\partial n$ is a triangular matrix with unit diagonal, i.e. the
functional determinant equals unity. Thus, substituting the definition of $X$,

$$\langle O[n] \rangle_\xi = \int Dn\; O[n]\, \langle \delta(\partial_t n + f(n) - \xi) \rangle_\xi, \tag{10.50}$$

where $\partial_t n = \Delta t^{-1}(n_i - n_{i-1})$ is shorthand for the lattice time derivative.

INFO The discretization leading to a unit functional determinant goes under the name Ito
discretization. More general discretization schemes call for a treatment of the corresponding
functional determinants. It is customary to represent these determinants in terms of Grassmann
integrals. The resulting integrals jointly involve real integration variables (the functional
representation of the $\delta$-function), and Grassmann variables. Interestingly, these integrations are
coupled by a symmetry principle known as BRS-symmetry. For further discussion of BRS symmetries
in the context of stochastic differential equations, we refer to J. Zinn-Justin, Quantum
Field Theory and Critical Phenomena (Oxford University Press, 1993).
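The claim that the Ito discretization (10.49) produces a triangular Jacobian with unit diagonal can be made concrete on a short time lattice. The sketch below builds $\partial X_i/\partial n_j$ by central finite differences and evaluates its determinant; the drift $f(n) = n^3 - n$, the sample path, and the noise values are hypothetical and serve only to show that the result does not depend on them.

```python
def residual_vector(n, xi, dt, f, n_init):
    """X_i = n_i - n_{i-1} + dt * (f(n_{i-1}) - xi_{i-1}), with n_0 = n_init fixed."""
    previous = [n_init] + list(n[:-1])
    return [n[i] - previous[i] + dt * (f(previous[i]) - xi[i]) for i in range(len(n))]

def jacobian(n, xi, dt, f, n_init, h=1e-6):
    """Central-difference approximation of the matrix dX_i / dn_j."""
    size = len(n)
    jac = [[0.0] * size for _ in range(size)]
    for j in range(size):
        up = list(n)
        down = list(n)
        up[j] += h
        down[j] -= h
        x_up = residual_vector(up, xi, dt, f, n_init)
        x_down = residual_vector(down, xi, dt, f, n_init)
        for i in range(size):
            jac[i][j] = (x_up[i] - x_down[i]) / (2.0 * h)
    return jac

def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    m = [row[:] for row in matrix]
    size = len(m)
    det = 1.0
    for col in range(size):
        pivot = max(range(col, size), key=lambda r: abs(m[r][col]))
        if m[pivot][col] == 0.0:
            return 0.0
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, size):
            factor = m[r][col] / m[col][col]
            for c in range(col, size):
                m[r][c] -= factor * m[col][c]
    return det

def drift(n):
    return n ** 3 - n  # hypothetical nonlinear drift f(n)

path = [0.3, -0.2, 0.5, 0.1, 0.8]     # arbitrary lattice path n_1 .. n_5
noise = [0.4, -1.1, 0.7, 0.0, -0.3]   # arbitrary noise values xi_0 .. xi_4
jac = jacobian(path, noise, dt=0.1, f=drift, n_init=0.0)
det = determinant(jac)
max_upper = max(abs(jac[i][j]) for i in range(5) for j in range(i + 1, 5))
print("determinant:", det, "  largest entry above the diagonal:", max_upper)
```

Had the drift been evaluated at $n_i$ instead of $n_{i-1}$, the diagonal entries would read $1 + \Delta t\, f'(n_i)$ and the determinant would become path dependent, which is the situation alluded to in the INFO block above.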


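For an application with nonlinear noise term, see Section 10.7.2 below.

Representing the $\delta$-function in terms of a Fourier integral, and switching back to a continuum
notation, we arrive at the functional integral representation

$$\langle O[n] \rangle_\xi = \int D(n,\tilde n)\; O[n]\, \left\langle e^{\,i \int dt\, \tilde n\,(\partial_t n + f(n) - \xi)} \right\rangle_\xi,$$

where the structure of the “action” suggests that the two fields $n$ and $\tilde n$ are “canonically
conjugate” to each other. We finally average over the noise, using the Gaussian correlator (10.48),
to arrive at the celebrated Martin-Siggia-Rose-Janssen-de Dominicis (MSRJD) functional integral

$$\langle O[n] \rangle_\xi = \int D(n,\tilde n)\; O[n]\; e^{-\int dt\, \left(\frac{A}{2}\tilde n^2 - i \tilde n\,(\partial_t n + f(n))\right)}. \tag{10.51}$$

Equivalently, we may use the path integral representation to compute the conditional
probability $p(n,t|n_0,0)$ (just set $O[n] = \delta(n - n(t))$) in terms of a path integral with
“configuration space boundary conditions”

$$p(n,t|n_0,0) = \int_{n(0)=n_0}^{n(t)=n} D(n,\tilde n)\; e^{-\int dt\, \left(\frac{A}{2}\tilde n^2 - i \tilde n\,(\partial_t n + f(n))\right)}. \tag{10.52}$$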

Before proceeding, let us make a few remarks on the structure of the path integrals (10.51)
and (10.52):

> Introducing a new set of variables $(q,p)$ through $(n,\tilde n) = (q,-ip)$, Eq. (10.52) assumes
the form of an imaginary time phase space path integral,

$$p(n,t|n_0,0) = \int_{q(0)=n_0}^{q(t)=n} D(q,p)\; e^{\int dt\,(p \dot q - H(q,p))}, \qquad H(q,p) = -\frac{A}{2}p^2 - p f(q). \tag{10.53}$$

It is important to keep in mind that the integration over $p$ extends over the imaginary
axis.

> The action of the path integral vanishes on the line $p = 0$. That line generally hosts
a solution of the classical equations of motion. Varying the action we indeed find that
$(q(t), p = 0)$, where $\dot q = -f(q)$, is a solution. Comparison with the Langevin equation
shows that the dynamics describes the noise-averaged Langevin variable, $q(t) \leftrightarrow \langle n(t) \rangle$,
if fluctuations are weak: $\partial_t \langle n \rangle = -\langle f(n) \rangle \simeq -f(\langle n \rangle)$.

> Although the Hamiltonian does not assume the “standard form” $H = T(p) + V(q)$, it is
quadratic in momenta. We may thus integrate over $p$ to obtain

$$p(n,t|n_0,0) = \int_{q(0)=n_0}^{q(t)=n} D(q)\; e^{-\frac{1}{2A} \int dt\, \left[\partial_t q + f(q)\right]^2}. \tag{10.54}$$

Multiplying out the square in the exponent, we arrive at a Lagrangian variant of the
theory, the Onsager-Machlup functional²⁷

$$p(n,t|n_0,0) = \int_{q(0)=n_0}^{q(t)=n} D(q)\; e^{-\int dt\, L(q,\dot q)}, \qquad L(q,\dot q) = \frac{1}{2A}\dot q^2 + \frac{1}{A}\dot q\, f(q) + \frac{1}{2A} f(q)^2. \tag{10.55}$$

This is the Lagrangian of a one-dimensional point particle with “mass” $m = 1/A$ which
is coupled to a velocity-dependent one-dimensional “vector potential” of strength
$\mathcal{A}(q) = f(q)/A$, and a potential $V(q) = f(q)^2/2A$.
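The meaning of the Onsager-Machlup weight (10.54) becomes transparent on a time lattice: the action measures how strongly a given path deviates from the noiseless drift $\dot q = -f(q)$. The following sketch evaluates the Ito-discretized action for two paths with identical end points; the linear drift $f(q) = q$, the noise strength, and the lattice parameters are hypothetical illustration values.

```python
def onsager_machlup_action(path, dt, f, noise_strength):
    """Ito-discretized S = sum_i dt * ((q_i - q_{i-1}) / dt + f(q_{i-1}))**2 / (2 A)."""
    action = 0.0
    for q_prev, q_next in zip(path[:-1], path[1:]):
        velocity = (q_next - q_prev) / dt
        action += dt * (velocity + f(q_prev)) ** 2 / (2.0 * noise_strength)
    return action

def noiseless_path(q0, dt, f, n_steps):
    """Lattice solution of dq/dt = -f(q), i.e. Eq. (10.49) with the noise switched off."""
    path = [q0]
    for _ in range(n_steps):
        path.append(path[-1] - dt * f(path[-1]))
    return path

def straight_path(q_start, q_end, n_steps):
    """Linear interpolation between the same end points, for comparison."""
    return [q_start + (q_end - q_start) * i / n_steps for i in range(n_steps + 1)]

def drift(q):
    return q  # hypothetical linear restoring drift f(q) = q

dt, n_steps, noise_strength = 0.01, 200, 0.5
relaxing = noiseless_path(1.0, dt, drift, n_steps)
straight = straight_path(relaxing[0], relaxing[-1], n_steps)

action_relaxing = onsager_machlup_action(relaxing, dt, drift, noise_strength)
action_straight = onsager_machlup_action(straight, dt, drift, noise_strength)
print("action of noiseless path:", action_relaxing)
print("action of straight path :", action_straight)
```

The noiseless path carries (numerically) zero action and hence maximal weight, while any other path connecting the same end points is exponentially suppressed by its positive action.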

We may interpret the Onsager-Machlup representation of the probability distribution
as the formal analog of a Lagrangian path integral for a quantum time evolution operator.
On general grounds, the Schrödinger equation corresponding to this formulation
reads

$$\left(\partial_t - \frac{A}{2}\left(\partial_q + \mathcal{A}(q)\right)^2 + V(q)\right) \phi(q,t) = \left(\partial_t - \frac{A}{2}\partial_q^2 - \partial_q f(q)\right) \phi(q,t) = 0,$$

where $\phi$ is the analog of a wave function. Specifically, the “time evolution operator”
$p(n,t|n_0,0)$ is a solution of the initial value problem

$$\left.\left(\partial_t - \frac{A}{2}\partial_q^2 - \partial_q f(q)\right) p(q,t)\right|_{q=n} = 0, \qquad p(n,0) = \delta(n - n_0),$$

which is nothing but the Fokker-Planck description of the Langevin dynamics. We thus
arrive at the important conclusion that


the Fokker-Planck equation is the “Schrödinger equation” of
the MSRJD path integral.


The discussion above is overly concise in that (a) we are not explicit regarding sign factors
and imaginary $i$'s. For instance, the “momentum” operator in our Schrödinger equation
reads $\partial_q$ instead of the conventional $-i\partial_q$, because the equation is derived from a theory
with real exponent $-S[q]$, instead of $iS[q]$. More importantly, (b) we do not keep track of
the order of derivatives (why $\partial_q f(q)$, instead of $f(q)\partial_q$?). To monitor the correct ordering,
one needs to keep track of the properly Ito-discretized sequential ordering of operators in
the action (an instructive if somewhat technical exercise).

INFO It may be instructive to observe the functioning of the concepts above on a miniature 
application. Consider a particle at rest in the bottom of a metastable potential configuration (see 
inset of Fig. 10.8). Thermal activation may help the particle in surmounting the potential barrier 
to settle in an overall more favourable true energy minimum. We know that the corresponding 
thermal escape rates are controlled by Arrhenius factors. The detailed computation of such 
rates is of importance in, say, the chemistry of thermally activated reactions. For a given potential 
landscape, the functional integrals above can be a powerful aid in analyzing the corresponding 


L. Onsager and S. Machlup, Fluctuations and irreversible processes, Phys. Rev. 91, 1505-12 (1953).

Figure 10.8 Inset: particle in a metastable potential well. Main figure: phase space portrait of the
corresponding MSRJD functional. For a discussion, see main text. 


escape kinematics. By way of example, here we discuss the emergence of Arrhenius activation 
factors from the functional integral formalism. 

Assuming overdamped dynamics, the equation of motion of a point particle in the well is given
by

$$\gamma \dot q + \partial_q V(q) = \xi'(t),$$

where, by the fluctuation-dissipation theorem, $\langle \xi'(t)\,\xi'(t') \rangle = 2\gamma T\,\delta(t - t')$. Dividing by $\gamma$, we
obtain a stochastic differential equation of the type Eq. (10.47), with the identification $q \leftrightarrow n$,
$f = \gamma^{-1}\partial_q V$ and noise strength $A = 2T\gamma^{-1}$. Let us assume that thermal escape will be an unlikely
event in the sense that $\Delta V/T \gg 1$, where $\Delta V$ is the difference between the local maximum of the
metastable well and its minimum. We thus expect a small escape probability $p \sim \exp(-\Delta V/T)$.
The largeness of the exponent $-\ln p \sim \Delta V/T \gg 1$ then suggests an evaluation of the path
integral (10.53) by semiclassical methods. In its crudest form, semiclassics means that we evaluate
the Hamiltonian path integral at its stationary points and neglect all fluctuation effects.

With $H(q,p) = -\frac{p}{\gamma}\left(\partial_q V(q) + T p\right)$, the Hamiltonian equations of motion of the action (10.53)
read

$$\dot q = \partial_p H(q,p) = -\frac{1}{\gamma}\partial_q V(q) - \frac{2T}{\gamma}\,p, \qquad \dot p = -\partial_q H(q,p) = \frac{p}{\gamma}\,\partial_q^2 V(q).$$

For the purposes of the present discussion, we do not need to solve these equations; it is sufficient 
to inspect the phase portrait of the Hamiltonian. 

As we shall see, the topology of Hamiltonian flow in phase space is essentially determined
by the zero energy contours $H(q,p) = 0$. The structure of the Hamiltonian implies that one of
these curves is specified by $(q, p = 0)$. On the line $p = 0$, the equations of motion reduce to
$\gamma \dot q = -\partial_q V(q)$. This is drift motion down the potential gradient towards the local minimum. We
note that the point of the local potential maximum $q = q^*$ defines an unstable point on that
line, while $q = 0$ is stable. This observation entails two important consequences: (a) we expect
that the equilibrium coordinates on the $p = 0$ line, $0$ and $q^*$, must define points of origin of
motion qualitatively different from downward drift $q^* \to 0$. Specifically, we expect the option of
noise-activated reverse motion $0 \to q^*$. If so, the equilibrium points must be the terminal points
of another $H = 0$ trajectory. Indeed, the equation $H(q,p) = 0$ possesses a second solution,
$p = -T^{-1}\partial_q V$. At the potential extrema $\partial_q V = 0$, or $q = 0, q^*$, this curve intersects the $p = 0$ line,
which means that $(q^*, 0)$ and $(0, 0)$ are terminal points of phase space separatrices. (Recall that
the principle of non-crossing phase space curves requires that the motion along the separatrices
takes infinite time. For example, you may convince yourself that in a strict mathematical sense,
the downward drift trajectory $q^* \to 0$ has infinite passage time.) (b) The conservation of phase
volume (Liouville’s theorem) implies that the two phase space fixed points $(0,0)$ and $(q^*, 0)$
are hyperbolic. This means that the point $(0,0)$ (stable in coordinate direction) is unstable in
momentum direction, while $(q^*, 0)$ (unstable in coordinate direction) is stable in momentum
direction. Notice that the discussion above did not rest upon many specific characteristics of
the escape problem. Indeed, the existence of $E = 0$ separatrices terminating in hyperbolic fixed
points on the $p = 0$ line is a principle that carries over to many other “noise” phase portraits.

The separatrices largely determine the topology of phase flow, as exemplified in Fig. 10.8. 
Specifically, the separatrix $0 \to q^*$ accounts for the possibility of noise-assisted motion against
the potential gradient. Within our semiclassical framework, the probability for this happening,
$p \sim \exp(-S)$, is controlled by the action $S$ accumulated along this trajectory. The latter is
calculated as

$$-S = \int dt\,\big(p(q)\,\dot q - H(q,p)\big) = -\frac{1}{T}\int_0^{q^*} dq\,\partial_q V(q) = -\frac{1}{T}\big(V(q^*) - V(0)\big) = -\frac{\Delta V}{T},$$

where we have used $H(q,p) = 0$, and the defining equation of the separatrix. As expected, we
obtain the Arrhenius weight as the dominant factor controlling the escape probability. Of course, 
much less heavy artillery is needed to arrive at just this result. However, as usual with functional 
approaches, the advantage of the above formalism is its extensibility to more complex problems. 
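
The Arrhenius result can be checked numerically. The following is a minimal sketch, assuming the uphill zero-energy branch is $p = -T^{-1}\partial_q V(q)$ and that the escape weight is $\exp(-S)$ with $S = -\int p\,dq$ along the separatrix. The cubic test potential, the temperature value and the function name `separatrix_action` are illustrative choices, not taken from the text.

```python
import numpy as np

def separatrix_action(dV, q_start, q_end, T, num=20001):
    """Return S = -∫ p dq along the assumed uphill branch p = -dV/dq / T."""
    q = np.linspace(q_start, q_end, num)
    integrand = dV(q) / T                      # equals -p(q) on the separatrix
    # plain trapezoidal rule, written out to avoid version-specific helpers
    return float(np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(q)))

# Hypothetical test potential: minimum at q = 0, maximum at q* = 1.
V = lambda q: 0.5 * q**2 - q**3 / 3.0
dV = lambda q: q - q**2

T = 0.1
S = separatrix_action(dV, 0.0, 1.0, T)
barrier = (V(1.0) - V(0.0)) / T               # Delta V / T

print(S, barrier)
```

Under these assumptions the two printed numbers agree to quadrature accuracy, which is the statement that the separatrix action reproduces the Arrhenius exponent $\Delta V/T$.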


10.5.2 Field integral representation of the master equation I 

In order to avoid potential confusion with the momentum variable below, the notation for
probabilities is capitalized in this section, $p \to P$.

By generalization of the ideas exemplified in Section 10.4.4 on the Poisson process, we 
can derive a path integral representation for the master equations describing Markovian 
stochastic processes. Unsurprisingly, this path integral turns out to be a close ally of the 
MSRJD functional integral above. 

For simplicity, we consider a one step master equation, i.e. an equation where a
discrete state variable $n$ changes by no more than one unit, $\pm 1$, in each time step. 28 The
general form of a one step master equation reads

$$\partial_t P(n,t) = \left[(E_{+1} - 1)\, f_-(n) + (E_{-1} - 1)\, f_+(n)\right] P(n,t), \qquad (10.56)$$

where $E_{\pm 1} f(n) = f(n \pm 1)$ acts by unit translation. As in our example application in
Section 10.4.4, it is useful to think of the probability distribution as the analog of a quantum
time evolution operator $\hat P$ with “matrix elements” $P(n,t;n_0,t_0) \equiv P(n,t|n_0,0) \equiv P(n,t)$.
Equation (10.56) then becomes an (imaginary time) evolution equation,

$$\big(\partial_t + \hat H(\hat q,\hat p)\big)\, \hat P(t) = 0, \qquad \hat P(0) = 1, \qquad (10.57)$$

with the “Hamiltonian operator”

$$\hat H(\hat q,\hat p) = -\left(e^{+i\hat p} - 1\right) f_-(\hat q) - \left(e^{-i\hat p} - 1\right) f_+(\hat q). \qquad (10.58)$$

Here, it is understood that the eigenvalue spectrum of the “position operator,” $\hat q$, will
eventually be evaluated at discrete values $n$, 29 and $\hat p$ is canonically conjugate to $\hat q$. In the $q$
representation, $\hat p = -i\partial_q$.

28 The straightforward generalization to finite range equations leads to path integrals with more
complicated actions. However, in many cases, the long time dynamics deriving from these integrals
is qualitatively similar to that of the single step process.
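
To make the structure of the one step master equation concrete, here is a minimal sketch that builds the generator of Eq. (10.56) as a matrix on a truncated state space and integrates it for the pure death process $f_+ = 0$, $f_- = \lambda n$. The truncation at `n_max`, the RK4 integrator and all function names are assumptions made for illustration only.

```python
import numpy as np

def one_step_generator(f_plus, f_minus, n_max):
    """Matrix L with dP/dt = L @ P on states n = 0..n_max (truncated, assumed)."""
    n = np.arange(n_max + 1)
    up = np.array([f_plus(k) for k in n], dtype=float)
    down = np.array([f_minus(k) for k in n], dtype=float)
    up[-1] = 0.0      # no jumps out of the truncated range (assumption)
    down[0] = 0.0     # the state variable cannot drop below n = 0
    L = np.zeros((n_max + 1, n_max + 1))
    L[n[:-1], n[1:]] += down[1:]   # gain of n from n+1: f_-(n+1) P(n+1)
    L[n[1:], n[:-1]] += up[:-1]    # gain of n from n-1: f_+(n-1) P(n-1)
    L[n, n] -= up + down           # loss terms
    return L

def evolve(L, P0, t, steps=2000):
    """Integrate dP/dt = L @ P with classical fourth-order Runge-Kutta."""
    dt = t / steps
    P = P0.copy()
    for _ in range(steps):
        k1 = L @ P
        k2 = L @ (P + 0.5 * dt * k1)
        k3 = L @ (P + 0.5 * dt * k2)
        k4 = L @ (P + dt * k3)
        P = P + dt * (k1 + 2 * k2 + 2 * k3 + k4) / 6.0
    return P

lam, n0, n_max, t = 1.0, 5, 10, 1.0
L = one_step_generator(lambda n: 0.0, lambda n: lam * n, n_max)
P0 = np.zeros(n_max + 1)
P0[n0] = 1.0
P = evolve(L, P0, t)
mean = float(np.arange(n_max + 1) @ P)

print(mean, n0 * np.exp(-lam * t))
```

The columns of `L` sum to zero, which expresses probability conservation, and for the linear death rate the mean follows $n_0 e^{-\lambda t}$, the solution of the fluctuationless drift equation.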

We proceed by representing the formal solution of Eq. (10.57) in terms of an imaginary
time phase space path integral

$$P(n,t;n_0,t_0) = \langle n|e^{-t\hat H}|n_0\rangle = \int_{q(0)=n_0}^{q(t)=n} D(q,p)\; e^{\int dt'\,\left(i p\dot q - H'(q,p)\right)},$$

where $H'(q,p)$ is the classical Hamiltonian corresponding to the operator Eq. (10.58). If it
were not for the imaginary unit multiplying the $p\dot q$-term, the action of the path integral
would be structurally identical to a Hamiltonian action. We may remove that discrepancy
by introducing a new momentum variable $p \to -ip$, whereupon the path integral assumes
the form

$$P(n,t;n_0,t_0) = \int_{q(0)=n_0}^{q(t)=n} D(q,p)\; e^{\int dt'\,\left(p\dot q - H(q,p)\right)},$$

$$H(q,p) = -\left(e^{+p} - 1\right) f_-(q) - \left(e^{-p} - 1\right) f_+(q). \qquad (10.59)$$

Here, $H(q,p) = H'(q,-ip)$ is the Hamiltonian evaluated at the “Wick rotated” momentum.
At this stage it is instructive to make a few remarks on the structure of the path integral: 

> It is important to remember that the integration over $p$ extends over the imaginary axis.

> As with the path integral representation of Langevin dynamics Eq. (10.53), the action
vanishes on the axis $p = 0$. Again, the line $p = 0$ accommodates a solution of the variational
equations, $(q(t), p = 0)$, where $\dot q = f_+(q) - f_-(q)$. Comparison with
$\partial_t\langle n\rangle = \langle f_+(n)\rangle - \langle f_-(n)\rangle \approx f_+(\langle n\rangle) - f_-(\langle n\rangle)$
shows that this equation describes the dynamics of the fluctuationless theory, again in analogy
with the Langevin theory.

> The Fokker-Planck approximation to the master equation is obtained by quadratic expansion
of the Hamilton operator in $\hat p$,

$$\hat H \simeq i\hat p\,\big(f_+(\hat q) - f_-(\hat q)\big) + \frac{\hat p^2}{2}\,\big(f_-(\hat q) + f_+(\hat q)\big).$$

Substituting into Eq. (10.57), we obtain

$$\Big(\partial_t + \partial_q\,(f_+ - f_-)(q) - \frac{1}{2}\,\partial_q^2\,(f_- + f_+)(q)\Big) P(q,t) = 0.$$

29 One may worry that the initial condition of the operator, $P(q = n, t; n_0, 0) = \delta(n - n_0)$,
conflicts with the discrete condition $P(n,t|n_0,0) = \delta_{n,n_0}$. Should this ever become an issue,
one may switch to the framework of quantum mechanics of operators with discrete eigenvalue
spectra. However, the continuous variables used here are better suited to the formulation of
path integral methods below.

The functional integral Eq. (10.59) is strikingly similar to the MSRJD functional 
Eq. (10.53). The analogy becomes almost perfect if we replace the Hamiltonian of the 
former by the quadratic approximation above. Indeed, we may think of the discreteness 
of the state variables entering the master equation as a source of “noise.” If we are only 
interested in structures $\Delta q$ much larger than the “lattice spacing” $\Delta n = 1$, the expansion
of the Hamiltonian operator may be restricted to the first two non-vanishing terms. If we
keep only the first term $\sim \partial_q$, we obtain deterministic drift, $\dot q = (f_+ - f_-)(q)$ (as can be
seen by integrating over p in Eq. (10.59)). The second term describes noise due to residual 
discreteness effects. This noise affects the time evolution of the probability distribution 
much like Langevin noise. 


10.5.3 Doi-Peliti operator technique

Equation (10.59) is not the only field theory representation of the master equation. There 
exists another approach, the Doi-Peliti operator technique 30 which extensively draws upon 
the Fock space structures and coherent state field integration introduced in previous chap- 
ters. Although this technique can be introduced in rather general terms, it is tailored to 
situations where the weight functions $f_\pm$ in Eq. (10.56) are low order polynomials in $n$. 31 By
way of example, let us consider the case $f_+ = 0$, $f_- = \lambda n$. The resulting master equation,

$$\partial_t P(n,t) = \lambda\,(E_{+1} - 1)\, n P(n,t),$$

describes extinction of a population whose individuals, $A \xrightarrow{\lambda} \emptyset$, perish at a constant rate $\lambda$.

To set the stage, let us identify an integer valued 32 configuration $\{n\}$ with a bosonic Fock
space state $|n\rangle$ of occupation number $n$. Next, let us introduce an algebra of Fock space
operators through

$$a|n\rangle = |n-1\rangle, \qquad \bar a|n\rangle = (n+1)\,|n+1\rangle. \qquad (10.60)$$


30 M. Doi, Second quantization representation for classical many-particle system, J. Phys. A 9, 1465-77 (1976);
M. Doi, Stochastic theory of diffusion-controlled reaction, J. Phys. A 9, 1479-95 (1976); L. Peliti, Path integral
approach to birth-death processes on a lattice, J. Physique 46, 1469-83 (1985).

31 As with the formalism introduced in the previous section, the Doi-Peliti operator technique affords
straightforward generalization to transitions further than $\pm 1$, as well as to multi-component/multi-dimensional
situations.

32 The generalization to other discretization intervals as well as to multi-valued configurations should be obvious.

It is straightforward to verify that these operators obey canonical commutation relations,
$[a, \bar a] = 1$, i.e. they are related to the standard operator algebra 33 used in second quantization
by canonical transformation. The operators $a$ and $\bar a$ can be used to represent the action of
the linear operator on the right-hand side of Eq. (10.56) in terms of the second quantized
Hamiltonian (exercise),

$$\hat H = -\lambda\,(a - \bar a a). \qquad (10.61)$$

With this definition, it is straightforward to verify that 

$$P(n,t) = \langle n|e^{-t\hat H}|n_0\rangle, \qquad (10.62)$$


is a Fock space representation of the distribution function. 
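
The operator algebra can be verified numerically on a truncated Fock space. The sketch below assumes the convention $a|n\rangle = |n-1\rangle$, $\bar a|n\rangle = (n+1)|n+1\rangle$ of Eq. (10.60), stores the image of $|n\rangle$ in column $n$, and cuts the space off at `n_max`. The cutoff and the function name are illustrative assumptions.

```python
import numpy as np

def doi_peliti_operators(n_max):
    """Matrices of a and abar on span{|0>, ..., |n_max>} (column n = image of |n>)."""
    dim = n_max + 1
    a = np.zeros((dim, dim))
    abar = np.zeros((dim, dim))
    for n in range(1, dim):
        a[n - 1, n] = 1.0            # a|n> = |n-1>
    for n in range(dim - 1):
        abar[n + 1, n] = n + 1.0     # abar|n> = (n+1)|n+1>
    return a, abar

a, abar = doi_peliti_operators(8)
comm = a @ abar - abar @ a           # should equal the identity below the cutoff
number = abar @ a                    # should be diag(0, 1, ..., n_max)

print(np.diag(comm))
print(np.diag(number))
```

Below the cutoff the commutator is the identity and $\bar a a$ acts as the occupation number. The last diagonal entry of the commutator deviates, which is purely a truncation artifact.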

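We next apply the techniques introduced in Chapter 4 to represent the matrix element
above in terms of a coherent state field integral. Proceeding in the usual manner, we
decompose the time interval into $N \gg 1$ steps, $e^{-t\hat H} = (e^{-\delta\hat H})^N$, $\delta = t/N$, note that the
Hamiltonian Eq. (10.61) is properly normal ordered, and insert coherent state resolutions
of the identity, $\mathrm{id.} = \int d(\bar\psi,\psi)\, e^{-\bar\psi\psi}\,|\psi\rangle\langle\bar\psi|$, where the coherent states $|\psi\rangle$ obey the relations
$a|\psi\rangle = \psi|\psi\rangle$, $\bar a|\psi\rangle = \partial_\psi|\psi\rangle$, $\langle\bar\psi|\bar a = \langle\bar\psi|\bar\psi$, $\langle\bar\psi|a = \partial_{\bar\psi}\langle\bar\psi|$. This leads to the representation

$$P(n,t) = \int D(\bar\psi,\psi)\; e^{-\delta\sum_{k=0}^{N-1}\left[\delta^{-1}(\bar\psi_k - \bar\psi_{k+1})\psi_k + H(\bar\psi_{k+1},\psi_k)\right] + n\ln\psi_N + n_0\ln\bar\psi_0 - \ln n_0!}.$$

Here, $k$ labels the time slices, $H(\bar\psi,\psi) = -\lambda\psi(1 - \bar\psi)$ is the coherent state representation of the
(normal ordered) Hamiltonian, and the boundary terms in the action represent the boundary matrix elements

$$\langle n|\psi_N\rangle = \psi_N^{\,n} = e^{n\ln\psi_N},$$

$$\langle\bar\psi_0|n_0\rangle = \frac{1}{n_0!}\,\bar\psi_0^{\,n_0} = e^{n_0\ln\bar\psi_0 - \ln n_0!}. \qquad (10.63)$$

Switching to a continuum notation, we arrive at the Doi-Peliti functional integral

$$P(n,t) = \int D(\bar\psi,\psi)\; e^{-\int_0^t dt'\,\left[-\partial_{t'}\bar\psi\,\psi + H(\bar\psi,\psi)\right] + n\ln\psi(t) + n_0\ln\bar\psi(0) - \ln n_0!}. \qquad (10.64)$$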

The coherent state field integral Eq. (10.64) defines a representation of the probability
distribution which is an alternative to the phase space path integral Eq. (10.59). But how
do these two representations relate to each other? The answer can be found by comparing
the respective Hamiltonians. For $f_+ = 0$ and $f_- = \lambda n$, the Hamiltonian Eq. (10.58) reduces
to $\hat H = -\lambda\,(e^{\hat p} - 1)\,\hat q$, where we used the Wick rotated form of the momentum operator,
$\hat p \to -i\hat p$. Comparison with Eq. (10.61) suggests the identification

$$a = e^{\hat p}\,\hat q, \qquad \bar a = e^{-\hat p}, \qquad (10.65)$$

which, up to a constant, leads to an identification of the two Hamiltonians. Eq. (10.65) is
known as a Cole-Hopf transformation. It is straightforward to verify that the Cole-Hopf
transformation is canonical, i.e. the relation $[\hat p,\hat q] = 1$ is compatible with $[a, \bar a] = 1$.
Substitution of the (coherent state version of the) transformation into the functional integral
Eq. (10.64) transforms the “bulk contribution” to the action, $\int dt'\,\big(-(\partial_{t'}\bar\psi)\psi + H(\bar\psi,\psi)\big)$,
into that of Eq. (10.59). A careful treatment of the boundary term generates the effective
boundary condition $q(0) = n_0$, $q(t) = n$.

33 The latter is defined by $a|n\rangle = n^{1/2}|n-1\rangle$, $a^\dagger|n\rangle = (n+1)^{1/2}|n+1\rangle$.

Figure 10.9 Summary of different field theory approaches to Langevin dynamics and stochastic
processes.

EXERCISE Explicitly represent the coherent states above as exponentials of the operators $a$, $\bar a$
and verify the relations (10.63). Apply the Cole-Hopf transformation to the time-discretized
representation of the functional integral to prove the above boundary identity in the large $n$
limit. Carefully avoid usage of the hermitian adjoint operation - the operator $\bar a$ is different from
$a^\dagger$!

Summarizing, we have constructed two representations of P(n, t) which are related by a 
Cole-Hopf transformation. Historically, the coherent state representation Eq. (10.64) was 
introduced prior to the path integral Eq. (10.59). (This is surprising inasmuch as Eq. (10.59) 
relies on a less formal construction.) It is probably not possible to decide categorically which 
of the two representations is the “better.” At least for certain types of stochastic processes, 
the Doi-Peliti functional is controlled by a comparatively simple, if non-hermitian, Hamiltonian
operator. The price to be paid for this simplicity is that the integration variables $\psi$
and $\bar\psi$ do not have a direct physical interpretation.

The network of different theories constructed in the previous sections is summarized in 
Fig. 10.9. So far, we have been focusing on the case of low-dimensional problems, where the 
state variable is a scalar or vectorial variable. In the next section, we upgrade the formalism 
to a genuine field theory describing multi-dimensional problems. 
10.6 Field theory II: higher dimensions


Our aim in this section is to construct an effective field theory approach to higher dimen- 
sional systems showing dynamical or steady state nonequilibrium phenomena. To give the 
discussion some perspective, let us list a few categories of system we have in mind: 

> Interacting lattice particle systems, i.e. systems of particles whose dynamics is gov- 
erned by an interplay of single particle hopping and interactions. Important examples 
include exclusion processes, i.e. systems wherein a hardcore repulsion excludes mul- 
tiple occupancy of lattice sites, and driven diffusive lattice gases, systems where a 
uniformly applied force stabilizes a steady state nonequilibrium configuration. 

> Reaction-diffusion systems, i.e. lattice systems populated by particle species
which may undergo reactions. Setups of this type are common in the modeling of chemical
reaction kinetics, the description of evolutionary or ecological dynamics, traffic simulation
and many more. Thinking, for example, about a “fast” chemical reaction $A + B \to C$
in solution, the factor limiting the total reaction throughput may be the speed at which
the chemicals diffuse through the reaction vessel. In this case, a combination of diffusion
(conveniently modeled on a lattice) and local (on-site) reaction will be the minimal framework
to describe the process. Important examples of particle reaction processes include
birth $\emptyset \to A$, death $A \to \emptyset$, transmutation $A \to B$, death at contact, $A + B \to A$, etc.

Notice an important difference between the two categories above: while the particle number 
in interacting lattice systems is conserved, reaction-diffusion systems contain non-conserved 
particle species. 


10.6.1 Basic notions of dynamical critical phenomena 

What would be a good strategy to describe systems such as those listed above? On a 
fully microscopic level, the dynamics is governed by incredibly high dimensional equations 
of motion whose solution is out of the question. In practice we will, rather, turn to a 
description in terms of effective or coarse-grained variables. For example, in a system which 
may undergo a phase transition the order parameter field, $m(\mathbf x)$, of the transition would be
an obvious choice of an effective variable; in a reaction diffusion system one might think of
the coarse-grained concentration variables $n_A(\mathbf x), n_B(\mathbf x), \dots$ of the particle species, etc. For
definiteness we will denote our effective variables by $\phi^a$, $a = 1, \dots, n$ throughout.

As in our “zero-dimensional” discussion in Section 10.2, the presence of microscopic “hid- 
den degrees of freedom” affects the dynamics of the effective variables in two different 
ways: (i) a conspiracy of dissipative and macroscopic forces drives the effective variables 
towards local stationary configurations. However, (ii) we have observed that the presence of 
dissipative relaxation mechanisms is generally accompanied by microscopic stochastic fluctuations.
Finally, (iii) for conserved effective variables, time-dependent changes, $\partial_t\phi$, must
be consistent with a continuity relation $\partial_t\phi + \nabla\cdot\mathbf j = 0$, where $\mathbf j$ is some current field.

Before turning to the discussion of effective theories consistent with the principles (i)-(iii) 
above, it is worthwhile to formulate a set of good questions (observables) to ask. We are 
interested in out of equilibrium systems wherein particle correlations generate collective or
even critical phenomena. As usual in many particle physics, we aim to identify fixed points 
separating phases with distinct universal properties. One may then proceed to an analysis 
of power law correlations emerging in the vicinity of critical points. In the next section, we 
set the stage for these types of discussion by introducing a few elements of the theory of 
dynamical critical phenomena. This material follows closely a seminal review article 34 by 
Hohenberg and Halperin. 

Consider a system with some second order (continuous) phase transition. We measure the
distance to the transition point in terms of some relevant scaling field $\tau$ (e.g. $\tau = (T - T_c)/T_c$,
where $T$ is temperature, and $T_c$ the critical temperature). We denote the order parameter
of the transition by $\phi$. We assume that the order parameter is conjugate to a field $h$, in
the sense that $\langle\phi^a\rangle = -\delta_{h^a} F[h]$, where $F[h]$ is a suitably chosen functional (in thermal
equilibrium, the free energy of the system).

In the vicinity of a critical point, the system builds up long range correlations. For example,
the instantaneous or static correlation function $C(\mathbf x) \equiv \langle\phi(\mathbf x)\phi(0)\rangle$ is expected to show
power law behavior

$$C(\mathbf x) = |\mathbf x|^{-(d-2+\eta)}\; Y(|\mathbf x|/\xi), \qquad (10.66)$$

where $\xi \sim |\tau|^{-\nu}$ is the correlation length and $Y$ a dimensionless scaling function. The correlation
length is a measure for the characteristic length scales of order parameter fluctuations.
Turning to dynamics, we anticipate that the corresponding order parameter time scales
will increase as

$$t(\tau) \sim \xi(\tau)^z \sim |\tau|^{-\nu z}, \qquad (10.67)$$

where the dynamical exponent $z$ depends on how the order parameter relaxes in large
correlation volumes. The growth of the relevant time scales is a phenomenon known as
critical slowing down. Equation (10.67) can be converted to a statement on the dispersion
of order parameter fluctuations. The characteristic frequencies corresponding to
Eq. (10.67), $\omega(\tau) \sim t(\tau)^{-1} \sim |\tau|^{\nu z}$, relate to the momenta $|\mathbf q| \sim \xi^{-1} \sim |\tau|^{\nu}$ as

$$\omega \sim |\mathbf q|^z\, f(|\mathbf q|\xi),$$

where $f$ is a dimensionless function.

Turning to the temporal fluctuation behavior of the order parameter, imagine a situation
where $\phi$ has been prepared in some initial state and is subsequently allowed to relax. It is
reasonable to describe the relaxation profile by the scaling ansatz

$$\phi(t) = |\tau|^{\beta}\, g(t/t(\tau)),$$

where $\beta$ is the order parameter exponent and $g$ a dimensionless scaling function with asymptotics
$g(x \to \infty) \to$ const. (relaxation towards the static value $(-\tau)^{\beta}$ below the transition
point, $\tau < 0$, and zero above). For short times, $t \to 0$, the relaxation process cannot be
susceptible to the full extension of the correlation volume, and $\phi(t)$ must be independent
of $\tau$. This leads to the condition $g(t/t(\tau)) \overset{t\to 0}{\sim} |\tau|^{-\beta}$, i.e. $g(x) \sim x^{-\beta/\nu z}$ for small $x$
(recall $t(\tau) \sim |\tau|^{-\nu z}$), or

$$\phi(t) \sim t^{-\alpha}, \qquad \alpha = \beta/\nu z.$$

34 P. C. Hohenberg and B. I. Halperin, Theory of dynamic critical phenomena, Rev. Mod. Phys. 49, 435-79 (1977).

We next define two important correlation functions characterising the fluctuation/correlation
behavior of the order parameter. The dynamic susceptibility, $\chi$, is defined through the
linear response relation

$$\langle\phi(q)\rangle \overset{h\to 0}{=} \chi(q)\, h(q), \qquad (10.68)$$

where $q = (\mathbf q, \omega)$. Causality (i.e. the fact that $\phi(t)$ can depend only on $h(t' < t)$) means
that $\chi(\omega)$ is an analytic function in the upper complex $\omega$-plane. (Why was that?) The
spatio-temporal fluctuation behavior of the order parameter is described by the dynamical
correlation function,

$$C(x) = \langle\phi(x)\phi(0)\rangle - \langle\phi(0)\rangle^2, \qquad (10.69)$$

where $x = (\mathbf x, t)$. As we review in Section 10.6.4 below, the classical fluctuation-dissipation
theorem now states that, in thermal equilibrium,

$$C(\mathbf q,\omega) = \frac{2T}{\omega}\,\mathrm{Im}\,\chi(\mathbf q,\omega). \qquad (10.70)$$


Violations of this relation will be an important tool to diagnose nonequilibrium conditions. 
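
As a simple illustration of how such a diagnosis works, consider a single overdamped mode, a zero-dimensional stand-in for the field theories discussed below. The sketch assumes the dynamics $\dot x = -D(kx - h) + \xi$ with $\langle\xi(t)\xi(t')\rangle = 2DT\,\delta(t-t')$ (the equilibrium noise strength, cf. Eq. (10.78)) and the Fourier convention $x(t) = \int \frac{d\omega}{2\pi}\, e^{-i\omega t} x(\omega)$. The closed-form expressions for `chi` and `corr` follow from solving this linear equation and are not quoted from the text.

```python
import numpy as np

def chi(omega, k, D=1.0):
    """Linear response of x to h for dx/dt = -D*(k*x - h) + xi (assumed model)."""
    return D / (D * k - 1j * omega)

def corr(omega, k, T, D=1.0):
    """Spectrum <|x(omega)|^2> for noise strength <xi xi> = 2*D*T*delta(t - t')."""
    return 2.0 * D * T / (omega**2 + (D * k) ** 2)

omega = np.array([0.1, 1.0, 7.5])
k, T = 2.0, 0.3
lhs = corr(omega, k, T)
rhs = 2.0 * T / omega * chi(omega, k).imag

print(lhs)
print(rhs)
```

Both arrays coincide, i.e. the model obeys $C(\omega) = (2T/\omega)\,\mathrm{Im}\,\chi(\omega)$. Replacing the noise strength by any value other than $2DT$ breaks the equality, which is the kind of violation that signals nonequilibrium conditions.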


10.6.2 Field theories of finite dimensional Langevin systems 

Armed with the definitions above, we are now in a position to formulate an effective field 
theory approach to dynamical phenomena. We imagine that the energy stored in a particular 
order parameter configuration is described by some energy functional $H[\phi]$. The question
now is how to model the dynamics of the order parameter field. Following Hohenberg and
Halperin (HH), 34 we argue that:

> In a model for a system with non-conserved order parameter (model A, according
to the conventions of HH), the order parameter will relax to the minima of the energy
functional, i.e. the equations of motion will contain a contribution

$$\partial_t\phi(\mathbf x,t) = -D\,\frac{\delta H[\phi]}{\delta\phi(\mathbf x,t)} + \dots,$$

where $D$ is a characteristic relaxation constant.

> In contrast, in a system with conserved order parameter (HH’s model B), the continuity
equation $\partial_t\phi + \nabla\cdot\mathbf j = 0$ has to be obeyed. We assume that the current acts to evade regions
of increased energy, $\mathbf j(\mathbf x,t) = -D\,\nabla\frac{\delta H[\phi]}{\delta\phi(\mathbf x,t)} + \dots$, where the ellipses represent fluctuating
contributions to the current to be discussed in a moment. (Exercise: consider the meaning
of this definition.) 35 The continuity relation then assumes the form

$$\partial_t\phi(\mathbf x,t) = D\,\Delta\,\frac{\delta H[\phi]}{\delta\phi(\mathbf x,t)} + \dots$$

The two evolution equations above can be subsumed as
$\partial_t\phi(\mathbf x,t) = -D(-\Delta)^n\,\frac{\delta H[\phi]}{\delta\phi(\mathbf x,t)} + \dots$, where
$n = 0, 1$ for models A and B, respectively. According to principle (ii) above, the relaxational
character of the dynamics implies the presence of fluctuations. The full evolution equation,
a generalized Langevin equation, thus reads as

$$\partial_t\phi(\mathbf x,t) = -D(-\Delta)^n\,\frac{\delta H[\phi]}{\delta\phi(\mathbf x,t)} + \xi(\mathbf x,t), \qquad (10.71)$$

where the noise is correlated as

$$\langle\xi(\mathbf x,t)\,\xi(\mathbf x',t')\rangle = 2\,\delta(t-t')\,K(\mathbf x-\mathbf x') \equiv 2K(x-x'), \qquad (10.72)$$

with $x = (\mathbf x, t)$ in the last expression.

This correlator presumes that, on suitably chosen time scales, the noise can be modeled 
as Markovian. However, at this stage, not much can be said about the spatial correlation 
kernel K. 

We finally note that correlation functions can be generated by addition of a “conjugate 
field” contribution to the Hamiltonian, 

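$$H[\phi] \to H[\phi] - \int d^dx\; \phi(\mathbf x,t)\,h(\mathbf x,t).$$

Functional averages $\langle O[\phi]\rangle_\xi$ may now be computed by straightforward generalization of the
(0 + 1)-dimensional functional (10.50):

$$\langle O[\phi]\rangle_\xi = \int D\phi\; O[\phi]\,\Big\langle \delta\Big(\partial_t\phi(x) + D(-\Delta)^n\,\frac{\delta H[\phi]}{\delta\phi(x)} - \xi(x)\Big)\Big\rangle_\xi.$$

In practice, the drift term entering the $\delta$-function, $F[\phi](x) = F(\phi(x), \partial_{\mathbf x}\phi(x), \dots)$, will be a
local function of the field and its spatial derivatives. Representing the $\delta$-constraint in terms of
a Fourier integral over a “momentum field” $\psi$ and integrating over the noise field we arrive at

$$\langle O[\phi]\rangle_\xi = \int D(\phi,\psi)\; O[\phi]\; e^{\,i\int dx\,\psi(x)\left(\partial_t\phi(x) + D(-\Delta)^n\frac{\delta H[\phi]}{\delta\phi(x)}\right) - \int dx\,dx'\,\psi(x)K(x-x')\psi(x')},$$

where $\int dx = \int d^dx\,dt$. A “Wick rotation” $\psi \to -i\psi$ finally brings us to the finite dimensional
generalization of the MSRJD-functional

$$\langle O[\phi]\rangle_\xi = \int D(\phi,\psi)\; O[\phi]\; e^{\int dx\,\psi(x)\left(\partial_t\phi(x) + D(-\Delta)^n\frac{\delta H[\phi]}{\delta\phi(x)}\right) + \int dx\,dx'\,\psi(x)K(x-x')\psi(x')}. \qquad (10.73)$$

As in the zero-dimensional cases above, we may integrate over the momentum field $\psi$ to
obtain the generalized Onsager-Machlup functional

$$\langle O[\phi]\rangle_\xi = \int D\phi\; O[\phi]\; e^{-\frac{1}{4}\int dx\,dx'\,\left(\partial_t\phi + D(-\Delta)^n\frac{\delta H[\phi]}{\delta\phi}\right)(x)\,K^{-1}(x-x')\,\left(\partial_t\phi + D(-\Delta)^n\frac{\delta H[\phi]}{\delta\phi}\right)(x')}, \qquad (10.74)$$

where $K^{-1}$ is the inverse of the noise kernel.

35 Typically, the dominant contribution (in the sense of dimensional relevancy) to the Hamiltonian will be a
quadratic form in the order parameter, $H[\phi] = \frac{r}{2}\int d^dx\,\phi^2(\mathbf x,t) + \dots$ In this case, the current reduces to
$\mathbf j = -Dr\,\nabla\phi + \dots$, which is a generalization of Fick’s law.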


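INFO It is an instructive exercise to derive from Eq. (10.74) a functional variant of the
Fokker-Planck equation - “functional” because the distribution function we are after, $p_t[\phi]$, is
a functional of $\phi$, and the partial derivatives $\partial_{n_i} P(n,t)$ appearing in the standard Fokker-Planck
operator (cf. Eq. (10.22)) become functional derivatives. (To avoid the simultaneous appearance
of rectangular and conventional brackets, we denote the time argument in $p_t[\phi]$ by an index.)
Our starting point is the Onsager-Machlup functional for the probability functional,

$$p_t[\phi] = \Big\langle \prod_{\mathbf x}\delta\big(\phi(\mathbf x) - \varphi(\mathbf x,t)\big)\Big\rangle = \int_{\varphi(\mathbf x,t)=\phi(\mathbf x)} D\varphi\; e^{-\frac{1}{4}\int dx\,dx'\,(\partial_t\varphi - F[\varphi])(x)\,K^{-1}(x-x')\,(\partial_t\varphi - F[\varphi])(x')},$$

where we have introduced the abbreviation $F[\phi](\mathbf x) \equiv -D(-\Delta)^n\,\frac{\delta H[\phi]}{\delta\phi(\mathbf x)}$ (so that Eq. (10.71) reads
$\partial_t\phi = F[\phi] + \xi$) and $\phi = \{\phi(\mathbf x)\}$ is the argument of $p_t[\phi]$ while $\varphi(\mathbf x,t)$ is the $(d+1)$-dimensional
integration variable.

As a first step towards the derivation of a Fokker-Planck equation, we switch back to a time
discretized representation, $p_t \to p_l$, where $l = t/\delta$ is an integer counting variable and $\delta$ a small
time increment. The Ito-time discretized Onsager-Machlup functional then reads as

$$p_l[\phi] = \int_{\varphi_l(\mathbf x)=\phi(\mathbf x)} D\varphi\; e^{-\frac{\delta}{4}\sum_{k=0}^{l-1} O[\varphi_{k+1},\varphi_k]}, \qquad (10.75)$$

where $O[\phi,\phi']$ is shorthand for

$$O[\phi,\phi'] = \int d^dx\,d^dx'\,\big(\delta^{-1}(\phi - \phi') - F[\phi']\big)(\mathbf x)\; K^{-1}(\mathbf x-\mathbf x')\;\big(\delta^{-1}(\phi - \phi') - F[\phi']\big)(\mathbf x').$$

We next have to realize that the integral over the first $l-1$ fields $\varphi_k$ above defines the probability
distribution $p_{l-1}$:

$$p_l[\phi] = \int D\phi'\; e^{-\frac{\delta}{4}O[\phi,\phi']}\; p_{l-1}[\phi'] = \int D\rho\; e^{-\frac{\delta}{4}O[\phi,\phi-\rho]}\; p_{l-1}[\phi-\rho],$$

where in the second equality we have renamed integration variables. Introducing the shorthand
notation $\langle f|K|g\rangle \equiv \int d^dx\,d^dx'\, f(\mathbf x)K(\mathbf x-\mathbf x')g(\mathbf x')$, we next represent the functional $O[\phi,\phi-\rho]$ as

$$O[\phi,\phi-\rho] = \Big\langle \frac{\rho}{\delta} - F[\phi-\rho]\,\Big|\,K^{-1}\,\Big|\,\frac{\rho}{\delta} - F[\phi-\rho]\Big\rangle.$$

Using the abbreviation $\langle\dots\rangle_\rho \equiv \int D\rho\; e^{-\frac{1}{4\delta}\langle\rho|K^{-1}|\rho\rangle}(\dots)$, an expansion to leading order in $\delta$ - the
structure of the Gaussian weight entails that each power of $\rho$ scales as $\delta^{1/2}$ - then obtains

$$p_l[\phi] = \Big\langle\Big(1 + \frac{1}{2}\langle\rho|K^{-1}|F[\phi]\rangle - \frac{1}{2}\Big\langle\rho\Big|K^{-1}\Big|\frac{\delta F[\phi]}{\delta\phi}\rho\Big\rangle + \frac{1}{8}\langle\rho|K^{-1}|F[\phi]\rangle^2 - \frac{\delta}{4}\langle F[\phi]|K^{-1}|F[\phi]\rangle\Big)$$

$$\qquad\times\Big(p_{l-1}[\phi] - \Big\langle\rho\Big|\frac{\delta p_{l-1}[\phi]}{\delta\phi}\Big\rangle + \frac{1}{2}\Big\langle\rho\Big|\frac{\delta^2 p_{l-1}[\phi]}{\delta\phi^2}\Big|\rho\Big\rangle\Big)\Big\rangle_\rho$$

$$= p_{l-1}[\phi] - \delta\,\Big\langle\frac{\delta}{\delta\phi}\Big|F[\phi]\Big\rangle\, p_{l-1}[\phi] - \delta\,\Big\langle F[\phi]\Big|\frac{\delta}{\delta\phi}\Big\rangle\, p_{l-1}[\phi] + \delta\,\Big\langle\frac{\delta}{\delta\phi}\Big|K\Big|\frac{\delta}{\delta\phi}\Big\rangle\, p_{l-1}[\phi].$$

Here, $\langle f|\frac{\delta}{\delta\phi}\rangle \equiv \int d^dx\, f(\mathbf x)\frac{\delta}{\delta\phi(\mathbf x)}$, the Gaussian average is normalized to unity, and we used
$\langle\rho(\mathbf x)\rho(\mathbf x')\rangle_\rho = 2\delta K(\mathbf x-\mathbf x')$. Dividing by $\delta$, taking the limit $\delta \to 0$, and switching back to a
continuous time notation we obtain

$$\Big(\partial_t - \Big\langle\frac{\delta}{\delta\phi}\Big|K\Big|\frac{\delta}{\delta\phi}\Big\rangle + \Big\langle\frac{\delta}{\delta\phi}\Big|F[\phi]\Big)\, p_t[\phi] = 0,$$

or, in a coordinate resolved notation (functional derivatives act on everything to their right),

$$\Big(\partial_t - \int d^dx\,d^dx'\,\frac{\delta}{\delta\phi(\mathbf x)}\,K(\mathbf x-\mathbf x')\,\frac{\delta}{\delta\phi(\mathbf x')} + \int d^dx\,\frac{\delta}{\delta\phi(\mathbf x)}\,F[\phi](\mathbf x)\Big)\, p_t[\phi] = 0. \qquad (10.76)$$

Equation (10.76) is a functional generalization of the Fokker-Planck equation. Roughly
speaking, $(-\Delta)^n\,\delta H/\delta\phi$ replaces the “friction coefficient” of the finite dimensional equation, i.e.
the instance that drives the distribution towards a relaxation equilibrium. The “diffusion term”
is controlled by the correlation matrix of the noise.

So far, we have not specified the noise kernel, $K$. However, we expect that in equilibrium it
relates in a definite way to the drift term - the fluctuation dissipation theorem. By “equilibrium”
we mean a time-independent distribution of generalized Maxwell-Boltzmann type,

$$p_{\rm eq}[\phi] = \mathcal N\, \exp\big(-H[\phi]/T\big), \qquad (10.77)$$

where $\mathcal N$ effects normalization. We now show that the condition that $p_{\rm eq}$ is a solution imposes a
condition on the form of $K$. Substitution of Eq. (10.77) into (10.76) indeed obtains

$$0 = \int d^dx\,\frac{\delta}{\delta\phi(\mathbf x)}\Big[\Big(F[\phi](\mathbf x) + \frac{1}{T}\int d^dx'\,K(\mathbf x-\mathbf x')\,\frac{\delta H[\phi]}{\delta\phi(\mathbf x')}\Big)\, p_{\rm eq}[\phi]\Big].$$

This equation is solved provided that we declare that

$$K(\mathbf x-\mathbf x') = DT\,(-\Delta)^n\,\delta(\mathbf x-\mathbf x'). \qquad (10.78)$$

Notice that for a Hamiltonian functional that is local in $\phi$ (think of $H[\phi] = \frac{1}{2}\int d^dx\,((\partial\phi)^2 +
r\phi^2 + \dots)$) the functional derivatives appearing in the Fokker-Planck equation are singular (for
example, $\frac{\delta^2 H[\phi]}{\delta\phi(\mathbf x)\delta\phi(\mathbf x)} \sim (-\partial^2 + r)\,\delta(0)$, etc.) and need regularization. However, for the purposes
of our present discussion, a symbolic understanding of the derivatives is sufficient.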

To conclude, let us summarize several “take home messages” implied by the construction 
above: 
> The concept of a Fokker-Planck equation affords a generalization to finite dimensional
problems.

> However, such an equation involves an infinite dimensional partial integro-differential equation
which does not look very inviting.

> We do not have to solve the equation explicitly to verify that the underlying generalized
Langevin equation relaxes to a Maxwell-Boltzmann (or Gibbs) equilibrium configuration,
provided that the noise correlator is chosen appropriately.

Finally, it is worth noting that the strategy pursued above, i.e. the construction of an inte- 
gral recursion equation in discrete time space (Eq. (10.75)), followed by a Taylor expansion in 
the incremental variables, reflects a general scheme for the derivation of transfer matrix type 
equations. 
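
The statement that the noise kernel of Eq. (10.78) produces a Boltzmann distribution can be checked mode by mode for a Gaussian Hamiltonian. The sketch below assumes $H[\phi] = \frac{1}{2}\int d^dx\,((\partial\phi)^2 + r\phi^2)$, for which a Fourier mode of wavenumber $k$ relaxes at the rate $Dk^{2n}(r + k^2)$ and receives noise of variance $2DTk^{2n}\,dt$ per Euler-Maruyama step. Rather than sampling, it evaluates the fixed point of the one-step variance map in closed form. Parameter values and the function name are illustrative.

```python
import numpy as np

def stationary_variance(k, n, r=1.0, D=1.0, T=0.5, dt=1e-4):
    """Stationary variance of one Fourier mode under Euler-Maruyama stepping.

    Assumed mode dynamics: phi -> (1 - a*dt)*phi + noise, with
    a = D*k**(2n)*(r + k**2) and noise variance 2*D*T*k**(2n)*dt.
    Requires k > 0 for n = 1 (the k = 0 mode of model B is conserved).
    """
    gamma = D * k ** (2 * n)          # (-Laplacian)^n becomes k^(2n)
    a = gamma * (r + k**2)            # relaxation rate of the mode
    shrink = (1.0 - a * dt) ** 2
    # fixed point of the variance map v -> shrink*v + 2*gamma*T*dt
    return 2.0 * gamma * T * dt / (1.0 - shrink)

for n in (0, 1):                      # model A and model B
    for k in (0.5, 1.0, 2.0):
        print(n, k, stationary_variance(k, n), 0.5 / (1.0 + k**2))
```

For both $n = 0$ and $n = 1$ the stationary variance approaches the equipartition value $T/(r + k^2)$ as $dt \to 0$. The factor $k^{2n}$ cancels between drift and noise, which is the content of Eq. (10.78).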


10.6.3 Field theory of finite dimensional stochastic processes 

Problems in reaction kinetics or population dynamics are often encoded in terms of master 
equations defined on a finite dimensional lattice, 

$$\partial_t p_t[n] = \sum_{n'}\big(W[n,n']\,p_t[n'] - W[n',n]\,p_t[n]\big). \qquad (10.79)$$

The notation $p_t[n]$ indicates that the distribution depends on the “function” $\{n_i\}$ where $i$ is
a lattice point and $n_i$ a container variable comprising all relevant local information (the local
concentrations $A_i, B_i, \dots$ of chemicals, etc.). The transition rates, $W[n,n']$ depend, likewise,
on lattice functions. In many applications, transition rates W describe a competition of 
diffusive spreading (a random walk on a lattice) and particle reactions. Systems of this 
kind are termed reaction— diffusion systems. Field theories for reaction-diffusion systems 
can be obtained by straightforward generalization of the (0 + l)-dimensional techniques 
introduced in Sections 10.5.1 and Eq. (10.61). 

By way of example, consider a pure extinction process, $A \xrightarrow{\lambda} \emptyset$, wherein a certain agent
$A$ perishes at a constant rate $\lambda$. Defining $n_i$ = (# of $A$s at lattice site $i$), we may then ask
how the probability $p_t[n]$ evolves under the joint influence of annihilation and diffusive
spreading. A field theory representation of this process can be obtained by generalization
of the Doi-Peliti operator algebra,

$$a_i|\dots,n_i,\dots\rangle = |\dots,n_i-1,\dots\rangle,$$

$$\bar a_i|\dots,n_i,\dots\rangle = (n_i+1)\,|\dots,n_i+1,\dots\rangle, \qquad (10.80)$$

where we represent the population $\{n_i\}$ in terms of a Fock space state. Again, the operator
algebra obeys canonical commutation relations, $[a_i, \bar a_j] = \delta_{ij}$. A straightforward extension
of our discussion of Section 10.5.3 shows that the Hamiltonian generating the stochastic
process above is given by

$$\hat H = -D\sum_{ij}\bar a_i\,\Delta_{ij}\,a_j - \lambda\sum_i (1 - \bar a_i)\,a_i, \qquad (10.81)$$

where $\Delta_{ij}$ is the lattice Laplacian and $D$ a diffusion constant. The corresponding coherent
state field integral action then reads

$$S[\bar\psi,\psi] = \int d^dx\,dt\,\big[\bar\psi(\partial_t - D\Delta)\psi - \lambda\psi(1 - \bar\psi)\big], \qquad (10.82)$$

where we have switched to a continuum notation.

EXERCISE We may also derive a field theory by generalization of the techniques introduced 
in Section 10.5.2. Derive the corresponding action and show that it relates to Eq. (10.82) by a 
Cole-Hopf transformation. 


10.6.4 Fluctuation-dissipation theorem (revisited) 

The field theories above represent our most general descriptions of non-equilibrium many 
particle systems. A question that frequently arises in applications is whether or not a given 
system is in thermal equilibrium. A straightforward answer to this question may be given
as follows: compute the long time stationary probability distribution and check whether it 
is of Maxwell-Boltzmann form. This procedure is generally valid but can be unnecessarily 
cumbersome in practice. (You have to compute the stationary distribution first, and this can 
be hard.) Often, it is more efficient to check whether the fluctuation-dissipation theorem 
- a hallmark of equilibrium systems - is fulfilled. Now, we have encountered the FDT in 
many different forms and contexts. Given its practical importance, it is worthwhile to briefly 
summarize some of the more important incarnations. The last of these, a formulation of the 
FDT for a linear variant of MSRJD field theory, plays an important role in the applications 
below. 


FDT I: Equilibrium linear response 

In Chapter 7 we discussed the apparatus of linear response for equilibrium systems. We
considered the expectation value of certain observables $\hat X(x)$ 36 (with $x = (\mathbf x, t)$ as usual).
Assuming that $\hat X$ is conjugate to a generalized “force,” $F$, and that $X(x) \equiv \langle\hat X(x)\rangle = 0$ in
an unbiased situation, we considered the linear relation

$$X(\omega) = \chi(\omega)F(\omega) + O(F^2), \qquad (10.83)$$

where $\chi(\omega)$ defines a generalized susceptibility. Equation (10.83) establishes a connection
between a macroscopic force ($F$) and a macroscopic observable ($X$) in a “bath”
containing a large number of microscopic degrees of freedom. In this sense, $\chi$ is a quantity
probing “dissipative” physical processes. In Section 7.3 we showed that

$$\chi(\omega) = C^+(\omega), \qquad C^+(t) = -i\Theta(t)\,\langle[\hat X(t),\hat X(0)]\rangle,$$

where $C^+$ was the retarded response function. We then established a connection between
$C^+$ and another correlation function, the time ordered correlation function,

$$C^T(t) = -i\,\langle T_t\,\hat X(t)\hat X(0)\rangle,$$

which describes “fluctuations” of the observable $\hat X$. Specifically, we found that (cf.
Eq. (7.21))

$$\mathrm{Im}\,C^T(\omega) = \mathrm{Im}\,C^+(\omega)\,\coth(\beta\omega/2). \qquad (10.84)$$

This is the quantum fluctuation dissipation theorem. We next wish to explore what
happens to this relation in the classical limit.

36 Chapter 7 was formulated for quantum theories, but that aspect will not be of relevance throughout.

In the classical limit, $\hat{X} \to X(x)$ becomes a function in phase space (formally, the 
Wigner transform of the operator $\hat{X}$; for more information on the Wigner transform, see 
Section 11.5.1), the time dependence, $\hat{X}(t) \to X(x(t))$, resides in the classical time evolution 
of the argument, and the time ordering of operators involved in the definition of $C^T$ becomes 
inessential. This means that the classical limit of the time ordered correlation function is 
given by 

$C_{\mathrm{cl}}(t) = \langle X(t) X(0) \rangle$, 

where 

$\langle X(t) X(0) \rangle \equiv \int dx\, \rho(x)\, X(x(t))\, X(x)$. 

It is not difficult to verify that the Fourier transform $C_{\mathrm{cl}}(\omega) = \mathrm{Im}\, C_{\mathrm{cl}}(\omega)$ is purely imaginary. 
Likewise, it can be shown that the retarded response function approaches the form $C^+ \to 
\hbar C^+_{\mathrm{cl}}$ - note the extra “$\hbar$” - with 

$C^+_{\mathrm{cl}}(t) \equiv \chi_{\mathrm{cl}}(t) = \Theta(t) \langle \{X(t), X(0)\} \rangle$, 

where the Poisson bracket replaces ($-(i/\hbar)$ times) the quantum commutator. (However, 
this expression is not too useful in applications. In practice, one will often try to obtain the 
linear response susceptibility by different means.) 

EXERCISE The expressions above, Poisson brackets replacing commutators, etc., contain the 
“natural” classical counterparts of the quantum formulae. To actually verify their validity, employ 
the Wigner representation of the operator X (cf. Section 11.5.1) and then take the classical limit 
by leading order expansion in $\hbar$. 


Finally, the Fourier argument in Eq. (10.84) has the dimension of a frequency. This means 
that the bosonic distribution function must be understood as $\coth(\beta\hbar\omega/2)$, which approaches 
$2T/\hbar\omega$ in the classical limit $\hbar \to 0$. 

Combining everything, we obtain the classical fluctuation dissipation theorem 

$C(\omega) = \frac{2T}{\omega}\, \mathrm{Im}\, \chi(\omega)$, (10.85) 


where we removed the subscript “cl.” In the theory of steady state phenomena, it is often 
useful to consider a static variant of the FDT. To this end, let us define the equal time 
correlation function as 


$C \equiv \int \frac{d\omega}{2\pi}\, C(\omega) = \langle X(0) X(0) \rangle$. 

The analytic properties of $C(\omega)$ and $\chi(\omega)$ (explore this point) then imply the static version 
of the FDT, 

$C = T \chi(0)$. (10.86) 


FDT II: linear Langevin equations 

In Section 10.2.1 we discussed the FDT within the framework of linear Langevin theory. 
(Here “linear” means Langevin equations that are linear in the dynamical variable.) We 
saw that consistency with the equipartition law of equilibrium thermodynamics implies the 
presence of fluctuations in any dissipative medium. But how does this observation correlate 
with the linear response picture above? 

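To understand this connection, let us look at the Langevin oscillator equation 

$\ddot{q} + \omega_0^2 q + \gamma \dot{q} = \xi$, 

where $\omega_0$ is the oscillator frequency and mass has been set to unity. The FDT as discussed 
in Section 10.2.1 requires that $\langle \xi(t) \xi(t') \rangle = 2\gamma T \delta(t - t')$. In Fourier space, the Langevin 
equation may be solved as 

$q(\omega) = \chi(\omega) \xi(\omega)$, $\chi(\omega) \equiv -\frac{1}{\omega^2 - \omega_0^2 + i\gamma\omega}$. (10.87) 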

The notation indicates that the kernel $\chi$ can be interpreted as the dynamical susceptibility 
describing the response of the coordinate $q$ to the “driving force” $\xi$. The point to 
appreciate here is that the dynamical susceptibility of a system describes both its response 
to fluctuations (as in Langevin theory) and the response to external driving (as in linear 
response theory). 

The Fourier transform of the dynamical correlation function $C(t) = \langle q(t) q(0) \rangle$ reads as 

$C(\omega) = \frac{2\gamma T}{(\omega_0^2 - \omega^2)^2 + (\omega\gamma)^2} = \frac{2T}{\omega}\, \mathrm{Im}\, \chi(\omega)$, (10.88) 

where we have used the relation $\langle \xi(\omega) \xi(\omega') \rangle = 2\gamma T \delta(\omega + \omega')$. Comparing Eq. (10.88) and 
Eq. (10.87) we see that the general fluctuation dissipation theorem Eq. (10.85) and the two 
approaches “FDT from equipartition” and “FDT from linear response” describe different 
sides of the same coin. 
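
The relation between fluctuations and response for the Langevin oscillator can be checked numerically. The following is a minimal sketch, assuming the susceptibility $\chi(\omega) = -1/(\omega^2 - \omega_0^2 + i\gamma\omega)$ and white noise of strength $2\gamma T$; the function names and parameter values are illustrative choices, not taken from the text. It compares $C(\omega) = 2\gamma T |\chi(\omega)|^2$ with $(2T/\omega)\,\mathrm{Im}\,\chi(\omega)$ and estimates the equal time correlation $C = \int \frac{d\omega}{2\pi} C(\omega)$ for comparison with $T\chi(0)$.

```python
import math


def chi(omega, omega0=1.0, gamma=0.5):
    """Susceptibility of the damped oscillator, chi = -1/(w^2 - w0^2 + i*gamma*w)."""
    return -1.0 / (omega**2 - omega0**2 + 1j * gamma * omega)


def noise_correlation(omega, T=1.0, omega0=1.0, gamma=0.5):
    """C(w) = 2*gamma*T*|chi(w)|^2, from q(w) = chi(w) xi(w) with white noise."""
    return 2.0 * gamma * T * abs(chi(omega, omega0, gamma)) ** 2


def fdt_prediction(omega, T=1.0, omega0=1.0, gamma=0.5):
    """Right-hand side of the classical FDT, (2T/w) Im chi(w)."""
    return 2.0 * T / omega * chi(omega, omega0, gamma).imag


def equal_time_correlation(T=1.0, omega0=1.0, gamma=0.5, cutoff=200.0, steps=200_000):
    """Midpoint-rule estimate of C = int dw/(2 pi) C(w) on [-cutoff, cutoff]."""
    h = 2.0 * cutoff / steps
    total = 0.0
    for i in range(steps):
        omega = -cutoff + (i + 0.5) * h
        total += noise_correlation(omega, T, omega0, gamma)
    return total * h / (2.0 * math.pi)


if __name__ == "__main__":
    for w in (0.3, 1.0, 2.7):
        print(w, noise_correlation(w), fdt_prediction(w))
    print("C =", equal_time_correlation(), "T*chi(0) =", chi(0.0).real)
```

With these conventions the two frequency-resolved expressions should agree to rounding error, and the integrated correlation should approach $T/\omega_0^2$, the equipartition value, up to the small contribution of the truncated tails.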


FDT III: MSRJD field theory 

In later sections, we explore the collective behavior of many particle systems by field theoretical 
methods. The FDT will turn out to be an important diagnostic tool to test whether 
a system is in equilibrium or not. Let us, then, explore how the FDT can be probed within 
the framework of field theory. To this end we consider the MSRJD functional Eq. (10.73) 
with noise correlator (10.78). While the FDT Eq. (10.85) is of general validity, we begin by 
considering linear (free) field theories. Assuming that the relaxation of $\phi$ is controlled by 
the quadratic Hamiltonian 

$H[\phi] = \int d^dx \left( \frac{1}{2} (\partial\phi)^2 + \frac{r}{2} \phi^2 - h\phi \right)$, 

where the field $h$ will be used to probe the linear response of the system, the corresponding 
MSRJD functional takes the form 

$Z = \int D(\phi, \psi)\, \exp\left( \int dx \left[ \psi \left( \partial_t \phi + D(-\Delta)^n (-\Delta\phi + r\phi - h) \right) + DT\, \psi (-\Delta)^n \psi \right] \right)$. 


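Switching to momentum space and introducing a matrix notation, 

$Z = \int D(\phi, \psi)\, \exp\left( -S_0[\phi, \psi] - D \int dq\, \psi_q\, q^{2n} h_{-q} \right)$, 

$S_0[\phi, \psi] = -\frac{1}{2} \int dq\, (\phi, \psi)_{-q} \begin{pmatrix} 0 & (g^-_q)^{-1} \\ (g^+_q)^{-1} & 2DTq^{2n} \end{pmatrix} \begin{pmatrix} \phi \\ \psi \end{pmatrix}_q$, 

where $\int dq \equiv \int \frac{d^dk\, d\omega}{(2\pi)^{d+1}}$, $g^\pm_q = (\mp i\omega + Dk^{2n}(k^2 + r))^{-1}$, and we are using an index notation 
$g_q$, instead of $g(q)$ for improved notational clarity. 37 Now, let us employ this functional to 
compute the dynamic susceptibility. Due to the evenness of $S_0$ in the fields, $\langle \phi \rangle_0 = 0$, where 
$\langle \ldots \rangle_0 = \int D(\phi, \psi)\, (\ldots) \exp(-S_0[\phi, \psi])$. To leading order in the driving field, we thus obtain 

$\langle \phi_q \rangle \simeq -D \int dq'\, \langle \phi_q\, q'^{2n} \psi_{-q'} \rangle_0\, h_{q'} = D g^+_q q^{2n} h_q \equiv \chi_q h_q$. 

This leads to the identification $\chi(q) = Dq^{2n} g^+_q$. On the other hand, the fluctuation correlation 
function is given by 

$C(q) \equiv \int dq'\, \langle \phi_q \phi_{-q'} \rangle_0 = 2DTq^{2n} g^+_q g^-_q = \frac{2T}{\omega}\, \mathrm{Im}\, \chi_q$, 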

consistent with the FDT. This consideration shows that the free equilibrium field theory 
conforms with the FDT. However, it is much more challenging to show that this compati- 
bility pertains to the interacting case. In practical applications, one will want to verify (for 
example by perturbative methods, or by renormalized perturbation theory) that (a) the 
equilibrium theory does obey the FDT. One may then proceed to (b) detect deviations from 
equilibrium by violations of the FDT. 
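
The free-theory statement can be probed numerically for individual modes. The sketch below assumes the propagator $g^+ = (-i\omega + Dk^{2n}(k^2 + r))^{-1}$, the susceptibility $\chi = Dk^{2n} g^+$ and the correlation function $C = 2DTk^{2n} g^+ g^-$, with $g^-$ the complex conjugate of $g^+$ at real frequency; all function names and parameter values are hypothetical. The case $n = 0$ corresponds to non-conserved and $n = 1$ to conserved dynamics. The frequency integral is done with the substitution $\omega = a \tan\theta$, which maps the real line onto a finite interval.

```python
import math


def g_plus(omega, k, D=1.0, r=0.5, n=1):
    """Retarded propagator g^+ = (-i w + D k^(2n) (k^2 + r))^(-1)."""
    return 1.0 / (-1j * omega + D * k ** (2 * n) * (k**2 + r))


def susceptibility(omega, k, D=1.0, r=0.5, n=1):
    """Dynamic susceptibility chi = D k^(2n) g^+."""
    return D * k ** (2 * n) * g_plus(omega, k, D, r, n)


def correlation(omega, k, T=1.0, D=1.0, r=0.5, n=1):
    """C = 2 D T k^(2n) g^+ g^-, where g^- is the complex conjugate of g^+."""
    return 2.0 * D * T * k ** (2 * n) * abs(g_plus(omega, k, D, r, n)) ** 2


def static_correlation(k, T=1.0, D=1.0, r=0.5, n=1, steps=2000):
    """int dw/(2 pi) C, using w = a tan(theta) with theta in (-pi/2, pi/2)."""
    a = D * k ** (2 * n) * (k**2 + r)
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        theta = -0.5 * math.pi + (i + 0.5) * h
        omega = a * math.tan(theta)
        total += correlation(omega, k, T, D, r, n) * a / math.cos(theta) ** 2
    return total * h / (2.0 * math.pi)


if __name__ == "__main__":
    for n in (0, 1):
        lhs = correlation(0.7, 1.3, n=n)
        rhs = 2.0 / 0.7 * susceptibility(0.7, 1.3, n=n).imag
        print(n, lhs, rhs, static_correlation(1.3, n=n), susceptibility(0.0, 1.3, n=n).real)
```

For both values of $n$ the integrated correlation should reproduce $T\chi(\omega = 0) = T/(k^2 + r)$, which is the static version of the FDT for this Gaussian model. A check of this kind does not replace the perturbative verification required in the interacting case.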

INFO In the theory of continuum systems, the correlation function $C(q) = C(\mathbf{q}, \omega) \equiv S(\mathbf{q}, \omega)$ is 
often referred to as the dynamic structure factor. This notation alludes to the fact that the 
function $C(\mathbf{q}, \omega)$ describes spatio-temporal fluctuations in the system at the characteristic length 
scale $|\mathbf{q}|^{-1}$ and time scale $\omega^{-1}$. The FDT establishes a connection between the dynamic structure 
factor and the dynamic susceptibility. Likewise, the static version of the FDT Eq. (10.86) relates 
the static structure factor $C(\mathbf{q}) = S(\mathbf{q})$ to the static susceptibility. 

Finally, note that we may be confronted with a system that is kept at some predetermined 
noise level (possibly determined by the action of an underlying microscopic agent, or due 
to coupling to an external environment). One may then argue that an effective equilibrium 
emerges wherein the noise strength determines the effective temperature by way of the FDT 
condition. 


If you find the sign-difference between the retarded and the advanced Green function unfamiliar, keep in mind 
that we are considering irreversible dynamics. Compute the time representation of $g^\pm$ for the case $n = 0$, $r = 0$, 
to show that the sign properly distinguishes between retarded and advanced propagation. 

Having invested considerable effort into deriving various field theories of nonequilibrium 
systems, it is now time to apply the formalism. However, before doing so, let us identify a 
number of questions we may want to address: 

▷ In Section 10.5.2 we assumed knowledge of a certain “Hamiltonian” $H[\phi]$ that drives the 
order parameter field $\phi$ to an equilibrium configuration. But how can such a Hamiltonian 
be motivated and constructed for concrete physical systems? 
▷ What can field theory say about the different phases realized in and out of equilibrium? 
▷ And what can be said about the transitions between these phases? The theories constructed 
above bear structural similarity to different equilibrium theories discussed earlier 
in the text. To what extent can concepts developed earlier for equilibrium systems (the 
notion of upper and lower critical dimension, mean field theory, renormalized theory of 
fluctuations, etc.) be generalized to the present context? 
▷ How do critical correlations reflect in the dynamical behavior of a system? 

These are but a few questions that can be asked. Rather than attempting to find general 
answers, we shall address some of these questions on two specific examples. Even so, our 
discussion will be rather superficial and can provide little more than a glimpse of the theory 
of nonequilibrium critical phenomena. (Some guides to further reading are given below.) 


10.7.1 Driven diffusive lattice gases 

Imagine yourself confronted with the following exercise: engineer a genuine many particle 
system (a system containing particle dynamics and interaction) that is (a) simple, (b) 
couples to an efficient mechanism driving an out-of-equilibrium situation, and (c) reduces 
to one of the “large” universality classes of physics in the equilibrium limit. It is actually 
not so easy to come up with a good answer. 

One system that meets these criteria reasonably well is the driven diffusive lattice gas 
(see Fig. 10.10). As is indicated by the name, it involves a system of particles hopping on a 
lattice. The particles may hop from one site to the next at some rate t, and are subject to 
nearest neighbour repulsion as described by the Hamiltonian 

$H[n] = J \sum_{\langle ij \rangle} n_i n_j$. (10.89) 

Here $n_i \in \{0, 1\}$, reflecting that each site can be occupied by one particle at most. For 
simplicity we imagine periodic boundary conditions in all directions, i.e. the system lives 
on a d-dimensional torus. 

We next allow the particles to hop between nearest neighbour lattice sites (subject to 
the condition of no more than single occupancy). This will render the system dynamical 
and allow it to settle in a configuration compromising between single particle dynamics and 
interactions. To see this more clearly, imagine the dynamics described in terms of a master 
equation. Interactions will influence transition rates in this equation, but not change the 
configurations by themselves. Above we have seen that the changes in the configurations 
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Figure 10.10 The driven diffusive lattice gas. A system of nearest neighbour interacting particles 
hopping on a lattice and subject to periodic boundary conditions. The system is driven by a 
constant force. 

(presently caused by hopping) afford an interpretation in terms of Langevin dynamics. 38 
This means that we may think of the capacity to hop in terms of some effective thermal 
activation, or noise acting on the system. This noise will enable the gas to settle in an 
effective equilibrium configuration favoured by the interaction, at an effective temperature 
set by the hopping rates. 

To drive the system out of equilibrium, we introduce a force acting uniformly along one 
of the directions of the torus. Moving against its action once around the torus (a closed 
loop) costs energy, i.e. the force is not conservative, and there is no corresponding potential. 
Rather, we may think about the force as an “electromotive” force due to piercing the torus 
by a time dependent magnetic field. The coupling to this “work reservoir” gives rise to a 
nonequilibrium situation. Lack of a conservative potential invalidates descriptions of the 
system in terms of a Hamiltonian; this is the ultimate motivation to describe it in terms of 
master equations and effective Langevin equations. In this language, the effect of the force 
will be a bias in the hopping rates along one of the toroidal directions. 

An important preliminary conclusion to be drawn from this consideration is that the 
equilibrium physics of our system will fall into the Ising universality class. Indeed, we 
may introduce spin variables $S_i = 2n_i - 1 \in \{-1, 1\}$. Expressed in terms of the $S_i$s, the 
interaction Hamiltonian H[S] assumes the form of an Ising Hamiltonian, where the sign of 
J determines whether the coupling is ferromagnetic or antiferromagnetic. In equilibrium, 


A quadratic approximation of the transition generators, the momenta conjugate to the occupation number 
variables, led to a quadratic contribution to the Hamiltonian action formally equivalent to the averaged noise 
term in the MSRJD functional. 




10.7 Field theory III: applications 




the system will be described by a Gibbs distribution, $\sim \exp(-H[S]/T)$, at a temperature 
set by the hopping rates. Introduction of a finite bias is expected to cause departures from 
the Ising/$\phi^4$ universality class. 

Summarizing, the driven diffusive lattice gas conforms with the three principles above: 
it is relatively simple, contains nonequilibrium driving, and relates to the big Ising universality 
class. Sadly, there do not seem to be any direct experimental realizations of this 
system, although it may be argued that aspects of its behavior reflect in the physics of ionic 
conductors, microemulsions of charged particles, and DNA gel electrophoresis. 39 

Our theoretical approach to the system below is organized in a number of steps. After 
reviewing elements of its microscopic physics, we introduce a mesoscopic theory describing 
the approach to equilibrium in the unbiased case, as much as the principal effects of the 
driving mechanism. We expect the consequences of driving to be particularly strong if the 
system is tuned to its equilibrium transition point. Less evidently, however, it turns out 
that driving also causes remarkable phenomenological consequences above criticality (the 
“disordered” phase). In our analysis below, we first address the latter regime, and then turn 
towards the more ambitious study of criticality. It should be emphasized that, to date, many 
aspects of driven diffusive systems remain poorly understood, and there is a lack of direct 
experimental data. In our discussion of the system, we thus have to draw more liberally than 
usual from a mix of field theoretical evidence, simulation data, and common sense reasoning. 
The material assembled below is a fragmentary excerpt from the text by Schmittmann and 
Zia. 39 


Microscopic formulation 

Consider the master equation Eq. (10.79) for the evolution of the lattice gas population. In 
the absence of an external field, we have a Hamiltonian H[n] assigning to each configuration 
n an energy. In equilibrium, the system will settle into a configuration described by the 
distribution, 

$P_{\mathrm{eq}}[n] \overset{E=0}{=} \mathcal{N} \exp(-H[n]/T)$. (10.90) 

We do not have to specify the transition rates $W[n, n']$ explicitly to say that compatibility 
with Eq. (10.90) implies the detailed balance relation 

$\frac{W[n, n']}{W[n', n]} = \frac{P_{\mathrm{eq}}[n]}{P_{\mathrm{eq}}[n']}$. (10.91) 

This relation is satisfied, e.g., by the choice $W[n, n'] = \mathrm{const.} \times \exp(-\beta(H[n] - H[n'])/2)$. 
More generally, any $W[n, n'] = f(\beta(H[n] - H[n']))$ with $f(x)/f(-x) = \exp(-x)$ will also 
do the job. In the presence of a driving field, we do not have a globally defined Hamiltonian 
any more. Locally, however, we may assign to a configuration $n$ an “energy” that increases 
if we move against the field. Modeling the dynamics in terms of transitions $n \to n'$ that 
involve only local changes in the configuration, we may generalize according to 

$W[n, n'] \to f(\beta(H[n] - H[n'] + kaE))$, 


B. Schmittmann and R. K. P. Zia, Statistical Mechanics of Driven Diffusive Systems (Academic Press, 1995). 

Figure 10.11 Three phases of the driven diffusive lattice gas. Left: disordered (unmagnetized) phase 
at high temperatures. Center: ordered (magnetized) islands at low temperatures and zero field. 
Right: ordered (magnetized) stripes at low temperatures in the presence of a driving field. 


where a is the lattice spacing and k the number of steps particles have to climb against the 
direction of the field E to get from configuration n' to n. 
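
The rate construction can be illustrated with a few lines of code. The sketch below lists three rate functions obeying $f(x)/f(-x) = \exp(-x)$: the exponential choice quoted above, plus Metropolis and heat-bath forms, which are standard choices added here for illustration and are not taken from the text. It then evaluates the biased rate $f(\beta(H[n] - H[n'] + kaE))$ and multiplies the forward/backward rate ratios for one particle carried once around a ring against the field. All function names and parameter values are hypothetical.

```python
import math


def rate_exponential(x):
    """f(x) = exp(-x/2), the choice quoted in the text (constant set to one)."""
    return math.exp(-0.5 * x)


def rate_metropolis(x):
    """f(x) = min(1, exp(-x)); an illustrative alternative."""
    return min(1.0, math.exp(-x))


def rate_heat_bath(x):
    """f(x) = 1/(1 + exp(x)); another illustrative alternative."""
    return 1.0 / (1.0 + math.exp(x))


def driven_rate(f, beta, delta_H, k_steps, a, E):
    """Rate for a move that changes the energy by delta_H and climbs
    k_steps lattice spacings against the field E."""
    return f(beta * (delta_H + k_steps * a * E))


def loop_rate_ratio(f, beta, a, E, L):
    """Product of forward/backward rate ratios for one particle carried once
    around a ring of L sites against the field (delta_H = 0 on every step)."""
    ratio = 1.0
    for _ in range(L):
        forward = driven_rate(f, beta, 0.0, +1, a, E)
        backward = driven_rate(f, beta, 0.0, -1, a, E)
        ratio *= forward / backward
    return ratio


if __name__ == "__main__":
    for f in (rate_exponential, rate_metropolis, rate_heat_bath):
        print(f.__name__, f(0.8) / f(-0.8), math.exp(-0.8),
              loop_rate_ratio(f, 1.0, 1.0, 0.3, 10))
```

For $E = 0$ the loop product equals one, as required for rates derived from a global Hamiltonian. For $E \neq 0$ it equals $\exp(-\beta L a E)$, so no single-valued “energy” can reproduce the rates on the torus, which is the sense in which the driven system lacks a globally defined Hamiltonian.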

Suppose we have modelled the system in terms of transition rules satisfying the above 
constraints. The corresponding master equation is the starting point for computer simulations. 
Given the lack of experimental data, and the relative unfamiliarity of the system, such 
simulations 40 have played an important role in guiding analytical work: not surprisingly, it 
has been confirmed that the field-free system falls into the Ising universality class. However, 
contrary to the standard Ising model, the total magnetization $M = \sum_i S_i = 2\sum_i n_i - N$ 
($N$ is the total number of lattice sites) is conserved in our system. This means that, above 
a critical temperature $T_c$, the system will be in a “disordered” phase, i.e. particles spread 
randomly across the lattice (see Fig. 10.11 left). Below $T_c$, we enter an ordered phase characterized 
by the presence of magnetized domains (particle clusters). The total area of spin 
up and spin down domains is determined by the concentration of particles in the system 
(Fig. 10.11 middle). A cursory inspection of numerical data may lead to the impression that 
above $T_c$, the application of a field has not much of an effect. We will see, however, that 
this is not the case. Finally, it has been found numerically that below $T_c$ and for $E \neq 0$, the 
magnetized areas arrange in stripes parallel to the field direction (see Fig. 10.11 right). 


S. Katz, J. L. Lebowitz, and H. Spohn, Nonequilibrium steady states of stochastic lattice gas models of fast 
ionic conductors, J. Stat. Phys. 34, 497-537 (1984). 

Below we try to explain aspects of these observations analytically. In principle, master 
equations based on the transition rates W[n,n'] may serve as a starting point towards the 
construction of an analytical theory, along the lines of our discussion in Section 10.6.3. 
However, the reduction of the resulting “microscopic field theory” to an effective long range 
theory will arguably be laborious. Instead, we will build our construction on a “mesoscopic 
formulation” wherein the dynamics of the site occupancy is described in coarse-grained 
terms. 


Mesoscopic formulation 

To provide a coarse-grained description of the system, we describe its state in terms of its 
continuous magnetization $\phi(\mathbf{x}, t) = \langle 2n(\mathbf{x}, t) - 1 \rangle$, where $\langle \ldots \rangle$ is symbolic for an average 
over many lattice sites. Let us consider the field free case first. We then know from our 
discussion in Section 5.1 that the effective Hamiltonian governing the static profile of the 
magnetization variable in the absence of driving is the $\phi^4$-functional, 

$H[\phi] = \frac{1}{2} \int d^dx \left( (\partial\phi)^2 + r\phi^2 + \frac{u}{12} \phi^4 \right)$. (10.92) 

This Hamiltonian will determine the form of the Langevin equation. Second, we know that 
magnetization is a “conserved quantity,” $\int d^dx\, \phi = \mathrm{const.}$, by virtue of particle number 
conservation. Thus, our model belongs to class B in the classification of Section 10.6.2, and 
we may write down an effective Langevin equation of motion, 

$\partial_t \phi = D\Delta \frac{\delta H}{\delta\phi} + \partial \cdot \xi = D\Delta \left( r\phi - \partial^2\phi + \frac{u}{3!} \phi^3 \right) + \partial \cdot \xi$. (10.93) 

Here we have absorbed the derivatives appearing in the model B noise correlator Eq. (10.78) 
into the definition of a vectorial noise field, $\langle \xi_i(x) \xi_{i'}(x') \rangle = 2A\delta_{ii'}\delta(x - x')$, and the constant 
$A$ determines the effective equilibrium temperature as $A = DT$. 
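
The conservation law built into the model B equation can be made explicit in a lattice discretization. The following is a minimal sketch in one dimension with periodic boundary conditions, assuming the chemical potential $\delta H/\delta\phi = r\phi - \partial^2\phi + (u/6)\phi^3$ and a noise that enters as the lattice divergence of bond variables with strength $A = DT$. The explicit Euler scheme, the function names and all parameter values are illustrative assumptions, not a scheme proposed in the text.

```python
import math
import random


def laplacian(field, dx):
    """Second difference on a ring (index -1 wraps around)."""
    n = len(field)
    return [(field[(i + 1) % n] - 2.0 * field[i] + field[i - 1]) / dx**2 for i in range(n)]


def model_b_step(phi, dt, dx, D, r, u, A, rng):
    """One Euler step of d_t phi = D Lap(r phi - Lap phi + u/6 phi^3) + d_x xi."""
    n = len(phi)
    lap_phi = laplacian(phi, dx)
    mu = [r * phi[i] - lap_phi[i] + u / 6.0 * phi[i] ** 3 for i in range(n)]
    lap_mu = laplacian(mu, dx)
    # Noise lives on bonds; its lattice divergence sums to zero on a ring.
    amplitude = math.sqrt(2.0 * A / (dx * dt))
    xi = [amplitude * rng.gauss(0.0, 1.0) for _ in range(n)]
    return [phi[i] + dt * (D * lap_mu[i] + (xi[i] - xi[i - 1]) / dx) for i in range(n)]


def simulate(n_sites=64, n_steps=200, dt=0.01, dx=1.0, D=1.0, r=1.0, u=1.0, T=0.1, seed=1):
    """Return the initial and final total magnetization and the final profile."""
    rng = random.Random(seed)
    phi = [0.2 * rng.uniform(-1.0, 1.0) for _ in range(n_sites)]
    initial_total = sum(phi)
    for _ in range(n_steps):
        phi = model_b_step(phi, dt, dx, D, r, u, D * T, rng)
    return initial_total, sum(phi), phi


if __name__ == "__main__":
    before, after, _ = simulate()
    print("total magnetization before/after:", before, after)
```

Because both the deterministic term and the noise are lattice divergences, the total magnetization changes only by rounding errors, independent of the noise realization. The time step is chosen small enough for the explicit scheme to remain stable at these parameters; other parameter choices may require a smaller step.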

Let us now explore what happens once driving is switched on. As discussed above, the 
Hamiltonian cannot accommodate the non-conservative driving field. However, we may 
argue that the right-hand side of the Langevin equation represents the divergence of a 
current. An external field will cause a current that (i) is proportional to the field E in 
direction and strength, and (ii) vanishes if neither particles nor holes are present. This 
motivates the ansatz, 

$\mathbf{j} = 4n(1 - n)\mathbf{E} = (1 - \phi^2)\mathbf{E}$, (10.94) 

where a constant of proportionality has been absorbed in a redefined E. The addition of 
the divergence of this current to the right-hand side of Eq. (10.93) will generate the bare 
version of a generalized Langevin equation. However, we must keep in mind that the strong 
anisotropy introduced by the field will, by way of renormalization, render the effective 
parameters of the Langevin equation anisotropic as well. This motivates the more general anisotropic 
representation 

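$\partial_t \phi = D \left[ \left( r\partial^2 - \alpha\partial^4 + \hat{r}\hat{\partial}^2 - \hat{\alpha}\hat{\partial}^4 - \beta\partial^2\hat{\partial}^2 \right) \phi + E\partial\phi^2 + \frac{1}{3!} \left( u\partial^2 + \hat{u}\hat{\partial}^2 \right) \phi^3 \right] - \partial\xi - \hat{\partial} \cdot \hat{\xi}$, (10.95) 

where quantities carrying a caret refer to the $d - 1$ directions transverse to the field, and the 
various coefficients account for the possibility of anisotropic renormalization. Accordingly, 
the effective noise correlator takes the form 

$\langle \xi(x) \xi(x') \rangle = 2A\delta(x - x')$, 

$\langle \hat{\xi}_i(x) \hat{\xi}_{i'}(x') \rangle = 2\hat{A}\delta_{ii'}\delta(x - x')$. (10.96) 

The MSRJD functional of this equation (cf. Eq. (10.53)) is given by $Z = \int D(\phi, \psi)\, \exp(S[\phi, \psi])$, 
where 

$S[\phi, \psi] = \int dx \left( \psi \left( \partial_t \phi - D \left[ \left( r\partial^2 - \alpha\partial^4 + \hat{r}\hat{\partial}^2 - \hat{\alpha}\hat{\partial}^4 - \beta\partial^2\hat{\partial}^2 \right) \phi + E\partial\phi^2 + \frac{1}{3!} \left( u\partial^2 + \hat{u}\hat{\partial}^2 \right) \phi^3 \right] \right) + A(\partial\psi)^2 + \hat{A}(\hat{\partial}\psi)^2 \right)$. (10.97) 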

In the following sections we aim to extract some physical information from this seemingly 
formidable expression. 

Above criticality: consequences of FDT violation 
Let us consider the system at temperatures $T > T_c$ far above its ordering transition. Drawing 
upon our experience with equilibrium physics, we expect the system to be in a disordered 
state, void of any long range (power law) correlations. Remarkably, however, the physics of 
the driven system turns out to be different. We will see that the external field leads to long 
range correlations and critical phenomena even in the high temperature regime. 

Following the standard logic in the theory of continuous phase transitions we assume that, 
for $T > T_c$, the intrinsic nonlinear terms (such as $\sim \psi\phi^3$) provide subleading contributions 
to the dynamics. Alluding to their low operator relevance, we will also neglect all terms 
containing more than two gradient operators. We thus consider the bare action 

$S[\phi, \psi] = \int dx \left( \psi \left( \partial_t \phi - D \left[ r\Delta\phi + E\partial\phi^2 \right] \right) - A\psi\Delta\psi \right)$, 

where we have anticipated that the bare theory is isotropic, $A = \hat{A}$ and $r = \hat{r}$, and the 
driving term provides the only non-Gaussian contribution to the action. The importance of 
the driving term can be assessed by computing its engineering dimension. To this end, let 
us consider the effect of a rescaling $\mathbf{x} \to \mathbf{x}' = \mathbf{x} b^{-1}$. Requiring that all Gaussian terms in 
the action remain scale invariant, we obtain the relations 

$\mathbf{x}' = b^{-1}\mathbf{x}$, $t' = b^{-2}t$, 
$\phi' = b^{d/2}\phi$, $\psi' = b^{d/2}\psi$. 

Substituting the rescaled quantities into the action, we are led to the renormalization $E' = 
b^{-d/2+1}E$. Thus, the upper critical dimension of our nonlinearity is $d_c = 2$. Below $d = 2$, the 
driving term is relevant, at $d = 2$ it is marginal, and above irrelevant. 
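
The power counting behind this statement can be organized in a few lines. The sketch below assumes the conventions $\mathbf{x} = b\mathbf{x}'$, $t = b^2 t'$, $\phi = b^{-d/2}\phi'$ and $\psi = b^{-d/2}\psi'$, and returns the exponent $y$ in $g' = b^y g$ for the coupling $g$ of a term with a given number of derivatives and fields. The function names are hypothetical.

```python
from fractions import Fraction


def coupling_exponent(d, n_time, n_space, n_fields):
    """Exponent y in g' = b^y g for a term with n_time time derivatives,
    n_space spatial derivatives and n_fields fields (phi or psi)."""
    measure = d + 2                        # d^dx dt with x = b x', t = b^2 t'
    derivatives = -2 * n_time - n_space    # each d_t gives b^-2, each d_x gives b^-1
    fields = -n_fields * Fraction(d, 2)    # phi = b^(-d/2) phi', same for psi
    return measure + derivatives + fields


def drive_exponent(d):
    """Exponent of the driving term psi E d(phi^2): one gradient, three fields."""
    return coupling_exponent(d, n_time=0, n_space=1, n_fields=3)


if __name__ == "__main__":
    for d in (1, 2, 3):
        gaussian = [coupling_exponent(d, 1, 0, 2), coupling_exponent(d, 0, 2, 2)]
        print(d, gaussian, drive_exponent(d))
```

The Gaussian terms come out scale invariant by construction, while the driving term scales with exponent $1 - d/2$, which is positive below, zero at, and negative above two dimensions.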

However, it would be premature to conclude that in dimensions $d > 2$, driving has no 
physical effects! To appreciate this point, imagine that on the short distance scales $|\mathbf{x}| \sim l$ 
where the renormalization process is started, our action is isotropic in that $r(l) = \hat{r}(l)$ and 
$A(l) = \hat{A}(l)$. In the course of renormalization, the effective strength of $E$ will diminish 
(exponentially, if $d > 2$). However, the residual $E(L)$ will also renormalize the coupling 
constants of the Gaussian model, and it will do so in an anisotropic manner. 

EXERCISE Identify the interaction processes that cause the renormalization of the coupling 
constants. You may also want to formulate the RG equations. 


At large length scales, we thus have an effective Gaussian model, E = 0, which, however, is 
anisotropic. In other words, we have a family of Gaussian fixed points, each described 
by a free action 


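$S[\phi, \psi] = \int dx \left( \psi \left( \partial_t \phi - D \left( r\partial^2 + \hat{r}\hat{\partial}^2 \right) \phi \right) + A(\partial\psi)^2 + \hat{A}(\hat{\partial}\psi)^2 \right)$ 

$= \frac{1}{2} \int dq\, (\phi, \psi)_{-q} \begin{pmatrix} 0 & (g^-_q)^{-1} \\ (g^+_q)^{-1} & 2(Ak^2 + \hat{A}\hat{k}^2) \end{pmatrix} \begin{pmatrix} \phi \\ \psi \end{pmatrix}_q$, 

where $g^\pm = (\mp i\omega + D(rk^2 + \hat{r}\hat{k}^2))^{-1}$, and $q = (\mathbf{k}, \omega)$ with $\mathbf{k} = (k, \hat{\mathbf{k}})^T$. As we are going to 
show next, each of these actions exhibits “critical behavior” in that it generates anisotropic 
power law correlations in the system. Indeed, consider the dynamic structure factor 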


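$S(\mathbf{k}, \omega) \equiv \int dq'\, \langle \phi_q \phi_{-q'} \rangle = 2(Ak^2 + \hat{A}\hat{k}^2)\, g^+_q g^-_q = 2\, \frac{Ak^2 + \hat{A}\hat{k}^2}{\omega^2 + D^2 (rk^2 + \hat{r}\hat{k}^2)^2}$. (10.98) 

Integration over $\omega$ generates the static structure factor 

$S(\mathbf{k}) \equiv \int \frac{d\omega}{2\pi}\, S(\mathbf{k}, \omega) = \frac{1}{D}\, \frac{Ak^2 + \hat{A}\hat{k}^2}{rk^2 + \hat{r}\hat{k}^2}$. (10.99) 
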

In equilibrium, the FDT enforces the fraction on the right hand side of this equation to equal unity. 

To appreciate this point, remember our discussion at the end of Section 10.6.2. We had seen that a 
(model B) system may relax to equilibrium if the noise correlator is weighted by a Laplace operator 
(cf. Eq. (10.78)) which compensates for the Laplace operator in Eq. (10.71). In the present context, both 
operators get renormalized to anisotropic second order differential operators. Specifically, the noise- 
or “fluctuation” Laplace operator gets renormalized according to $\Delta_f \to k^2 + \frac{\hat{A}}{A}\hat{k}^2$, while 
the “dissipation” operator becomes $\Delta_d \to k^2 + \frac{\hat{r}}{r}\hat{k}^2$. An equilibrium configuration at some 
effective temperature $T_{\mathrm{eff}}$ can be approached (cf. the construction above Eq. (10.78)), if 
and only if $\Delta_f \propto \Delta_d$. While this does not necessarily imply isotropy in space, a minimal 
requirement reads 

$w \equiv \frac{A\hat{r}}{\hat{A}r} = 1$. 

This condition entails $S(\mathbf{k}) = \mathrm{const.}$ Equivalently, violation of the FDT condition $w = 1$ 
implies singular (albeit finite) behavior of the structure factor. The singularity can, e.g., be 
characterized by the ratio 

$R \equiv \frac{S(0, \hat{\mathbf{k}} \to 0)}{S(k \to 0, 0)}$. 
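
The direction dependence of the small-momentum limit can be made concrete numerically. The sketch below assumes the static structure factor of Eq. (10.99) in the form $S(\mathbf{k}) = \frac{1}{D}(Ak^2 + \hat{A}\hat{k}^2)/(rk^2 + \hat{r}\hat{k}^2)$ and evaluates the ratio of its transverse and longitudinal limits. The function names and parameter values are hypothetical.

```python
def static_structure_factor(k, k_hat, A, A_hat, r, r_hat, D=1.0):
    """S(k) = (A k^2 + A_hat k_hat^2) / (D (r k^2 + r_hat k_hat^2))."""
    return (A * k**2 + A_hat * k_hat**2) / (D * (r * k**2 + r_hat * k_hat**2))


def anisotropy_ratio(A, A_hat, r, r_hat, D=1.0, eps=1e-6):
    """Ratio of the limits k_hat -> 0 at k = 0 and k -> 0 at k_hat = 0."""
    transverse = static_structure_factor(0.0, eps, A, A_hat, r, r_hat, D)
    longitudinal = static_structure_factor(eps, 0.0, A, A_hat, r, r_hat, D)
    return transverse / longitudinal


if __name__ == "__main__":
    # Fluctuation and dissipation operators proportional: A_hat/A = r_hat/r.
    print(anisotropy_ratio(A=1.0, A_hat=2.0, r=0.5, r_hat=1.0))
    # Proportionality violated: the limit k -> 0 depends on direction.
    print(anisotropy_ratio(A=1.0, A_hat=2.0, r=1.0, r_hat=1.0))
```

When $\hat{A}/A = \hat{r}/r$ the structure factor is a constant and the ratio equals one. Otherwise the ratio equals $(\hat{A}/\hat{r})/(A/r)$, so that $S(\mathbf{k})$ remains finite but has no unique limit at $\mathbf{k} \to 0$.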

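The anisotropic singularity of the structure factor implies power law correlations in coordinate 
space. To see this, let us consider the case $d = 3$ for concreteness and compute the 
Fourier transform 

$S(\mathbf{x}) \equiv \int \frac{d^3k}{(2\pi)^3}\, e^{-i(xk + \hat{\mathbf{x}} \cdot \hat{\mathbf{k}})} S(\mathbf{k}) \propto \int d^3k\, e^{-i(\tilde{x}k + \hat{\mathbf{x}} \cdot \hat{\mathbf{k}})}\, \frac{wk^2 + \hat{k}^2}{k^2 + \hat{k}^2}$ 

$\propto -\left( w\partial_{\tilde{x}}^2 + \hat{\partial}^2 \right) \frac{1}{(\tilde{x}^2 + \hat{x}^2)^{1/2}}$ 

$\to -(w - 1)\, \partial_{\tilde{x}}^2 \frac{1}{(\tilde{x}^2 + \hat{x}^2)^{1/2}}$ 

$= -(w - 1)\, \frac{2\tilde{x}^2 - \hat{x}^2}{(\tilde{x}^2 + \hat{x}^2)^{5/2}}$, 

where $\tilde{x} = (\hat{r}/r)^{1/2} x$ is a rescaled coordinate and in the third line we discarded a contribution 
$(\partial_{\tilde{x}}^2 + \hat{\partial}^2)(\tilde{x}^2 + \hat{x}^2)^{-1/2} \propto \delta(\mathbf{x})$, because our effective theory holds only for large $|\mathbf{x}|$ anyway. 
We thus come to the surprising conclusion that 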


the high temperature (“disordered”) phase of the system supports 
power law correlations in coordinate space. 


These correlations reflect the presence of a line of inequivalent Gaussian fixed point theories. 

INFO Can these power law correlations above criticality actually be observed in real systems? 
Anisotropies in the structure factor such as those shown in the figure above have actually 
been measured in uniaxial ferromagnets with dipolar interactions. 41 However, these systems support 
long range correlations on the bare level, and it is not really clear whether they share a 
universality class with our short range Ising type benchmark model. However, singular behavior 
in correlation functions is definitely observable in Monte Carlo simulations 40 of the system. 

It is also worth noting that the mechanisms whereby long range correlations were generated 
in our discussion above appear to be of rather general nature: to appreciate this point, consider 
the temporal decay of correlations in a diffusive environment $C(0, t) \sim t^{-d/2}$. The dynamical 
exponent of diffusive dynamics is given by $z = 2$. Power counting arguments would then suggest 
that $C(\mathbf{x}, t) \sim |\mathbf{x}|^{-d}$ at fixed $t$. However, in equilibrium statistical mechanics, we are more 
accustomed to exponentially decaying correlations (power law correlations indicating criticality 
being an exception). 

Our discussion above signals that the paradigm “no long range correlations away from criticality” 
may not have quite as generic a validity as is suggested by the theory of equilibrium. 
Rather, we saw that violations of the FDT may generate power law correlations far away from 
critical points. 


J. Skalyo, B. C. Frazer, and G. Shirane, Ferroelectric-mode motion in KD$_2$PO$_4$, Phys. Rev. B 1, 278-86 (1970). 

EXERCISE Retrace the discussion above for an isotropic system with conserved diffusive 
dynamics. Convince yourself that the FDT enforces vanishing of power law correlations in the 
correlation function. 

Inasmuch as equilibrium (obedience to FDT) is an exception rather than the rule, we may expect 
that long range correlations in nature are of more frequent occurrence than what is implied by 
equilibrium statistical mechanics. 


The system at criticality 

At some critical T c the system undergoes a transition into its ordered phase. Numerical 
analysis has shown that, in the presence of strong driving, this ordered phase contains 
stripes of uniform magnetization. The formation of stripes may be diagnosed by monitoring 
the behavior of the static structure factor upon changing the temperature. Specifically, 
we expect that: (i) the proximity to the phase transition point is characterized by some 
correlation length, $\xi$. (ii) At length scales larger than $\xi$, the structure factor will exhibit the 
$\sim |\mathbf{x}|^{-d}$ scaling discussed in the previous section. (iii) At length scales smaller than $\xi$ we 
expect anisotropic scaling; correlations in the direction parallel to $\mathbf{E}$ will be qualitatively 
different from those in the transverse direction. In the following, we tune the system right 
into the critical point and explore its correlation behavior. 

However, before doing so, we first have to identify the transition point. In equilibrium 
$\phi^4$-theory, the transition occurs when the (renormalized) coefficient of the quadratic term 
in the action $\sim \phi^2$ vanishes. Presently, however, we have two such terms, with coefficients 
$r$ and $\hat{r}$, respectively. This raises the question whether the ordering transition occurs at (i) 
$r = 0$, $\hat{r} \neq 0$, or (ii) $r \neq 0$, $\hat{r} = 0$, or (iii) $r = \hat{r} = 0$. In the following, we go through a 
construction that self-consistently shows (ii) to be the correct variant. 42 

Let us assume that $r \neq 0$, $\hat{r} = 0$ characterizes the system at criticality. We begin by 
determining the engineering scaling dimensions of the various contributions to the action. 
Specifically, we should like to identify the upper critical dimension, i.e. the dimension below 
which nonlinear terms become qualitatively important. To this end, let us use the freedom 
to independently scale $\hat{\mathbf{x}}$, $x$, $t$, $\phi$, $\psi$ in a manner to leave the dominant quadratic contributions 
to the action invariant. Defining the scale transformation 


$\hat{\mathbf{x}} \to b\hat{\mathbf{x}}$, $x \to b^a x$, $t \to b^z t$, 

$\phi \to b^{d_\phi}\phi$, $\psi \to b^{d_\psi}\psi$, (10.100) 


At first sight this claim may seem surprising. Since we expect ordering in the field direction, would not one 
expect the coefficient corresponding to that direction to vanish? 
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let us consider the scaling behavior of different contributions to the action Eq. (10.97), 
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We now choose the scaling dimensions and a so as to make the first four contri- 

butions to the list above scale invariant. This generates the conditions 

0 = 4, a = 2 , d* = ^J% ( 10 . 102 ) 

With these values, we can estimate the engineering dimensions of the remaining terms.
Specifically, we find that (a) the noise term $\propto A$ is strongly irrelevant in all dimensions,
(b) the interaction term $\propto u$ is less relevant (by two dimensions) than the interaction term
$\propto \tilde u$, (c) the driving term $\propto E$ carries dimension $(5 - d)/2$, and (d) the interaction term
$\propto \tilde u$ carries dimension $3 - d$. Specifically, point (c) implies that the model has upper
critical dimension $d_c = 5$. Notice that the high value of the upper critical dimension
reflects the strongly intrusive role of the driving: below $d = 5$, the physics of the model
becomes fluctuation dominated, earlier than in the equilibrium model with its $d_c = 4$. In
addition, the driving term is more relevant than the interaction term $\propto \tilde u$, which becomes
relevant only below $d = 3$.
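
The power counting behind points (c) and (d) can be checked mechanically. The following Python
sketch is illustrative only and rests on assumptions that are not spelled out at this point of the
text: transverse momenta scale as $b$, the longitudinal momentum as $b^\sigma$, frequencies as $b^z$,
and the fields as $b^{d_\phi}$ and $b^{d_\psi}$; the four scale invariant quadratic terms are taken to be
$\psi\partial_t\phi$, the transverse fourth-derivative term, the longitudinal $r$-term, and the
transverse noise term; the driving vertex is taken to contain one longitudinal derivative, one
$\psi$ and two $\phi$, and the leading interaction two transverse derivatives, one $\psi$ and three
$\phi$. All function names are hypothetical.

```python
from fractions import Fraction


def term_exponent(d, z, sigma, d_phi, d_psi,
                  n_perp=0, n_par=0, n_t=0, n_phi=0, n_psi=0):
    """Power of b picked up by one term of the action under the rescaling."""
    measure = -(d - 1) - sigma - z            # d^{d-1}x_perp dx_par dt
    derivatives = n_perp + sigma * n_par + z * n_t
    fields = n_phi * d_phi + n_psi * d_psi
    return measure + derivatives + fields


def coupling_dimensions(d):
    """Return the engineering dimensions (dim_E, dim_u) in d dimensions."""
    d = Fraction(d)
    z, sigma = Fraction(4), Fraction(2)
    d_phi, d_psi = (d - 1) / 2, (d + 3) / 2
    dims = (d, z, sigma, d_phi, d_psi)

    # The four quadratic terms assumed to be scale invariant at tree level.
    quadratic = {
        "psi dt phi": dict(n_t=1, n_phi=1, n_psi=1),
        "psi d_perp^4 phi": dict(n_perp=4, n_phi=1, n_psi=1),
        "r psi d_par^2 phi": dict(n_par=2, n_phi=1, n_psi=1),
        "(d_perp psi)^2": dict(n_perp=2, n_psi=2),
    }
    for name, powers in quadratic.items():
        assert term_exponent(*dims, **powers) == 0, name

    # A coupling scales with the opposite power of the operator it multiplies.
    dim_E = -term_exponent(*dims, n_par=1, n_phi=2, n_psi=1)
    dim_u = -term_exponent(*dims, n_perp=2, n_phi=3, n_psi=1)
    return dim_E, dim_u
```

Under these assumptions the driving term is marginal at $d = 5$ and the interaction at $d = 3$,
i.e. `coupling_dimensions(d)` evaluates to $((5 - d)/2,\ 3 - d)$.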

The estimate above suggests that we should drop the terms proportional to $A$, $u$, $\tilde u$. We
will thus model the critical system by the reduced effective action

S[cj>, ip] = J dxD^ip (jD~ 1 d t — rd 2 + d 4 ^ <p + Ed(p 2 + (dip) 2 ^J 

H/d*[#,«( (9+r , ( f 2 £)C)+ DB H- <lo ■ lo3, 


where $g = (\partial_t - Dr\partial^2 + D\boldsymbol{\partial}^4)^{-1}$, and we have absorbed the noise coefficient $\tilde A$ in a
rescaled field amplitude.

In the following, we aim to understand the build-up of anisotropies in the critical system 
at d = 5 — e dimensions. To understand the ramifications of scaling anisotropies, let us 
begin by computing the Fourier transform of the static structure factor - the spatial 
correlation function of the system - in a tree approximation. Comparing with Eq. (10.98) 
and (10.99), we obtain 


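$$S(k) = \int \frac{d\omega}{2\pi}\, S(k, \omega) = \int \frac{d\omega}{2\pi}\, \frac{2\mathbf{k}^2}{\omega^2 + D^2(rk^2 + \mathbf{k}^4)^2} = \frac{\mathbf{k}^2}{D(rk^2 + \mathbf{k}^4)}$$

and

$$S(x) = \int \frac{d^{d-1}k}{(2\pi)^{d-1}} \int \frac{dk}{2\pi}\, \frac{\mathbf{k}^2}{D(rk^2 + \mathbf{k}^4)}\, e^{ikx + i\mathbf{x}\cdot\mathbf{k}} = \frac{1}{2Dr^{1/2}} \int \frac{d^{d-1}k}{(2\pi)^{d-1}}\, e^{-\mathbf{k}^2 |x| r^{-1/2} + i\mathbf{x}\cdot\mathbf{k}}.$$
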


This expression provides an a posteriori justification of our choice of “mass parameters”
$r$ and $\tilde r$ at criticality. We have obtained a correlation function that decays as a power law
in the longitudinal direction and exponentially in the transverse direction. This behavior 
conforms with the build-up of critical fluctuations in the longitudinal direction, while the 
transverse direction remains in a disordered state. 


Perturbative RG 


How will critical fluctuations affect this picture? In the following, we perform a rudimentary
RG analysis to show that below the upper critical dimension $d_c = 5$, fluctuations conspire to
strengthen the anisotropic character of the system. (For a full exposition of an RG analysis
of this system, we refer to Schmittmann and Zia. 39) As in Section 8.3, we aim to explore what
happens as fast fluctuations successively get integrated out. To this end, we first introduce
a graphical language representing the building blocks relevant to the perturbative RG (see
the figure).


EXERCISE Refamiliarize yourself with the plan of the perturbative RG discussed in Section 8.3.
Decompose the fields as $\phi = \phi_s + \phi_f$ into fast and slow sectors. To one loop order in the
fast fields, explore the different diagrams contributing to the perturbative RG step. Specifically,
convince yourself that no vertices other than those contained in the action Eq. (10.103) will
be generated. Use power counting arguments to show that the one fast momentum integration
becoming dangerous below $d = 5$ will act to renormalize the $\psi\partial^2\phi$ vertex.


The one diagram driving the renormalization of the theory is shown in the figure to the right.
To explore the effect of this contribution, we divide the fields into fast and slow contributions.
In view of the anisotropies inherent to the model, it is actually not so obvious how the space
of coordinates $(\mathbf{q}, q, \omega)$ is to be divided into a fast and a slow sector. On the other hand, the
RG procedure should be insensitive to the detailed choice of an (infinitesimally thin) layer of
fast modes. (As an exercise, check it by inventing schemes different from the one below.) We
thus make the technically convenient choice, fast: $\Lambda/b < |\mathbf{q}| < \Lambda$, $(q, \omega)$ arbitrary, slow:
$|\mathbf{q}| < \Lambda/b$, $(q, \omega)$ arbitrary. Denoting the integration over the fast modes by $\int_f dq$, the
diagram above renormalizes the coefficient $r$ according to


$$\begin{aligned}
r \to{} & r + \mathrm{const.} \times DE^2 \int_f dq\, G_{21}(q)\, G_{22}(q) \\
={} & r + \mathrm{const.} \times DE^2 \int_f d^{d-1}q \int dq\, d\omega\, \frac{\mathbf{q}^2}{(-i\omega + Drq^2 + D\mathbf{q}^4)^2\,(i\omega + Drq^2 + D\mathbf{q}^4)} \\
={} & r + \mathrm{const.} \times \frac{E^2}{D} \int_f d^{d-1}q \int dq\, \frac{\mathbf{q}^2}{(rq^2 + \mathbf{q}^4)^2} \\
={} & r + \frac{cE^2}{Dr^{1/2}} \ln b + O(\epsilon^2),
\end{aligned}$$


where $c = O(1)$ is a numerical constant. At the one-loop level, the vertex $\propto E$ does not
renormalize; its renormalization stems from the rescaling of momenta following the RG
step, $E \to b^{(5-d)/2} E = e^{\ln b\, \epsilon/2} E$. This latter rescaling does not change the $r$-term, as the
field dimensions have been deliberately chosen so as to make this term (engineering) scale
invariant. We thus obtain the two RG equations


$$\frac{dE}{d\ln b} = \frac{\epsilon}{2} E, \qquad \frac{dr}{d\ln b} = \frac{cE^2}{Dr^{1/2}}. \qquad (10.104)$$


Substituting the solution of the first equation, $E = b^{\epsilon/2} E_0$, into the second, and solving
the remaining ordinary differential equation, we obtain $r \sim b^{2\epsilon/3}$. This fluctuation induced
scaling needs to be taken into account in an updated variant of the determination of scaling
dimensions. Accounting for the scaling of $r$ in Eq. (10.101), we find that the anisotropy
exponent changes according to

$$\sigma = 2 + \frac{\epsilon}{3}. \qquad (10.105)$$
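
The asymptotic scaling of $r$ can be checked numerically. The sketch below integrates the flow
equations, read here as $dE/d\ln b = (\epsilon/2)E$ and $dr/d\ln b = cE^2/(Dr^{1/2})$, with a fourth
order Runge-Kutta step and extracts the late-stage logarithmic slope of $r$. The values
$c = D = E_0 = r_0 = 1$ and the function names are arbitrary illustrative choices; only the slope,
expected to approach $2\epsilon/3$, is meaningful.

```python
import math


def flow(eps, l_max, steps, c=1.0, D=1.0, E0=1.0, r0=1.0):
    """Integrate dE/dl = (eps/2) E, dr/dl = c E^2 / (D sqrt(r)), l = ln b, by RK4."""
    def rhs(E, r):
        return 0.5 * eps * E, c * E * E / (D * math.sqrt(r))

    h = l_max / steps
    E, r = E0, r0
    for _ in range(steps):
        k1 = rhs(E, r)
        k2 = rhs(E + 0.5 * h * k1[0], r + 0.5 * h * k1[1])
        k3 = rhs(E + 0.5 * h * k2[0], r + 0.5 * h * k2[1])
        k4 = rhs(E + h * k3[0], r + h * k3[1])
        E += h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6
        r += h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6
    return E, r


def growth_exponent(eps, l1=30.0, l2=40.0):
    """Late-stage slope d ln r / d ln b, estimated between l1 and l2."""
    _, r1 = flow(eps, l1, int(round(l1 * 1000)))
    _, r2 = flow(eps, l2, int(round(l2 * 1000)))
    return math.log(r2 / r1) / (l2 - l1)
```

For $\epsilon = 1/2$, for example, `growth_exponent(0.5)` should come out close to $1/3$, consistent
with $r \sim b^{2\epsilon/3}$.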

This result adds to our overall observation that driving massively interferes with the
physics of the system. By way of summary, we have seen that:

▷ In the disordered regime above criticality, external driving leads to the formation of new
macroscopic structures which manifest themselves in new types of power law correlation.

▷ The strongly invasive character of driving is also reflected in the increased value of the
upper critical dimension, $4 \to 5$ (as compared to the equilibrium system).

▷ At criticality, driving causes fluctuations which superimpose to strengthen the anisotropy
of the system, Eq. (10.105). Qualitatively, the fluctuations generate patterns that appear
to be strongly aligned in the field direction.

For a deeper analysis of the system, and a discussion of its physics below criticality, we refer
to a more comprehensive study of its scaling and renormalization behavior. 39


10.7.2 Directed percolation 

Amongst the different universality classes of equilibrium statistical mechanics, the $\phi^4$- or
Ising-class is the simplest. It owes its frequent occurrence in nature to the fact that its free
energy function, a quartic polynomial of a real scalar variable, is the most basic one that
leads to nontrivial critical behavior. In nonequilibrium statistical mechanics, the directed
percolation class plays a similarly paramount role. In this section, we try to understand the
importance of this universality class to nonequilibrium statistical mechanics, and explore a
few of its critical properties. However, before doing so, it is useful to briefly review directed
percolation per se. (The material presented below is inspired by an excellent review by
H. Hinrichsen. 43)

Figure 10.12 Cartoon of a directed percolation network with a given realization of open bonds, cf.
ordinary, or isotropic percolation, where the network bonds are directionless.


Directed Percolation: Phenomenology 

The term “directed percolation” (DP) refers to a class of models that describe the spatially 
directed spreading of a substance through a medium - think, for example, of a liquid dis- 
persing through porous rock under the influence of a gravitational force. Alternately, one 
may identify the driving direction with time, in which case directed percolation mimics, for 
example, the spreading of an epidemic, or related dynamical phenomena. 

The directed percolation model (see Fig. 10.12) is defined on a d-dimensional hypercubic 
lattice. A certain “main diagonal” of the lattice is introduced as the identifier of a sense of 
direction. The links of the lattice are then chosen to be open or closed with probability $p$
and $1 - p$, respectively. Like its cousin, the undirected, or isotropic percolation model, this
system exhibits a phase transition at a critical value of the connectivity probability, $p = p_c$,
the directed percolation phase transition (see Fig. 10.13). The order parameter of this
transition is the probability $P_\infty$ that a randomly chosen site of the lattice is at the origin
of an infinitely large connected cluster. At $p < p_c$, and using a language alluding to the
liquid/porous rock application, the system is in a “dry phase.” At $p > p_c$, it has become
“wet.”
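
The dry and wet phases are easily seen in a simulation. The following is a minimal Monte Carlo
sketch of bond directed percolation on a tilted square lattice, i.e. (1 + 1)-dimensional in the
dynamical reading of the model. This particular geometry, in which every wet site may wet its two
neighbors in the next row through independent bonds, is an illustrative assumption rather than
the construction of Fig. 10.12, and all function names are hypothetical. For this lattice the
transition is known from simulations to lie near $p \approx 0.6447$.

```python
import random


def survives(p, t_max, rng):
    """Grow a cluster from a single wet seed for t_max rows; True if still wet."""
    wet = {0}
    for _ in range(t_max):
        nxt = set()
        for x in wet:
            # Two outgoing bonds per site, each open with probability p.
            if rng.random() < p:
                nxt.add(x)
            if rng.random() < p:
                nxt.add(x + 1)
        if not nxt:
            return False      # the cluster has died out: dry
        wet = nxt
    return True


def survival_probability(p, t_max=100, runs=200, seed=1):
    """Fraction of seeds whose cluster reaches row t_max (finite-time proxy for P_infinity)."""
    rng = random.Random(seed)
    return sum(survives(p, t_max, rng) for _ in range(runs)) / runs
```

Deep in the dry phase, e.g. at $p = 0.3$, essentially no cluster reaches row `t_max`, while at
$p = 0.9$ nearly all of them do. Resolving the critical region would require much larger `t_max`
and `runs` than used in this sketch.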

Despite the simplicity of the model, the critical properties of the DP-transition are not
fully understood. What can be said without any further calculation is that the critical value
$p_c = p_c(d)$ will depend on dimensionality. (In one dimension, any broken bond will truncate
the cluster, i.e. $p_c(1) = 1$; in infinite dimensions, the unlimited number of options to go
from any lattice site implies $p_c(d \to \infty) \to 0$.) Both numerical analysis and field theory
suggest that the corresponding critical exponents are complicated, and likely irrational.

43 H. Hinrichsen, Nonequilibrium critical phenomena and phase-transitions into absorbing states, Adv. Phys. 49,
815-958 (2000).

Figure 10.13 The two phases of a directed percolation network. Left, dry phase, wherein individual
sites generate finite size clusters; right, wet phase with infinite size cluster generation.

What gives the understanding of the DP transition increased relevance is that many
reaction-diffusion processes appear to fall into the DP universality class. To heuristically
understand this connection, simply note that, upon interpreting the sense of direction as 
the flow of time, the elementary compounds of a cluster afford a natural interpretation 
in terms of “reactions,” or diffusion (cf. Fig. 10.14). The generation update on the DP 
network need not be modeled simply by drawing bond connectivities according to some 
fixed probabilities; more sophisticated update rules can be invented to describe a large 
set of dynamical setups. Algorithms implementing such update rules are called cellular 
automata. Many such automata exhibit a phase transition between two “absorbing states” 
of the lattice, the empty state (above termed “dry”) and the totally occupied state (“wet”). 
One of the theoretical challenges in this field is to understand the universal properties of 
such transitions, and specifically their connection to the basic DP universality class. 

Below, we explore some of the universal properties of the DP transition by field theoretical 
methods. We also aim to understand the origins of the ubiquity of the DP class in reaction- 
diffusion kinetics. 


Elements of scaling theory 

Many physical properties of the DP transition can be conveniently described in terms of
scaling laws. Introducing $\tau = p - p_c$ as a scaling field measuring the distance from the
transition point, the correlation behavior of the system is characterized by a correlation
length $\xi \sim |\tau|^{-\nu}$ in its “spatial directions,” and a correlation time $t \sim \xi^z \sim |\tau|^{-\nu z}$ in the
orienting direction. The transition as such can be probed in terms of two candidate order
parameters. The first is the average number of active, or wet sites,

$$\rho(t) = N^{-1} \Big\langle \sum_i s_i(t) \Big\rangle,$$

where $N$ is the total number of sites within one time layer, and $s_i = 1$ if site $i$ is active, and
zero otherwise. In the vicinity of the transition, $\rho$ is expected to exhibit power law scaling,
$\rho(t \to \infty) \equiv \rho \sim \tau^{\beta}$, $\tau > 0$. More generally, the critical time dependence of order parameter
fluctuations is described by the ansatz

$$\rho(t) \sim |\tau|^{\beta} g(t/t(\tau)),$$

with the short time asymptotics (cf. the discussion of Section 10.6.1),

$$\rho(t) \sim t^{-\alpha}, \qquad \alpha = \beta/\nu z.$$

Figure 10.14 Dynamical interpretation of a DP network. Each row of sites represents a generation
of particles. The generation dynamical update then involves diffusion, birth and death, coagulation
and offspring creation.

Another possible order parameter is the infinite cluster size probability introduced
above. We can generalize to a time-dependent quantity by declaring $P(t)$ to be the average
probability that a cluster started at $t = 0$ is still active at time $t$. Since the “spatial” size
of the cluster grows indefinitely with time, $P_\infty = P(t \to \infty)$. The scaling behavior of $P$ is
described by $P_\infty \sim \tau^{\beta'}$, $\tau > 0$, and

$$P(t) \sim |\tau|^{\beta'} g'(t/t(\tau)),$$

with short time limit $P(t) \sim t^{-\alpha'}$, $\alpha' = \beta'/\nu z$. The four exponents $\beta$, $\beta'$, $\nu$, $z$ describe the
basic scaling behavior of the system. (For the elementary DP class, infinite cluster size
probability and overall wetness are equivalent characteristics, $\beta = \beta'$. For general phase
transitions between absorbing states, however, these quantities can be different.)

An important derived quantity, similar in nature to the pair correlation functions of many
particle theories, is the pair connectedness function $C(\mathbf{x}, t)$. This function is defined as
the probability that an active site at $(\mathbf{0}, 0)$ is connected to the site at $(\mathbf{x}, t)$ by an open path.
The scaling hypothesis implies that

$$C(\mathbf{x}, t) = t^{\theta - d/z}\, F\big(|\mathbf{x}|/t^{1/z},\, \tau t^{1/\nu z}\big). \qquad (10.106)$$


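The dimension $\theta$ can be fixed by running a few consistency checks: the probability that
$(\mathbf{x} \simeq 0, t)$ is connected to $(\mathbf{0}, 0)$ is proportional to both the probability of finding a cluster
of temporal extension $t$, $P(t)$, and the density of active sites, $\rho(t)$. Here, the notation $\mathbf{x} \simeq 0$
means that the final point $\mathbf{x}$ should lie inside the “conical” cluster spreading out from
$\mathbf{0}$, i.e. $C(\mathbf{0}, t) \stackrel{t \to \infty}{\longrightarrow} \tau^{\beta + \beta'}$. On the other hand, the expected stationarity of $C(\mathbf{0}, t \to \infty)$
requires that $F(0, \tau t^{1/\nu z}) \sim (\tau t^{1/\nu z})^{-\nu z(\theta - d/z)}$, i.e. $C(\mathbf{0}, t \to \infty) \sim \tau^{-\nu z(\theta - d/z)}$. Comparing these two
conditions, we obtain the identification $\tau^{-\nu z(\theta - d/z)} \sim \tau^{\beta + \beta'}$, or

$$\theta = \frac{1}{z}\left(d - \frac{\beta + \beta'}{\nu}\right). \qquad (10.107)$$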


Equations (10.106) and (10.107) describe much of the correlation behavior of the cluster. 
Notice, however, that the argument leading to Eq. (10.107) relied on the cluster spreading 
“underneath” the initial point. However, in high dimensions, this condition may be violated, 
on account of the cluster spreading out in directions “orthogonal” to |x — 0|. Closer analysis 
shows that the validity of Eq. (10.107) is limited to dimensions below the upper critical 
dimension of the process. 

For the discussion of scaling functions higher in the hierarchy, and describing features 
such as the average extension of clusters, their volume, etc., we refer to Hinrichsen. 43 In 
the next section, we discuss how field theoretical methods can be applied to describe the 
fluctuation behavior of the system and its critical exponents. 


Field theory 

Before analysing low energy field theories of the DP transition, we first have to construct 
one. On the face of it, it is not so obvious how to do this: we have not described DP in 
the language of master equations, nor by Langevin type evolution, so our standard recipes 
do not apply. Instead, we aim to construct a “dynamical mean field equation” of DP on 
phenomenological grounds. We then use this equation as input for the construction of a 
field theory, along the lines of Section 10.6.3. 

The mean field equation we are going to construct operates on the mean density of active
sites, $\rho(t)$. Adopting the dynamical interpretation of DP, $\rho(t)$ increases at a rate that is
proportional to the probability of open bonds, $p$, the density $\rho(t)$ of active sites at time
step $n - 1$ that may act as germs of newly generated active sites, and the density $1 - \rho(t)$ of
empty sites at time step $n$ that may be converted to active sites. The density decreases at a
rate proportional to the density of active sites at the previous time step, and the probability
of closed bonds, $1 - p$. Combining all these considerations, we are led to the equation
$\partial_t \rho = c_1 p \rho (1 - \rho) - c_2 (1 - p) \rho$, where $c_{1,2}$ are nonuniversal constants depending on the
dimensionality of the system, and on the unit at which time is measured. We may always
rescale density and time such that the equation assumes the form,

$$\partial_t \rho = (\lambda - 1)\rho - \lambda \rho^2. \qquad (10.108)$$
The defining feature of this equation is that it possesses (a) a stationary configuration $\rho = 0$,
and (b) a non-empty stationary state $\rho = (\lambda - 1)/\lambda$ provided $\lambda > 1$. The critical value $\lambda_c = 1$ at
which the non-empty “mean field” begins to emerge marks the position of the continuous
DP transition. Notice the simplicity of the DP mean field equation. It explains the ubiquity
of the DP class: alluding to the principle of universality, one may suspect that processes
described by a rate equation free of constant terms generically exhibit a DP transition,
provided that the first two terms $\sim \rho$ and $\sim \rho^2$ admit a non-zero solution.

Equation (10.108) fixes the mean field critical exponents of the system. Using the
notation $\tau = \lambda - \lambda_c = \lambda - 1$ for the difference to the mean field critical point, near criticality
$\rho \sim \tau$, i.e. $\beta_{\rm mf} = 1$. In the dry phase, $\tau < 0$, the asymptotic temporal decay of the density is
given by $\rho \sim \exp(-|\tau| t)$, which means that $\nu z = 1$. However, lacking any spatial structure,
mean field theory cannot say anything about $\nu$ (nor $z$) individually.
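
Both mean field statements are easy to verify numerically. The following sketch integrates the
rate equation, taken here in the form $\partial_t\rho = (\lambda - 1)\rho - \lambda\rho^2$, with a simple
forward-Euler step. The step size, the initial densities and the function names are arbitrary
illustrative choices.

```python
import math


def evolve(lam, rho0, t_max, dt=1e-3):
    """Forward-Euler integration of d(rho)/dt = (lam - 1) rho - lam rho^2."""
    rho = rho0
    for _ in range(int(round(t_max / dt))):
        rho += dt * ((lam - 1.0) * rho - lam * rho * rho)
    return rho


def stationary_density(lam):
    """Non-empty stationary state for lam > 1, empty state otherwise."""
    return (lam - 1.0) / lam if lam > 1.0 else 0.0


def decay_rate(lam, t1=30.0, t2=40.0, rho0=0.5):
    """Late-time exponential decay rate of rho in the dry phase (lam < 1)."""
    return -math.log(evolve(lam, rho0, t2) / evolve(lam, rho0, t1)) / (t2 - t1)
```

In the wet phase the density relaxes to `stationary_density(lam)`, which vanishes linearly in
$\lambda - 1$ on approaching the transition. In the dry phase `decay_rate(lam)` approaches
$|\lambda - 1|$, the inverse correlation time.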

The mean field equation (10.108) can be upgraded to a spatially resolved rate equation
by upgrading $\rho(t) \to \rho(\mathbf{x}, t) \equiv \rho(x)$ to a local density profile and adding a diffusion term:

$$\partial_t \rho = D\partial^2 \rho + \tau\rho - \lambda\rho^2. \qquad (10.109)$$

Dimensional analysis shows that $\nu_{\rm mf} = 1/2$. In combination with the results above (which
survive generalization to a local equation - exercise) we then have the mean field, or tree
level prediction,

$$\beta_{\rm mf} = 1, \qquad \nu_{\rm mf} = 1/2, \qquad z_{\rm mf} = 2. \qquad (10.110)$$

A field theory can now be constructed by interpreting Eq. (10.109) as a model A rate equation
along the lines of our discussion of Section 10.6.2. To this end, we upgrade Eq. (10.109)
to a noisy equation,

$$\partial_t \rho = D\partial^2\rho + \tau\rho - \lambda\rho^2 + \xi,$$

where the noise is correlated as

$$\langle \xi(x) \rangle = 0, \qquad \langle \xi(x)\xi(x') \rangle = 2A\rho(x)\delta(x - x'). \qquad (10.111)$$


The crucial feature here is the scaling $\xi \sim \rho^{1/2}$. It reflects the assumption that the system
self-generates noise through its active sites. This type of noise must vanish in the empty
state, $\rho = 0$. Further, we are to interpret $\xi$ as a coarse-grained variable sampling fluctuations
in a large number of active sites. According to the central limit theorem, these fluctuations
scale as $\sqrt{\rho}$, which explains the above correlation law. Generally, noise that scales with some
power of the Langevin variable is termed multiplicative noise.

We next subject the noisy rate equation above to our standard treatment of Langevin equations,
thus generating the field theory representation of the DP partition function,
$Z = \int D(\phi, \psi)\, e^{-S[\phi, \psi]}$, where 44

$$S[\phi, \psi] = \int dx \left[ \psi(\partial_t - D\partial^2 - \tau)\phi + \kappa\psi\phi(\phi - \psi) \right] \qquad (10.112)$$

and $\kappa = (A\lambda)^{1/2}$. (In deriving Eq. (10.112), we made use of the freedom to rescale fields so
as to make the coefficients of the two nonlinear terms $\sim \phi\psi^2$ and $\sim \phi^2\psi$ equal.) We next
explore (cf. Eqs. (10.100) and (10.101)) the behavior of the different terms in the action
under the rescaling transformation

$$x \to x/b, \quad t \to t/b^z, \quad \phi \to b^{d_\phi}\phi, \quad \psi \to b^{d_\psi}\psi. \qquad (10.113)$$

Referring to the action (10.97), we have

(10.114)

44 Curiously, the field theory Eq. (10.112) has a background in particle physics, where it is known as Reggeon
field theory, see M. Moshe, Recent developments in Reggeon field theory, Phys. Rep. 37, 255-345 (1978) for
a review.

Figure 10.15 Three distinct phase portraits of directed percolation. Left: phase supporting a finite
concentration of active sites, center: phase transition, right: empty state.


Choosing $z = 2$ and $d_\phi = d_\psi = d/2$, the first two contributions to the action become (tree
level) scale invariant. The operator $\sim \tau\psi\phi$ measuring the distance off criticality is strongly
relevant with dimension 2. Finally, the nonlinear operators $\sim \lambda\psi\phi^2$ and $\sim A\psi^2\phi$ carry
dimension $4 - d$. This identifies the upper critical dimension of directed percolation as
$d = 4$.

Before turning to the discussion of the role of fluctuations below $d = 4$, let us briefly
recapitulate the mean field picture in the language of the effective field theory Eq. (10.112).
As with the zero-dimensional problems discussed in Section 10.4, the phase structure of the
system is essentially determined by the pattern of intersecting zero energy lines of the mean
field Hamiltonian, $H = \psi\phi(\phi - \psi - \tau)$. Depending on the sign of $\tau$, three distinct phases
need to be distinguished (see Fig. 10.15). At $\tau > 0$, the system is in its active phase:
on the fluctuationless manifold $\psi = 0$, the concentration variable $\phi$ is driven towards the
stable configuration $\phi = \tau$. (Notice that only positive values of $\phi$ are physically meaningful.)
At $\tau < 0$, the empty state $\phi = 0$ is stable, the inactive phase. The mean field phase
transition between both phases happens at $\tau = 0$. The phase space picture of directed
percolation was introduced in Kamenev 45 as the basis for a far-reaching classification of
phase transitions in reaction-diffusion systems. For further aspects of the utility of the
Hamiltonian approach in this field, we refer to the original reference.


Perturbative RG


How will fluctuations alter this picture? Close to the upper critical dimension, in $d = 4 - \epsilon$,
renormalization group techniques can be applied to answer this question. Carrying out an
RG analysis similar to that detailed in Section 10.7.1 (cf. Problem 10.9.4), one finds that
the two coupling constants $\tau$ and $\kappa$ flow according to the equations


dr 

dlnb 


- DCk 2 + r 



dn ( e 
dlnb K \ 2 



(10.115) 


where the constant C = an< ^ ‘-’ 4 * s the area of the four dimensional unit-sphere. 

Within the above RG scheme, the diffusion constant, $D$, and the pre-factor of the frequency
term $\omega$ are kept fixed. The dynamical exponent and the “field renormalization”
$\phi \to \phi b^{\chi}$ are given by, respectively,


z = 2- 


Ck 2 


X = 


Cn 2 

~T~ 


These equations possess a non-trivial fixed point at r* = ^ + 0(e 2 ). k* = (^ c )^‘ 2 which 
means that fluctuations shift the position of the DP transition to a non-vanishing value of
$\tau$. At the fixed point, $z = 2 - \epsilon/12$ and $\chi = \epsilon/12$. Linearization of the RG equations around
the fixed point leads to


1 ), 

dk 

dhil) = ~ CK ’ 

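where $\tau = \tau^* + \hat\tau$ and $\kappa = \kappa^* + \hat\kappa$. The linearized equations imply that the coupling
constant $\hat\tau$ is a relevant scaling field with $\hat\tau \sim b^{2 - \epsilon/4}$. This result can in turn be used to
determine the 1-loop critical exponents: the scaling $b^{-1} \sim \xi \sim \hat\tau^{-\nu} \sim b^{-\nu(2 - \epsilon/4)}$ leads
to the identification $\nu = 1/2 + \epsilon/16 + O(\epsilon^2)$. The critical exponent $\beta$ follows from the scaling
relation $\rho \sim \langle \phi \rangle = b^{-d/2 + \chi} f(\hat\tau b^{2 - \epsilon/4})$. Setting $b \sim \hat\tau^{-1/(2 - \epsilon/4)}$, we obtain the identification
$\beta = 1 - \epsilon/6 + O(\epsilon^2)$. Summarizing, the 1-loop RG analysis generates the list of exponents

$$z = 2 - \frac{\epsilon}{12} + O(\epsilon^2), \qquad \nu = \frac{1}{2} + \frac{\epsilon}{16} + O(\epsilon^2), \qquad \beta = 1 - \frac{\epsilon}{6} + O(\epsilon^2).$$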

How do these values compare with reality? Numerical simulations for (3 + 1)-dimensional
clusters 46 obtain $(z, \nu, \beta) = (1.90(1), 0.581(5), 0.81(1))$ which compares reasonably well with
the $\epsilon = 1$ extrapolation of the exponents above, $(z, \nu, \beta) = (1.92, 0.56, 0.83)$. A two-loop
extension of the RG 47 leads to excellent agreement. However, in lower dimensions, the
situation is decidedly worse. For example, in the extreme case of (1 + 1)-dimensional percolation
clusters, the RG exponents are off the results of simulations by ca. 40%. Even so, the field
theory analysis sheds light on the physical mechanisms generating critical behavior, and
this is information that cannot be obtained from direct simulations.

45 A. Kamenev, Classification of phase transitions in reaction-diffusion models, Phys. Rev. E 74, 41101-17 (2006).
46 I. Jensen, Critical behavior of the three-dimensional contact process, Phys. Rev. A 45, R563-6 (1992).
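
The $\epsilon = 1$ extrapolation is a one-line computation. The sketch below evaluates the first-order
expressions $z = 2 - \epsilon/12$, $\nu = 1/2 + \epsilon/16$, $\beta = 1 - \epsilon/6$ and compares them with the
simulation values for (3 + 1) dimensions quoted above. The function name is hypothetical, and the
comparison uses only the central values, ignoring the quoted uncertainties.

```python
def one_loop_exponents(eps):
    """First-order epsilon expansion of the DP exponents, eps = 4 - d."""
    z = 2.0 - eps / 12.0
    nu = 0.5 + eps / 16.0
    beta = 1.0 - eps / 6.0
    return z, nu, beta


# Central values quoted in the text for (3 + 1)-dimensional clusters.
simulated_3p1 = (1.90, 0.581, 0.81)

for name, rg, sim in zip(("z", "nu", "beta"), one_loop_exponents(1.0), simulated_3p1):
    deviation = 100.0 * abs(rg - sim) / sim
    print(f"{name}: one-loop {rg:.3f}, simulation {sim:.3f}, deviation {deviation:.1f}%")
```

At $\epsilon = 1$ all three one-loop values lie within a few percent of the simulation results.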


10.8 Summary and Outlook 

In this chapter, we have discussed foundations of nonequilibrium statistical mechanics, with 
emphasis put on later extensibility to quantum nonequilibrium theory. We have introduced 
the two major pathways to the description of nonequilibrium systems, Langevin dynam- 
ics and probabilistic formulations in terms of master equations. We saw that these two 
approaches afford a unified description in terms of functional integrals, the MSRJD formal- 
ism. Interpreting the latter as a Hamiltonian functional integral, we looked at its Lagrangian 
partner - the Onsager-Machlup functional - and recovered the Fokker-Planck equation as 
its dynamical equation (similarly to the Schrödinger equation being the dynamical equation
of the quantum mechanical path integral). The arch stretching from the Langevin/master
equations over the MSRJD/Onsager-Machlup functional to the Fokker-Planck equation
provides a powerful framework wherein many nonequilibrium phenomena can be studied 
in a unified setting. Specifically, the functional integral description suggested a variational 
approach (similar in nature to the WKB approach of quantum mechanics) to the description 
of large fluctuations: fluctuations in higher dimensional environments could be analyzed 
by concepts borrowed from the theory of critical phenomena, etc. In the next chapter, the 
focus is on another important aspect of the formalism, the interpretation of the MSRJD 
functional as the classical limit of a generalized quantum theory. 


10.9 Problems 
10.9.1 Wigner surmise 

The problem of eigenvalue correlations of Hermitian matrices drawn from random distributions is a 
subject with a long history dating back to studies of resonances in atomic nuclei. The following problem 
concerns the distribution of level spacings in a canonical 2x2 random Hamiltonian. The result, known 
as the Wigner surmise, captures much of the behavior of the general expression for random matrices of 
arbitrary rank. 


Consider a Hamiltonian $H$ of the form of a $2 \times 2$ matrix parameterized as

$$H = \begin{pmatrix} H_1 & \Delta \\ \Delta & H_2 \end{pmatrix}.$$

Suppose that the three real numbers $H_1$, $H_2$, and $\Delta$ are drawn from a Gaussian distribution
$P(H_1, H_2, \Delta) \propto e^{-\frac{1}{2} {\rm tr}\, H^2}$. Derive the joint distribution $P(E_1, E_2)$ of the two energy levels
$E_i$, $i = 1, 2$, of the Hamiltonian $H$. Using this, derive the Wigner surmise describing the
distribution of the energy splitting $s = |E_1 - E_2|$, and show that it has the form

$$P(s) = c_1 s\, e^{-c_2 s^2},$$

with some constants $c_1$ and $c_2$.

47 J. B. Bronzan and J. W. Dash, Higher order epsilon terms in the renormalization group approach to Reggeon
field theory, Phys. Lett. B 51, 496-498 (1974).

Answer: 

The joint eigenvalue distribution is given by

$$P(E_+, E_-) \sim \int dH_1\, dH_2\, d\Delta\, \delta(E_+ - \lambda_+)\, \delta(E_- - \lambda_-)\, e^{-(E_+^2 + E_-^2)/2},$$

where $\lambda_\pm = \frac{H_1 + H_2}{2} \pm \left(\Delta^2 + \left(\frac{H_1 - H_2}{2}\right)^2\right)^{1/2}$ denote the eigenvalues of the matrix $H$, and we
assumed $E_+ > E_-$ (in the opposite case, one has to interchange $\lambda_+ \leftrightarrow \lambda_-$). Integrating
over $(H_1 + H_2)/2$ and setting $\delta H = (H_1 - H_2)/2$, one obtains $P(E_+, E_-) \sim \int d\delta H\, d\Delta\, \delta(E_+ -
E_- - 2(\Delta^2 + \delta H^2)^{1/2})\, e^{-(E_+^2 + E_-^2)/2}$. Finally, setting $r = (\Delta^2 + \delta H^2)^{1/2}$, and integrating over
the “angular” variable, one obtains $P(E_+, E_-) \sim \int_0^\infty dr\, r\, \delta(E_+ - E_- - 2r)\, e^{-(E_+^2 + E_-^2)/2} \sim
(E_+ - E_-)\, e^{-(E_+^2 + E_-^2)/2}$. From this result, and using $E_+^2 + E_-^2 = \frac{1}{2}(E_+ + E_-)^2 + \frac{1}{2}(E_+ - E_-)^2$,
we obtain the distribution

$$P(s) \sim \int_{E_+ > E_-} dE_+\, dE_-\, P(E_+, E_-)\, \delta(E_+ - E_- - s) \sim s\, e^{-s^2/4}.$$

We therefore find that $c_2 = 1/4$, and $c_1 = 1/2$ is obtained from the normalisation condition.
This result shows that the probability of finding a degeneracy is vanishing, i.e. levels repel.
As a further exercise, show that if $H$ is a complex Hermitian matrix, i.e. both real and imaginary
parts of $\Delta$ are drawn from a Gaussian distribution, the probability distribution vanishes as
$P(s) \sim s^2$ for small $s$.
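
The values of the constants depend on the normalization of the Gaussian weight: for a weight
$\exp(-a\, {\rm tr}\, H^2)$, the argument above gives $c_2 = a/2$ and, by normalization, $c_1 = 2c_2$. This can be
checked by direct sampling. The following sketch draws real symmetric $2 \times 2$ matrices for a
given $a$ and estimates $c_2$ from the identity $\langle s^2 \rangle = 1/c_2$, valid for any distribution of the
form $c_1 s\, e^{-c_2 s^2}$. The parameter `a` and all function names are illustrative assumptions.

```python
import math
import random


def sample_spacings(a, n, seed=0):
    """Level spacings of real symmetric 2x2 matrices with weight exp(-a tr H^2)."""
    rng = random.Random(seed)
    sd_diag = math.sqrt(1.0 / (2.0 * a))   # exp(-a H_i^2)      -> variance 1/(2a)
    sd_off = math.sqrt(1.0 / (4.0 * a))    # exp(-2 a Delta^2)  -> variance 1/(4a)
    spacings = []
    for _ in range(n):
        h1 = rng.gauss(0.0, sd_diag)
        h2 = rng.gauss(0.0, sd_diag)
        delta = rng.gauss(0.0, sd_off)
        # s = E_+ - E_- = sqrt((H_1 - H_2)^2 + 4 Delta^2)
        spacings.append(math.sqrt((h1 - h2) ** 2 + 4.0 * delta ** 2))
    return spacings


def fitted_c2(spacings):
    """Estimate c_2 from <s^2> = 1/c_2 for P(s) = c_1 s exp(-c_2 s^2)."""
    return len(spacings) / sum(s * s for s in spacings)
```

A histogram of the sampled spacings also makes the level repulsion visible directly, since
hardly any samples fall near $s = 0$.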


10.9.2 Ornstein-Uhlenbeck process 

The Ornstein-Uhlenbeck process 48 is the “harmonic oscillator” of nonequilibrium dynamics. It describes
stochastic evolution in cases where the averaged dynamics of a Langevin particle is described by a linear
first order differential equation. Realizations include the dynamics of a particle subject only to friction
(but no “external forces”), or the dynamics of an overdamped particle in a harmonic external potential.
Dissipative dynamics close to extremal potential points is often described by variants of this process. 


(a) Consider a Brownian particle in 1d whose velocity is governed by the stochastic differential
equation

$$\dot v + \gamma v = \frac{f}{m}, \qquad (10.116)$$

where $f$ is a Langevin force, and $\langle f(t) f(t') \rangle = A\delta(t - t')$. Determine the general solution
of Eq. (10.116) for the velocity with the initial condition $v(0) = v_0$, and calculate $\langle v(t) \rangle$
and $\langle v^2(t) \rangle$. Using the long-time limit of the latter, identify the coefficient $A$ with the
help of the equipartition theorem.

(b) Consider the time-dependent velocity distribution $p(v, t)$. It is governed by the Fokker-Planck
equation

$$\left[\partial_t + \partial_v a_1 - \frac{1}{2}\partial_v^2 a_2\right] p(v, t) = 0. \qquad (10.117)$$

With the help of Eq. (10.117), derive equations for $\partial_t \langle v \rangle$ and $\partial_t \langle v^2 \rangle$ and identify the
coefficients $a_1$ and $a_2$, e.g., by using the results of part (a). Show that the corresponding
Fokker-Planck equation for the generating function $g(k, t) = \int dv\, e^{-ikv} p(v, t)$ is given
by

$$\left[\partial_t + \gamma k \partial_k + \frac{\gamma k_{\rm B}T}{m} k^2\right] g(k, t) = 0. \qquad (10.118)$$

(c) The first order partial differential equation Eq. (10.118) can be solved for example with
the method of characteristics. Show that a general solution is of the form

$$g(k, t) = e^{-\frac{k_{\rm B}T}{2m} k^2}\, \phi(k e^{-\gamma t}).$$

Determine the function $\phi$ from the initial condition $p(v, t = 0) = \delta(v - v_0)$ and derive
the distribution $p(v, t)$. Discuss the short and long-time limits of your result.

48 G. E. Uhlenbeck and L. S. Ornstein, On the theory of Brownian motion, Phys. Rev. 36, 823-41 (1930).

Answer: 


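(a) Formally, integrating the equation, one obtains the solution,

$$v(t) = v_0 e^{-\gamma t} + \frac{1}{m} \int_0^t dt'\, e^{-\gamma(t - t')} f(t'),$$

where $v_0 = v(0)$. Then, averaging over the distribution for the Langevin force, since
$\langle f \rangle = 0$, we have $\langle v(t) \rangle = v_0 e^{-\gamma t}$. Similarly, making use of the expression for the
correlator, we have

$$\langle v^2(t) \rangle = e^{-2\gamma t} \left[ v_0^2 + \frac{1}{m^2} \int_0^t dt' \int_0^t dt''\, \langle f(t') f(t'') \rangle\, e^{\gamma(t' + t'')} \right] = v_0^2 e^{-2\gamma t} + \frac{A}{2\gamma m^2} \left(1 - e^{-2\gamma t}\right).$$

In the long time limit, $\langle v^2(t) \rangle = \frac{A}{2\gamma m^2} = \frac{k_{\rm B}T}{m}$, i.e. $A = 2\gamma m k_{\rm B}T$. Setting $\delta v(t) = v(t) -
\langle v(t) \rangle$, one may further show that $\langle \delta v(t) \delta v(t') \rangle = \frac{k_{\rm B}T}{m} \left( e^{-\gamma|t - t'|} - e^{-\gamma(t + t')} \right)$.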

(b) Using the Fokker-Planck equation, we have

$$\partial_t \langle v \rangle = -\int dv\, v\, \partial_v(a_1 P) + \frac{1}{2} \int dv\, v\, \partial_v^2(a_2 P) = \int dv\, a_1 P - \frac{1}{2} \int dv\, \partial_v(a_2 P) = \langle a_1 \rangle.$$

From this result, we obtain $a_1(v) = -\gamma v$. Similarly

$$\partial_t \langle v^2 \rangle = -\int dv\, v^2\, \partial_v(a_1 P) + \frac{1}{2} \int dv\, v^2\, \partial_v^2(a_2 P) = \int dv\, 2v a_1 P - \int dv\, v\, \partial_v(a_2 P) = \langle 2v a_1 \rangle + \langle a_2 \rangle.$$

From part (a), we have $\partial_t \langle v^2 \rangle = -2\gamma v_0^2 e^{-2\gamma t} + \frac{A}{m^2} e^{-2\gamma t}$. As a result, after a small
amount of algebra, we find that $\langle a_2 \rangle = A/m^2 = 2\gamma k_{\rm B}T/m$.

The derivation of Eq. (10.118) amounts to a straightforward substitution of the
Fokker-Planck equation into the Fourier integral defining $g(k)$.

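(c) Using the method of characteristics, from the left-hand side of the equation in (b), we
have $-\gamma k\, dt + dk = 0$, i.e. $\dot k = \gamma k$ and $k(t) = c e^{\gamma t}$. From the variation of $g$ along the
characteristic, $\frac{dg}{g} = -\frac{k_{\rm B}T}{m} k\, dk$, we obtain $g(k, t) = {\rm const.} \times e^{-k_{\rm B}T k^2/2m}$, i.e.

$$g(k, t) = e^{-k_{\rm B}T k^2/2m}\, \phi(k e^{-\gamma t}).$$

With the initial condition, $g(k, 0) = e^{-ikv_0}$, we have $\phi(k) = e^{-ikv_0} e^{k_{\rm B}T k^2/2m}$. As a
result, we obtain the solution

$$g(k, t) = \exp\left[ -ikv_0 e^{-\gamma t} - \frac{k_{\rm B}T}{2m} k^2 \left(1 - e^{-2\gamma t}\right) \right].$$

Taking the inverse transform, we thus obtain the probability distribution function,

$$p(v, t) = \left[ \frac{2\pi k_{\rm B}T}{m} \left(1 - e^{-2\gamma t}\right) \right]^{-1/2} \exp\left[ -\frac{(v - v_0 e^{-\gamma t})^2}{2 \frac{k_{\rm B}T}{m} \left(1 - e^{-2\gamma t}\right)} \right].$$

In the limit $t \to 0$, $p(v, t) \simeq \frac{1}{\sqrt{2\pi Dt}} \exp\left[ -\frac{(v - v_0)^2}{2Dt} \right]$, with diffusion constant $D = \frac{2k_{\rm B}T}{m}\gamma$. In
the limit $t \to \infty$, $p(v, t) \simeq p_{\rm eq}(v) = \sqrt{\frac{m}{2\pi k_{\rm B}T}} \exp\left[ -\frac{mv^2}{2k_{\rm B}T} \right]$.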
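
The relaxation of the first two moments can also be observed in a direct simulation of the
Langevin equation. The following sketch uses the Euler-Maruyama scheme for
$\dot v = -\gamma v + f/m$ with $\langle f(t) f(t') \rangle = A\delta(t - t')$ and $A = 2\gamma m T$ (units with
$k_{\rm B} = 1$). The step size, the number of paths, and the function name are illustrative choices,
and the scheme carries a discretization error of order `dt`.

```python
import math
import random


def simulate_velocity(v0, gamma, T, m, t, dt=1e-2, paths=2000, seed=0):
    """Return the sample mean and variance of v(t) over independent noise histories."""
    rng = random.Random(seed)
    # Each step adds a Gaussian kick of standard deviation sqrt(A dt)/m = sqrt(2 gamma T dt / m).
    kick = math.sqrt(2.0 * gamma * T * dt / m)
    steps = int(round(t / dt))
    finals = []
    for _ in range(paths):
        v = v0
        for _ in range(steps):
            v += -gamma * v * dt + kick * rng.gauss(0.0, 1.0)
        finals.append(v)
    mean = sum(finals) / paths
    var = sum((v - mean) ** 2 for v in finals) / paths
    return mean, var
```

Within statistical and discretization errors, the returned mean should follow $v_0 e^{-\gamma t}$ and
the variance $(T/m)(1 - e^{-2\gamma t})$, which saturates at the equipartition value $T/m$ for
$\gamma t \gg 1$.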


10.9.3 Ornstein-Uhlenbeck process revisited

We here approach the Ornstein-Uhlenbeck process from the point of view of path integration. As with
the harmonic oscillator in quantum mechanics, the path integral approach to the problem is cumbersome
(as compared to a direct solution of the Fokker-Planck equation). However, the path integral
of the Ornstein-Uhlenbeck process frequently appears as an approximate integral describing quadratic
fluctuations around the stationary points of more complex integrals (much like the harmonic oscillator
integral describes quadratic fluctuations in quantum mechanical path integrals). For this reason, it is
time well invested to study the integral approach to the Ornstein-Uhlenbeck process.

Once more, we write P instead of p to distinguish probability distributions from momenta p in phase
space.
Consider the stochastic equation describing a particle in a harmonic potential with overdamped
dynamics, $\dot q + a q = \xi$ with $\langle \xi(t)\xi(t')\rangle = A\,\delta(t - t')$, where the noise strength
$A$ is tied to temperature through $a/A = \gamma/2T$. In the path integral representation, the
probability distribution reads

$$P(q_f, t\,|\,q_i, 0) = \int_{q(t)=q_f,\; q(0)=q_i} D[q,p]\; e^{-S[q,p]},$$

where $S = -\int_0^t dt'\,(p\dot q - H)$, and $H = -a p q - \frac{A}{2} p^2$.


(a) Solve the Hamiltonian equations of motion and determine the corresponding classical
action $S_{\rm cl}$.

(b) In order to go beyond the leading semiclassical approximation $P(q_f,t\,|\,q_i,0) \sim e^{-S_{\rm cl}[q,p]}$,
we need to determine the fluctuation determinant. In Section 3.2 we saw that the
fluctuation determinant of a path integral assumes the form (cf. Eq. 3.28)

$$\frac{1}{\sqrt{2\pi}}\sqrt{-\frac{\partial^2 S_{\rm cl}}{\partial q_i\,\partial q_f}}, \qquad (10.119)$$

where the replacement $i \to -1$ of the prefactor under the square root relates to the
fact that we are now dealing with an imaginary time integral. However, the direct
substitution of expression (10.119) for the fluctuation determinant would be premature:
(10.119) applies to theories whose Hamiltonian is of "conventional" type, $H = T + U$,
where $T$ and $U$ are kinetic and potential energy, respectively. Interpreting $q$ and $p$ as
operators with commutation relation $[p, q] = 1$, bring the Hamiltonian of the theory
into this form. Show that this substitution changes the action to $S_{\rm cl} \to S_{\rm cl} - at/2$. (If
you feel uncomfortable about the interpretation of the function $H$ in terms of operators,
derive the Fokker-Planck equation of the path integral, i.e. the analog of the
Schrödinger equation. Re-arrange operators so as to make the Fokker-Planck operator
manifestly hermitian. Then construct the path integral representation of that hermitian
representation to arrive at the modified action stated above. Think why the alternative
representation corresponds to a "mid-point" discretization scheme, different from the
Ito representation used in the construction of the MSRJD functional.)

(c) Evaluate the determinant Eq. (10.119) to obtain the result

$$P(q_f, t\,|\,q_i, 0) = \left(\frac{a}{\pi A}\right)^{1/2} \frac{1}{\left(1 - e^{-2at}\right)^{1/2}}\; \exp\left[-\frac{a}{A}\,\frac{\left(q_i - q_f e^{at}\right)^2}{e^{2at} - 1}\right].$$

Notice that the shift $-at/2$ we obtained as a result of the change in the discretization
scheme cancels against a long-time divergent factor in the fluctuation determinant.


Answer: 


(a) The classical equations of motion read

$$\dot q = \partial_p H = -a q - A p, \qquad \dot p = -\partial_q H = a p,$$

with the solution $p(t) = p_0 e^{at}$ and $q(t) = c\,e^{-at} - \frac{A p_0}{2a} e^{at}$. From the boundary conditions,
one finds $\frac{A p_0}{2a} = \frac{q_i - q_f e^{at}}{e^{2at} - 1}$ and $c = \frac{q_i e^{at} - q_f}{e^{at} - e^{-at}}$. Substituted into the
classical action, after some algebra, one obtains

$$S_{\rm cl} = -\int_0^t dt' \left(p\dot q + a p q + \frac{A}{2} p^2\right) = \frac{a}{A}\,\frac{\left(q_i - q_f e^{at}\right)^2}{e^{2at} - 1},$$

with $a/A = \gamma/2T$. (You may check for consistency that $\partial S_{\rm cl}/\partial q_i = p_i$.)

(b) The form of the Hamiltonian used in the path integral is only correct for the Ito
discretization. In order to apply the familiar techniques of quantum mechanics, we have to
symmetrize the Hamiltonian by modifying the "vector potential" term,

$$H = -a p q - \frac{A}{2} p^2 \;\hat{=}\; -\frac{A}{2} p^2 - a\,\frac{pq + qp}{2} - \frac{a}{2},$$

where the symbol $\hat{=}$ designates a correspondence that would apply if the function
$p$ in the Hamiltonian were to be interpreted as an operator $p = \partial_q$. This leads to an
additional factor in the exponential which is linear in $t$, $S_{\rm cl} \to S_{\rm cl} - at/2$. More rigorously,
we may observe that the full information on the path integral is stored in the Fokker-Planck
equation

$$\left(-\partial_t + a\,\partial_q q + \frac{A}{2}\,\partial_q^2\right) P = 0.$$

The "Fokker-Planck" operator can be rewritten as

$$\frac{a}{2}\left(\partial_q q + q\,\partial_q\right) + \frac{A}{2}\,\partial_q^2 + \frac{a}{2} = \frac{A}{2}\left(\partial_q + \frac{a}{A}\,q\right)^2 - \frac{a^2}{2A}\,q^2 + \frac{a}{2},$$

i.e. in a form resembling a harmonic oscillator Hamilton operator subject to a gauge
potential $aq/A$. If we now apply the standard rules of path integral construction in
quantum mechanics (a mid-point discretization to properly deal with the gauge potential
understood) to the new Fokker-Planck/Hamilton operator, we arrive at the imaginary
time integral for our Hamiltonian function $H$ (on the level of Hamiltonian functions,
ordering is not an issue), shifted by the constant $at/2$. The appearance of this constant
can be attributed to a change in the discretization scheme.

(c) A straightforward exercise in differentiation. 
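
For readers who prefer to see the "exercise in differentiation" carried out, here is a small numerical sketch. The values chosen for $a$ and $A$ and all function names are assumptions made for illustration. It builds the prefactor from a finite-difference mixed derivative of $S_{\rm cl}$, includes the shift $at/2$ from the change of discretization, and compares the product with the closed-form transition probability of part (c); it also checks that the latter is normalized in $q_f$.

```python
import math

A_DRIFT, NOISE = 0.8, 1.5   # a and A; illustrative values (assumptions)

def s_cl(qi, qf, t):
    """Classical action S_cl = (a/A) (q_i - q_f e^{at})^2 / (e^{2at} - 1)."""
    return ((A_DRIFT / NOISE) * (qi - qf * math.exp(A_DRIFT * t)) ** 2
            / math.expm1(2 * A_DRIFT * t))

def propagator_from_action(qi, qf, t, h=1e-4):
    """sqrt(|d^2 S_cl / dq_i dq_f| / 2 pi) * exp(-(S_cl - a t / 2));
    the mixed derivative is taken by central finite differences."""
    mixed = (s_cl(qi + h, qf + h, t) - s_cl(qi + h, qf - h, t)
             - s_cl(qi - h, qf + h, t) + s_cl(qi - h, qf - h, t)) / (4 * h * h)
    return math.sqrt(abs(mixed) / (2 * math.pi)) * math.exp(-s_cl(qi, qf, t) + A_DRIFT * t / 2)

def propagator_closed_form(qi, qf, t):
    """Transition probability quoted in part (c), written with 1 - e^{-2at}."""
    decay = -math.expm1(-2 * A_DRIFT * t)
    return math.sqrt(A_DRIFT / (math.pi * NOISE * decay)) * math.exp(
        -(A_DRIFT / NOISE) * (qf - qi * math.exp(-A_DRIFT * t)) ** 2 / decay)

qi, t = 0.4, 1.2
mismatch = max(abs(propagator_from_action(qi, qf, t) - propagator_closed_form(qi, qf, t))
               for qf in (-1.0, 0.0, 0.7))
dq = 0.01   # Riemann sum over q_f on a wide grid
norm = sum(propagator_closed_form(qi, -10 + k * dq, t) for k in range(2001)) * dq
print(mismatch, norm)
```

Without the factor $e^{at/2}$ the two expressions would differ by exactly that factor, which is the cancellation against the long-time divergent part of the determinant mentioned in the problem.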


10.9.4 Directed percolation 

This problem is of a more technical nature. Within the framework provided by the field theory Eq. (10.112),
we explore the role of fluctuations in directed percolation slightly below the upper critical dimension,
$d = 4 - \epsilon$. This leads us to a set of RG equations, whose physical significance is discussed in the main
text.
Phase transitions in the directed percolation universality class are described by the field theory
Eq. (10.112). As with our previous examples, contractions of the fields $\phi$, $\psi$ w.r.t. the free
action of the theory, $\kappa = 0$, generate a Green function,

$$\langle \phi_q \psi_{q'} \rangle_0 = g_q\,\delta_{q,-q'}, \qquad g_q = \frac{1}{-i\omega + Dk^2 - r}, \qquad (10.120)$$

where $g_q$ relates to the advanced and retarded Green functions used in earlier sections of
the text as $g_q = g^+_q = g^-_{-q}$, and $q = (\omega, \mathbf{k})$ is the four-momentum.



(a) Perturbation theory: In order to become familiar with the structure of perturbation
theory that treats the interaction $\kappa$ as a small parameter, we consider the form of the
perturbative corrections to the Green function and the vertex. Show that the Green
function generalizes to $g^{-1} \to g^{-1} - \Sigma$, where, to lowest order in $\kappa$, the self-energy
correction reads as

$$\Sigma_q = -2\kappa^2 \int \frac{d^d k'}{(2\pi)^d}\, \frac{1}{-i\omega + Dk'^2 + D(\mathbf{k} - \mathbf{k}')^2 - 2r}. \qquad (10.121)$$

Show that the lowest order correction to the coupling constant $\kappa \to \kappa + \delta\kappa$ appears at
third order and has the form

$$\delta\kappa \simeq -2\kappa^3 \int \frac{d^d k'}{(2\pi)^d}\, \frac{1}{(Dk'^2 - r)^2}. \qquad (10.122)$$

Convince yourself that the diagrams contributing to the renormalization of $\kappa$ and $r$
correspond to the "configuration space" processes exemplified by the figure above.

(b) Renormalization group: Consider the theory regularized with a hard cutoff $\Lambda$ for the
(spatial) momentum integral. We rescale (spatial) momentum by this cutoff, $k \to k/\Lambda$,
which means that the momentum integrals now extend over the support $|k| < 1$ and
all coupling constants are measured in units $\Lambda^{d_x}$, where $d_x$ is the relevant engineering
dimension.

Integrate out perturbatively the spatial fast modes within the momentum shell $(1, 1/b)$
with $0 < \ln b \ll 1$. Show that in spatial dimension $d = 4 - \epsilon$, this modifies the coupling
constants in the following way,

$$\omega \to \omega\left(1 - \frac{\chi}{2}\ln b\right), \qquad D \to D\left(1 - \frac{\chi}{4}\ln b\right),$$

$$r \to r\left(1 - \left(\frac{D}{r} + 1\right)\chi \ln b\right), \qquad \kappa \to \kappa\left(1 - 2\chi\ln b\right),$$

where we introduced

$$\chi = \frac{S_4}{(2\pi)^4}\left(\frac{\kappa}{D}\right)^2, \qquad (10.123)$$

and $S_4$ is the area of the four-dimensional unit sphere. Next, rescale coordinates and
fields as $q \to q/b$, $\omega \to \omega/b^z$, $\phi \to \phi\, b^{d/2 + z + x}$. Choose the dynamical exponent $z$ and the
field renormalization exponent $x$ so as to make the diffusion constant $D$ and the coefficient
of frequency invariant under renormalization. Show that this condition generates
the equations

$$x = \frac{\chi}{4}, \qquad z = 2 - \frac{\chi}{4}.$$

The "mass term" $r$ and the interaction constant $\kappa$ flow according to Eq. (10.115). The
meaning of these equations is discussed in the main text.

Figure 10.16 Skeleton structure of the two diagrams contributing to the renormalization of the
propagator (left) and interaction vertex (right), respectively.

Answer: 


(a) In Fourier basis, the interaction contribution to the action $S = S_0 + S_{\rm int}$ is given by

$$S_{\rm int}[\phi, \psi] = \kappa \int dq\, dq'\; \phi_q \psi_{q'} \left(\phi_{-q-q'} - \psi_{-q-q'}\right),$$

where the measure $dq = d\omega\, d^d k/(2\pi)^{d+1}$. Expanding the action to second order in $\kappa$,
and applying the contraction rule Eq. (10.120), we obtain the self energy

$$\Sigma_q = -2\kappa^2 \int dq' \left(g_{q-q'}\, g_{q'} + g_{q+q'}\, g_{q'}\right).$$

The frequency integration $\int d\omega'$ over $g_{q+q'}\, g_{q'}$ vanishes because the integrand falls off as
$\sim 1/\omega'^2$ and has no poles in the upper complex plane. Doing the contour integration
over the remaining contribution, we readily obtain Eq. (10.121). The diagrammatic
representation of this term is shown in Fig. 10.16, where the external field vertices (not
contributing to the self energy) have been included for the sake of notational clarity.

To obtain the renormalization of the coupling constant of the interaction, $\kappa$, it is
necessary to develop the third order of perturbation theory. The Wick contraction of
these terms (up to the "external field vertices" entering the interaction operator, $\sim \psi\phi^2$
and $\sim \psi^2\phi$) obtains two one-loop diagrams. In one of these all Green functions have their
poles on one side of the real axis, which implies vanishing upon frequency integration.
The survivor diagram, whose graphical representation is shown in Fig. 10.16, has the
analytic representation
f 1)3 2 4 3 f f 

— -t — ^j-K 3 / dqdq'dq" (frq’^'i-tq-q'-q" +VW-9") / dpg^g-p, 
where we neglected the "small" momenta $q, q', q''$ in the arguments of the "fast" Green
functions $g_p$. The required renormalization of $\kappa$ thus reads $\delta\kappa = -(2\kappa)^3 \int dq\; g_q^2\, g_{-q}$.
Doing the integral over frequency, we obtain Eq. (10.122).

(b) Consider the contribution $\delta\Sigma$ to Eq. (10.121) due to the integration over the fast momentum
layer $\int_f d^d k' \equiv \int_{1/b < |k'| < 1} d^d k'$. An expansion of the self energy in "small" corrections
to the fast momenta obtains

$$\delta\Sigma_q \simeq -2\kappa^2 \left(\frac{I_{-2}}{2D} + \frac{(i\omega + 2r)\, I_{-4}}{(2D)^2} - \frac{k^2\, I_{-4}}{8D}\right),$$

where we introduced $I_n = \int_f \frac{d^d k'}{(2\pi)^d}\, k'^n$. Likewise, the fast momentum contribution to the
coupling constant correction Eq. (10.122) reads

$$\delta\kappa \simeq -\frac{2\kappa^3}{D^2}\, I_{-4}.$$

In dimensions $4 - \epsilon$, we have $I_{-2} \simeq I_{-4} \simeq \frac{S_4}{(2\pi)^4} \ln b$. Substituting this result into the
fast-fluctuation induced change of the Green function, $g^{-1} \to g^{-1} - \delta\Sigma$, we obtain the
required renormalization of coupling constants.

The re-scaling of coordinates and fields modifies the renormalization of coupling constants
according to

$$\omega \to \omega\left(1 + \left(2x - \frac{\chi}{2}\right)\ln b\right), \qquad D \to D\left(1 + \left(2x + z - 2 - \frac{\chi}{4}\right)\ln b\right),$$

$$r \to r\left(1 + \left(2x + z - \left(\frac{D}{r} + 1\right)\chi\right)\ln b\right), \qquad \kappa \to \kappa\left(1 + \left(\frac{\epsilon}{2} + 3x + z - 2 - 2\chi\right)\ln b\right).$$

Invariance of the first two terms generates the required conditions on $x$ and $z$. We
finally substitute $x$ and $z$ into the remaining two equations to obtain the RG equations
(10.115).
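
The bookkeeping of the last step is easy to get wrong, so a small exact-arithmetic sketch may help. It assumes the one-loop factors in the form $\omega \to \omega(1 + (2x - \chi/2)\ln b)$, $D \to D(1 + (2x + z - 2 - \chi/4)\ln b)$ and $\kappa \to \kappa(1 + (\epsilon/2 + 3x + z - 2 - 2\chi)\ln b)$; the function names are hypothetical, and whether the resulting flow is written in the same form as Eq. (10.115) is not checked here.

```python
from fractions import Fraction

def one_loop_exponents(chi):
    """Solve the two invariance conditions
         2x - chi/2 = 0            (coefficient of frequency)
         2x + z - 2 - chi/4 = 0    (diffusion constant)
    for the field exponent x and the dynamical exponent z."""
    x = chi / 4
    z = 2 - chi / 4
    return x, z

def kappa_growth_rate(chi, eps):
    """d ln(kappa) / d ln(b) = eps/2 + 3x + z - 2 - 2 chi, with x and z fixed as above."""
    x, z = one_loop_exponents(chi)
    return eps / 2 + 3 * x + z - 2 - 2 * chi

eps = Fraction(1, 10)      # d = 4 - eps; illustrative value
chi_star = eps / 3         # root of eps/2 - 3 chi / 2 = 0
x_star, z_star = one_loop_exponents(chi_star)
print(kappa_growth_rate(chi_star, eps), z_star, 2 - eps / 12)
```

Under these assumptions the coupling stops flowing at $\chi^* = \epsilon/3$, where the dynamical exponent takes the value $z = 2 - \epsilon/12$.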
In the previous chapter we have seen that departures from equilibrium generate a plethora of new
phenomena in statistical physics. While our discussion so far has been limited to classical phenomena, in 
this chapter we want to ask how quantum mechanics interferes with the conditions of nonequilibrium. 
Experience shows that, in some cases, nonequilibrium quantum systems can be understood in terms 
of reasonably straightforward quantization of their classical limits. In others, an interplay of quantum 
coherence and nonequilibrium driving leads to entirely new phenomena - such as lasing, a paradigm for 
a steady state nonequilibrium quantum system. At any rate, we have every reason to anticipate that 
quantum nonequilibrium physics will be as diverse and colourful as its classical counterpart. 


Even minor departures from equilibrium call for a theoretical description entirely different
from the 'Matsubara formalism' developed in earlier chapters of this book. The applicability
of the latter is rigidly tied to the existence of a quantum grand canonical density operator,
the hallmark of a many particle equilibrium system. But how, then, can a functional
theory of nonequilibrium quantum systems be developed? An obvious idea would be to
once more go through the different elements of classical nonequilibrium theory developed in
the previous chapter, and subject every one of them to an individual quantization scheme.
Luckily, there are more efficient ways to achieve our goal: some decades ago, a many particle
formalism suitable to describe nonequilibrium systems under the most general conditions¹
was introduced by Keldysh.² For a number of reasons, the Keldysh formalism will be the
principal tool in our approach to quantum nonequilibrium physics: (i) It is highly flexible
and can be applied to study the physics of practically any quantum system, in and out of
equilibrium. (ii) Although the Keldysh technique was introduced in a pre-path integral era,
it is tailor-made to functional integral formulations. (iii) The classical limit of the Keldysh
field integral turns out to be the MSRJD functional integral extensively discussed in the
previous chapter. In this sense, the Keldysh functional will be our 'theory of everything';
from it the entire body of nonequilibrium theory, both quantum and classical, can be derived
by reduction.

The discussion above motivates a strategy different from that pursued in our introduc- 
tory discussion of nonequilibrium physics above. In a “top-down” approach, we begin by 


1 By “most general,” we mean that the Keldysh formalism enjoys applicability independent of the distribution 
(operator) describing the state of the system under consideration. 

2 L. V. Keldysh, Diagram technique for nonequilibrium processes, Sov. Phys. JETP 20 , 1018-26 (1965). 
a formal introduction of the Keldysh formalism. We alternatively interpret the theory as
a generalization of the Matsubara formalism, and as a quantum extension of the MSRJD 
field integral. After these conceptual structures have been thoroughly introduced, we will 
turn to the discussion of applications. 

INFO In reference to the stochastic aspects of classical nonequilibrium processes, we began 
the last chapter with a short treatise on probability. Quantum mechanics, on the other hand, 
is a theory that is intrinsically probabilistic; the most we can hope for is to obtain results on 
the quantum probability to measure certain values of observables. The theory of quantum 
nonequilibrium processes, therefore, involves two “layers” of stochasticity, quantum probability 
and the extrinsic stochasticity of nonequilibrium processes. As it is important not to confuse 
these two, we begin by asking what sort of probability distributions might suitably generalize 
the distributions employed in the previous, classical chapter. 

To start, let us consider a classically deterministic system, prepared in a definite quantum
state $|\psi\rangle$. The (quantum) expectation value to measure a certain observable is then given by
$\langle \hat X \rangle = \langle \psi|\hat X|\psi\rangle = \sum_n X_n |\psi_n|^2$, where $X_n$ is the nth eigenvalue of $\hat X$ (the generalization to a
continuous spectrum is straightforward), $\psi_n = \langle n|\psi\rangle$, and $|n\rangle$ is the nth eigenstate. We denote
the quantum expectation value by ordinary brackets to distinguish it from the average over
fluctuations below.

The equation above suggests an interpretation of $P_X(x) = \sum_n \delta(x - X_n)|\psi_n|^2$ as a "quantum
probability" distribution of the operator $\hat X$. This notation is suggestive in that it allows us to
express the computation of quantum observables in formal analogy to the probabilistic formalism
used in the previous chapter. For example, $\langle \hat X^n \rangle = \int dx\, P_X(x)\, x^n$, and $P_X(x) = \langle \delta(x - \hat X)\rangle$, etc.
While the quantity $P_X$ underpins the probabilistic nature of quantum mechanics, its usefulness
rapidly reaches its limits.³

For our present purposes, an alternative representation of quantum expectation values will
prove more useful: the above expression may equivalently be written as $\langle \hat X \rangle = \mathrm{tr}(\hat\rho_\psi \hat X)$, where
$\hat\rho_\psi = |\psi\rangle\langle\psi|$ is a particular type of a quantum density operator, viz. a projector onto the
one-dimensional space spanned by $|\psi\rangle$. The alternative representation can be readily generalized
to the - more realistic - situation where the state of the system is not known with certainty.
In general, $\hat\rho = \sum_a p_a |\psi_a\rangle\langle\psi_a|$, where $\{|\psi_a\rangle\}$ is a complete orthogonal set, and the positive
coefficients $p_a$, $\sum_a p_a = 1$, represent the probability to find the system in state $|\psi_a\rangle$. In a
many particle system, the inevitable presence of fluctuations - cf. the principles discussed in the
previous chapter - may render the coefficients $p_a$ effectively random. These fluctuations may be
externally imposed, or caused by integration over microscopic degrees of freedom of the system.
Summarily denoting the average over fluctuations by $\langle \ldots \rangle_f$, we then have

$$\langle \hat X \rangle \to \langle\langle \hat X \rangle\rangle_f = \langle \mathrm{tr}(\hat\rho \hat X)\rangle_f = \mathrm{tr}(\langle \hat\rho \rangle_f\, \hat X) = \sum_a \langle p_a \rangle_f\, \langle \psi_a|\hat X|\psi_a\rangle.$$

This representation illustrates the division of work between quantum and statistical fluctuations. 
In a canonical (basis-invariant) manner, all aspects of quantum mechanics - interference, wave 
coherence, etc. - are encapsulated in the mathematical properties of the density operator and 


3 The function $P_X$ suggests deceptively that the basic information on the state of a system is stored in the
(positive definite) probability $|\psi_n|^2$. However, we know that the essence of quantum mechanics resides in wave
function amplitudes. For example, it is not possible to unambiguously generalize the notion of $P_X$ to that of a
joint probability $P_{XY}(x,y)$ of two non-commuting operators. This means that positive definite probability
distributions such as the one above will not suffice to describe quantum processes.
the trace operation. The statistics of fluctuations (both thermal, and externally imposed) is
contained in the average over coefficients, (. . . )f. We repeat that these latter fluctuations may 
effectively be generated by a quantum average (i.e. a trace operation) over microscopic degrees 
of freedom of the system. 

The discussion above suggests the following transcription of the probabilistic elements of
classical nonequilibrium mechanics into quantum mechanics:

classical                                              | quantum
variable $X$                                           | hermitian operator $\hat X$
values of $X$, $x$                                     | eigenvalues of $\hat X$, $x$
probability distribution $p$                           | density operator $\hat\rho$
moments $\langle X^n \rangle = \int dx\, p(x)\, x^n$   | moments $\langle \hat X^n \rangle = \mathrm{tr}(\hat\rho \hat X^n)$

Notice that if and only if $\hat\rho$ is an operator function of $\hat X$ (which will typically happen if
$\hat X = \hat H$ is energy), $\langle \hat X^n \rangle = \mathrm{tr}(\hat\rho \hat X^n) = \int dx\, \rho(x)\, x^n$, where $\rho(x)$ are the eigenvalues of
$\hat\rho(\hat X)$. In this particular case, we have a normalized ($\int dx\, \rho(x) = 1$) and positive quantum
probability distribution. In general, however, quantum statistical mechanics cannot be described
in terms of positive definite probability functions. Rather, the relevant information is stored in
density operators, and it is these objects we need to understand.
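
The two ways of computing a quantum expectation value can be made concrete for a two-level system. The following sketch is purely illustrative: the observable (the Pauli matrix $\sigma_x$), the states and the weights are assumptions chosen for simplicity. It evaluates $\mathrm{tr}(\hat\rho \hat X)$ for a pure state and for a statistical mixture of two orthogonal states.

```python
import math

def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

def projector(psi):
    """|psi><psi| for a state given by its components in a fixed basis."""
    return [[x * y.conjugate() for y in psi] for x in psi]

# Hypothetical example: X = sigma_x with eigenvalues +1, -1 and eigenstates |plus>, |minus>.
X = [[0, 1], [1, 0]]
plus = [1 / math.sqrt(2) + 0j, 1 / math.sqrt(2) + 0j]
minus = [1 / math.sqrt(2) + 0j, -1 / math.sqrt(2) + 0j]

# Pure state |plus>: tr(rho X) equals sum_n X_n |psi_n|^2 = (+1)*1 + (-1)*0.
mean_pure = trace(matmul(projector(plus), X)).real

# Mixture rho = sum_a p_a |psi_a><psi_a| with weights p_plus = 0.75, p_minus = 0.25.
weights = (0.75, 0.25)
rho_mixed = [[weights[0] * u + weights[1] * v for u, v in zip(row_p, row_m)]
             for row_p, row_m in zip(projector(plus), projector(minus))]
mean_mixed = trace(matmul(rho_mixed, X)).real        # 0.75*(+1) + 0.25*(-1)
purity = trace(matmul(rho_mixed, rho_mixed)).real    # tr(rho^2) < 1 signals a mixture
print(mean_pure, mean_mixed, purity)
```

Because this particular $\hat\rho$ is diagonal in the eigenbasis of $\hat X$, it is an operator function of $\hat X$ and the weights $p_a$ play the role of a classical probability distribution.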


11.1 Prelude: Quantum master equation 

Before turning to the systematic construction of 
the Keldysh formalism, let us introduce a few ele- 
ments of the quantum mechanics of nonequilibrium 
systems in more elementary terms. This discussion 
will get us acquainted with various “quick and dirty” 
routes to the description of nonequilibrium systems 
but will also motivate the construction of the more 
systematic theories below. (Readers wishing to pro- 
ceed in a maximally streamlined manner may skip 
this discussion and directly turn to section 11.2.1.) 

We consider the standard setup of a “system” coupled to a “bath” (cf. the figure.) The 
bath may alternatively be interpreted as a large system of external degrees of freedom 
(e.g. the oscillator modes of an external electromagnetic field), or in terms of microscopic 
internal degrees of freedom affecting the more macroscopic (but still quantum) dynamics of 
the degrees of freedom we consider to be the “system.” By straightforward adaptation of 
the principles discussed in the previous chapter, we interpret the degrees of freedom of the 
bath as unobservable. Integration over these degrees of freedom produces an effective, and 
foreseeably irreversible dynamics of the system. Our immediate goal is the derivation of the 
equations of motion governing this effective dynamics. 
11.1.1 Derivation of the master equation


Consider the density operator $\hat\rho$ describing the state of the total system in the product
Hilbert space $\mathcal{H} = \mathcal{H}_s \otimes \mathcal{H}_b$ of system (s) and bath (b). The reduced information in which
we are interested is stored in the quantity $\hat\rho_s = \mathrm{tr}_b(\hat\rho): \mathcal{H}_s \to \mathcal{H}_s$, where $\mathrm{tr}_b$ denotes the
trace ("integration") over the Hilbert space of the bath. The reduced density matrix $\hat\rho_s$
is a linear operator acting in the system Hilbert space $\mathcal{H}_s$. The dynamics of the full system
is controlled by the Hamiltonian operator $\hat H = \hat H_s + \hat H_b + \alpha \hat H_i$, where the coupling between
system and bath, $\hat H_i$, is weighted by a dimensionless coupling constant $\alpha$. We assume that,
at some initial time $t = 0$, (a) the system and bath were independent, and (b) the
bath was in a state of thermal equilibrium, $\hat\rho(0) = \hat\rho_s(0) \otimes \hat\rho_b^{\rm eq}$.

We now aim to formulate an equation of motion for the reduced density matrix. Reflecting
the applied relevance of this problem, plenty of different strategies to achieve this goal have
been formulated. Here, we derive the equation by a formally exact projector formalism,⁴
and then compare to other approaches. The dynamics of the density operator is governed
by the equation of motion

$$(\partial_t - \hat L)\hat\rho = 0,$$

where $\hat L = \hat L_s + \hat L_b + \alpha \hat L_i$, and $\hat L_{s,b,i} = -i[\hat H_{s,b,i},\,\cdot\,]$ are "quantum Liouville operators." We
next introduce the projector

$$\mathcal{P} = \hat\rho_b^{\rm eq}\, \mathrm{tr}_b(\,\cdot\,),$$

where the trace operation acts on everything to the right, and unit normalization of $\hat\rho_b^{\rm eq}$ is
assumed. (As an exercise, consider why this is a projector.) To simplify the derivation, we
assume that the thermal trace of the interaction operator over $\mathcal{H}_b$ vanishes,⁵ $\mathrm{tr}_b(\hat\rho_b^{\rm eq} \hat H_i) = 0$.
Our projector thus obeys the equations,

$$\mathcal{P}\hat L_b = \hat L_b \mathcal{P} = 0, \qquad [\hat L_s, \mathcal{P}] = 0, \qquad \mathcal{P}\hat L_i \mathcal{P} = 0, \qquad (11.1)$$

where the first follows from the cyclic invariance of the trace, the second should be obvious,
and the third expresses the presumed vanishing of the interaction under the bath-trace.
Introducing the complementary projector $\mathcal{Q} = \mathrm{id} - \mathcal{P}$, and the shorthand notation
$\hat\rho_P = \mathcal{P}\hat\rho$, $\hat\rho_Q = \mathcal{Q}\hat\rho$, the quantum Liouville equation may now be split into two,

$$\partial_t \hat\rho_P = \hat L_s \hat\rho_P + \alpha\, \mathcal{P}\hat L_i \hat\rho_Q,$$

$$\partial_t \hat\rho_Q = \mathcal{Q}\hat L \hat\rho_Q + \alpha\, \mathcal{Q}\hat L_i \hat\rho_P. \qquad (11.2)$$
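
The exercise on the projector property can be previewed numerically. The sketch below is an illustration under stated assumptions: a two-level system and a two-level bath, an arbitrary normalized bath state, and an index convention chosen for convenience; all names are hypothetical. It applies the map "trace out the bath, then reattach the equilibrium bath state" twice and compares with a single application.

```python
# Index convention (an assumption of this sketch): rho[s][b][s2][b2] = <s b| rho |s2 b2>.
DIM_S, DIM_B = 2, 2
RHO_B_EQ = [[0.8, 0.0], [0.0, 0.2]]   # normalized bath state, illustrative numbers

def trace_bath(rho):
    """Partial trace over the bath: (tr_b rho)[s][s2] = sum_b rho[s][b][s2][b]."""
    return [[sum(rho[s][b][s2][b] for b in range(DIM_B)) for s2 in range(DIM_S)]
            for s in range(DIM_S)]

def project(rho):
    """P rho = rho_b^eq (x) tr_b(rho), returned in the same four-index layout."""
    rho_s = trace_bath(rho)
    return [[[[rho_s[s][s2] * RHO_B_EQ[b][b2] for b2 in range(DIM_B)]
              for s2 in range(DIM_S)] for b in range(DIM_B)] for s in range(DIM_S)]

def flat(rho):
    return [rho[s][b][s2][b2] for s in range(DIM_S) for b in range(DIM_B)
            for s2 in range(DIM_S) for b2 in range(DIM_B)]

def full_trace(rho):
    return sum(rho[s][b][s][b] for s in range(DIM_S) for b in range(DIM_B))

# Arbitrary test operator (it need not be a physical density matrix for this check).
rho = [[[[0.1 * (1 + s + 2 * b + 3 * s2 + 5 * b2) for b2 in range(DIM_B)]
         for s2 in range(DIM_S)] for b in range(DIM_B)] for s in range(DIM_S)]
once, twice = project(rho), project(project(rho))
idempotency_error = max(abs(u - v) for u, v in zip(flat(once), flat(twice)))
trace_error = abs(full_trace(once) - full_trace(rho))
print(idempotency_error, trace_error)
```

Idempotency rests on the unit normalization of the bath state: with $\mathrm{tr}_b(\hat\rho_b^{\rm eq}) \neq 1$ a second application would rescale the result.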

The second of these equations is solved by (exercise)

$$\hat\rho_Q(t) = \alpha \int_0^t dt'\; e^{t' \mathcal{Q}\hat L \mathcal{Q}}\, \hat L_i\, \hat\rho_P(t - t').$$


4 S. Nakajima, On quantum theory of transport phenomena: steady diffusion, Prog. Theor. Phys. 20, 948-59
(1958); R. Zwanzig, Ensemble method in the theory of irreversibility, J. Chem. Phys. 33, 1338-41 (1960).
5 Should this condition not be met by the interaction under consideration, the term $\mathrm{tr}_b(\hat\rho_b^{\rm eq} \hat H_i) \neq 0$ can be
interpreted as part of the system Hamiltonian operator.
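Substitution of this result into the first equation obtains

$$\partial_t \hat\rho_P = \hat L_s \hat\rho_P + \alpha^2 \int_0^t dt'\; \mathcal{P}\hat L_i\, e^{t' \mathcal{Q}\hat L \mathcal{Q}}\, \hat L_i\, \hat\rho_P(t - t'), \qquad (11.3)$$

or, employing the projector operation $\mathcal{P}$,

$$\partial_t \hat\rho_s = \hat L_s \hat\rho_s + \alpha^2 \int_0^t dt'\; \left\langle \hat L_i\, e^{t' \mathcal{Q}\hat L \mathcal{Q}}\, \hat L_i \right\rangle_b \hat\rho_s(t - t'), \qquad (11.4)$$

where $\langle \ldots \rangle_b = \mathrm{tr}_b((\ldots)\hat\rho_b^{\rm eq})$.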

Equation (11.4) is commonly denoted a generalized master equation. But in fact, the
resemblance to a master equation is only formal. Rather, this equation (together with a
complementary equation for $\hat\rho_Q$) is an exact reformulation of the full quantum theory; the
complete information on quantum unitary time evolution resides in the time non-local memory
kernel on the right-hand side. In contrast, the master equation generally describes time-local
(Markovian) and irreversible processes. To obtain a "true" master equation one may
eliminate information in what effectively amounts to a Markovian approximation. Assuming
that the time scales over which $\hat\rho_s$ changes are large in comparison to the relaxation times $t'$
of the integral kernel, Eq. (11.4) reduces to a Markovian equation $\partial_t \hat\rho_s = (\hat L_s + \alpha^2 X)\hat\rho_s(t)$,
where $X = \int_0^\infty dt'\, \langle \hat L_i\, e^{t' \mathcal{Q}\hat L \mathcal{Q}}\, \hat L_i \rangle_b$.

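It is also customary to assume weak coupling, or small $\alpha$, so that the $\alpha$-dependence of
the exponent can be neglected (a variant of a "Born approximation"). The resulting simplifications
are best discussed in the language of the prototypical equation (11.3): defining
$\hat L_0 = \hat L_s + \hat L_b$, the integral kernel in that equation reduces to

$$\mathcal{P}\hat L_i\, e^{t' \mathcal{Q}\hat L_0 \mathcal{Q}}\, \hat L_i\, \hat\rho_P(t - t') = \mathcal{P}\hat L_i\, e^{t' \hat L_0}\, \hat L_i\, \hat\rho_P(t - t').$$

EXERCISE Use Eq. (11.1) above to verify this relation.

Further, defining $\hat H_0 = \hat H_s + \hat H_b$, and $\hat O(t) = e^{it\hat H_0}\, \hat O\, e^{-it\hat H_0}$ for the interaction picture time
representation of an operator $\hat O$, we have

$$\mathcal{P}\hat L_i\, e^{t' \hat L_0}\, \hat L_i\, \hat\rho_P(t - t') \simeq \mathcal{P}\hat L_i\, \hat L_i(-t')\, \hat\rho_P(t - t').$$

EXERCISE For an arbitrary operator $\hat O$, show that $e^{t\hat L_0}\, \hat O = \hat O(-t)\, e^{t\hat L_0}$. The equality above is
approximate because we neglect the action of $e^{t'\hat L_0}$ on the (slow) operator $\hat\rho_P$.

Using this result, we find that Eq. (11.4) reduces to the (Markovian) quantum master
equation

$$\partial_t \hat\rho_s = \left(\hat L_s + \alpha^2 \int_0^\infty dt'\, \left\langle \hat L_i(0)\, \hat L_i(-t') \right\rangle_b\right) \hat\rho_s. \qquad (11.5)$$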
11.1.2 Example: oscillator coupled to a bath


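It is instructive to study the meaning of the master equation on a concrete example: consider
a harmonic oscillator - the system - coupled to an assembly of other oscillators, the bath.
We describe this setup in terms of the Hamiltonian $\hat H = \hat H_s + \hat H_b + \alpha \hat H_i$, where

$$\hat H_s = \epsilon\left(a^\dagger a + \tfrac{1}{2}\right), \qquad \hat H_b = \sum_k \omega_k \left(a_k^\dagger a_k + \tfrac{1}{2}\right), \qquad \hat H_i = \sum_k \left(c_k\, a_k^\dagger a + \mathrm{h.c.}\right). \qquad (11.6)$$

Using the fact that $a(t) = e^{-i\epsilon t} a$ and $a_k(t) = e^{-i\omega_k t} a_k$, it is straightforward to verify that

$$\begin{aligned}
\alpha^2 \int_0^\infty dt'\, \left\langle \hat L_i(0) \hat L_i(-t') \right\rangle_b \hat\rho_s = {} & -\alpha^2 \sum_k |c_k|^2 \int_0^\infty dt' \\
& \times \Big( e^{-i(\omega_k - \epsilon)t'} \big( \langle \hat n_k \rangle (\hat\rho_s a a^\dagger - a^\dagger \hat\rho_s a) + \langle \hat n_k + 1 \rangle (a^\dagger a \hat\rho_s - a \hat\rho_s a^\dagger) \big) \\
& \quad + e^{+i(\omega_k - \epsilon)t'} \big( \langle \hat n_k \rangle (a a^\dagger \hat\rho_s - a^\dagger \hat\rho_s a) + \langle \hat n_k + 1 \rangle (\hat\rho_s a^\dagger a - a \hat\rho_s a^\dagger) \big) \Big) \\
= {} & \alpha^2 \pi \sum_k |c_k|^2\, \delta(\epsilon - \omega_k) \\
& \times \big( \langle \hat n_k \rangle (2 a^\dagger \hat\rho_s a - \hat\rho_s a a^\dagger - a a^\dagger \hat\rho_s) + \langle \hat n_k + 1 \rangle (2 a \hat\rho_s a^\dagger - \hat\rho_s a^\dagger a - a^\dagger a \hat\rho_s) \big) \\
\simeq {} & \pi \alpha^2 |c_\epsilon|^2 \rho(\epsilon) \big( \langle \hat n_\epsilon \rangle (2 a^\dagger \hat\rho_s a - \hat\rho_s a a^\dagger - a a^\dagger \hat\rho_s) + \langle \hat n_\epsilon + 1 \rangle (2 a \hat\rho_s a^\dagger - \hat\rho_s a^\dagger a - a^\dagger a \hat\rho_s) \big).
\end{aligned}$$

Here, $\hat n_k = a_k^\dagger a_k$ is the number operator of the bath, and we have neglected the subscript
'b' in $\langle \ldots \rangle = \langle \ldots \rangle_b$. In the last step we introduced the spectral density of the bath,
$\rho(\omega) = \sum_k \delta(\omega - \omega_k)$, and we assumed that $c_k = c_{\omega_k}$ and $\langle \hat n_k \rangle = \langle \hat n_{\omega_k} \rangle$ depend only on the
energy of the reference state.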

EXERCISE In the second equality above we assumed that [p s , (da] = 0. Verify that a relaxation 
of that assumption will lead to terms of the structure ~ P f ^ (hj) <du. />„ . where P f 

is the principal value integral. Interpret these expressions in terms of an energy shift of the 
oscillator energy due to virtual transitions into the bath. 

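Assuming once more commutativity of $\hat\rho_s$ with $\hat n = a^\dagger a$, we have $\hat L_s \hat\rho_s = 0$ and thence

$$\partial_t \hat\rho_s = \pi \alpha^2 |c_\epsilon|^2 \rho(\epsilon) \left[ \langle \hat n_\epsilon \rangle \left(2 a^\dagger \hat\rho_s a - \hat\rho_s a a^\dagger - a a^\dagger \hat\rho_s\right) + \langle \hat n_\epsilon + 1 \rangle \left(2 a \hat\rho_s a^\dagger - \hat\rho_s a^\dagger a - a^\dagger a \hat\rho_s\right) \right]. \qquad (11.7)$$

But what is the meaning of this equation? In fact, it will be instructive to look at Eq. (11.7)
from a number of different perspectives: the commutativity $[\hat\rho_s, \hat n] = 0$ implies that
$\hat\rho_s = \hat\rho_s(\hat n)$. Using commutator relations such as $a^\dagger \hat\rho_s(\hat n) = \hat\rho_s(\hat n - 1)\, a^\dagger$, it is then straightforward
to bring Eq. (11.7) into the form

$$\partial_t \hat\rho_s(\hat n) = 2\pi \alpha^2 |c_\epsilon|^2 \rho(\epsilon) \Big[ \underbrace{\langle \hat n_\epsilon \rangle \big( \hat n\, \hat\rho_s(\hat n - 1) - (\hat n + 1)\, \hat\rho_s(\hat n) \big)}_{\rm absorption} + \underbrace{\langle \hat n_\epsilon + 1 \rangle \big( (\hat n + 1)\, \hat\rho_s(\hat n + 1) - \hat n\, \hat\rho_s(\hat n) \big)}_{\rm emission} \Big].$$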
The right-hand side of this equation now affords a straightforward interpretation in terms of
absorption and emission of bath bosons (see figure). The absorption term is proportional to
the number of available bosons ($\langle \hat n_\epsilon \rangle$) resonant with the oscillator frequency, the golden
rule transition rate, $\sim |c_\epsilon|^2 \rho(\epsilon)$, and to the number of absorbing bosons (plus one).
Importantly, the absorption process contributes to the right-hand side in a manner symptomatic
of a master equation. The density operator (or better to say, the eigenvalue) $\rho_s(n)$ increases
due to an "in" process $\propto \rho_s(n - 1)$ and it diminishes due to an "out" process $\propto \rho_s(n)$. There is also
an emission process wherein bath bosons are resonantly created from oscillator bosons.
Emission, too, enters the master equation as a sum of an in and an out process.

A few more comments can be made on the structure of the master equation: 

▷ The concerted appearance of in- and out-terms implies the conservation of the trace of the
density operator (the quantum analogue of the conservation of probability): $\partial_t\, \mathrm{tr}(\hat\rho_s) = 0$
(exercise).

▷ Assume the bath is in equilibrium, $\langle \hat n_\epsilon \rangle = (\exp(\beta\epsilon) - 1)^{-1}$. It is then straightforward
to verify that the distribution $\hat\rho_s$ becomes stationary (the right-hand side of the master
equation vanishes) on the equilibrium configuration

$$\rho_s(n) = Z^{-1} e^{-\beta \epsilon n},$$

where $Z = (1 - e^{-\beta\epsilon})^{-1}$ is the normalizing partition function. Interaction with the bath
"thermalizes" the system at a temperature set by the bath temperature. Notice that
equilibration relies on the interplay of emission and absorption processes.
▷ It is worth noting that the in-terms (out-terms) in the master equation derive from the
terms $\sim a^\dagger \hat\rho_s a$, etc. ($\sim \hat\rho_s a^\dagger a$, etc.) in the parent equation (11.7). To understand the
heuristics behind this observation, recall that the interaction picture time evolution of
the density operator is given by (symbolic notation throughout) $\hat\rho_s(t) = \hat U(t)\, \hat\rho_s(0)\, \hat U^\dagger(t)$,
where $\hat U$ is the time evolution operator. We may visualize the time evolution described
by these operators by two lines in "time space," one directed forward and one backward,
cf. Fig. 11.1.

If we now probe the incremental change of $\hat\rho_s$ in time, i.e. compare $\hat\rho_s(t + \Delta t)$ with
$\hat\rho_s(t)$, we may linearly expand either $\hat U$ or $\hat U^\dagger$ in $\alpha \hat H_i$. This produces a factor $\alpha \hat H_i(t)$
acting either to the left or to the right of the time evolution operator, or an interaction
Hamiltonian acting at the end of the forward or the backward time contour. However, to
obtain a non-vanishing result, we need to expand to one more order. This gives a factor
$\sim \alpha \int dt'\, \hat H_i(t')$ which, again, may act to the left or the right of the density operator, or on
the forward or backward contour. All in all, we obtain four contributions, as visualized in
Fig. 11.1. Now, the contributions where $\hat H_i$ acts on different time contours (a) are formally
described by a density operator sandwiched between creation and annihilation operators,
(b) physically change the state of the system, and (c), alluding to diagrammatic language,
resemble "vertex corrections" in a "two-particle propagator." These are the in-terms.

Conversely, the contributions where $\hat H_i$ acts twice on the same contour (a) are described
by pairs of operators acting on one side of the density operator, and (b) do not change the
state of the system (more precisely, it is first changed at $t'$ and then changed back at $t$).
(c) These out-terms resemble self-energy corrections in a two-particle propagator. Indeed,
we have seen on various occasions that the (imaginary part of the) self-energy correction
represents a decay rate, i.e. the rate of an "out-process."

Figure 11.1 Evolution of the density operator (shaded box) along the double time contour. Left:
out-processes ("self-energy corrections"), right: in-processes ("vertex corrections"). For the
motivation for the "left" orientation of the time arrow, see below.

It is worth pointing out that our discussion of the quantum master equation so far has
been rather superficial (for a more substantial exposition, see Weiss,⁶ van Kampen,⁷ and
Haake⁸). Its primary purpose was to motivate a number of concepts relevant to the physics
of quantum nonequilibrium phenomena. We next turn to the construction of a
field-integral-based theory which will enable us to describe nonequilibrium phenomena in a
much broader setting.


11.2 Keldysh formalism: basics 
11.2.1 The idea 

Before venturing into the construction of the Keldysh nonequilibrium theory, let us first try 
to summarize what kind of theory we are after: 

1. Previously, we have argued that the object generalizing the probability distributions
which pervade the previous chapter is the quantum density operator $\rho$. Expectation
values of observables are evaluated by taking the traces of hermitian operators against
the dynamically evolved density operator.


6 U. Weiss, Quantum Dissipative Systems (World Scientific Publishing, 1993).

7 N.G. van Kampen, Stochastic Processes in Physics and Chemistry (Elsevier, 1992).

8 F. Haake, Quantum Signatures of Chaos (Springer-Verlag, 2001).

2. That dynamics is described by the (adjoint action of the) quantum time evolution
operator, $\rho \to U(t)\rho U^\dagger(t)$. We may interpret $U$ ($U^\dagger$) as the descriptor of quantum
time evolution forward (backward) in time. The notion of chronological directedness
becomes manifest in the interaction-picture representation of the evolution
operator $U$ as a time ordered exponential $U = T\exp(-i\int^{t} dt'\, H_{\rm int}(t'))$. Conversely,
we may generate $U^\dagger$ by time ordering along a reversed temporal contour,
$U^\dagger = \bar T\exp(-i\int_{t} dt'\, H_{\rm int}(t'))$, where $\bar T$ puts operators to the left that are
chronologically earlier.

3. In many applications we may safely assume that, at some initial time (set as $t = -t_0$
for definiteness) the system of interest was in a known state. For example,
$\rho(-t_0) = \exp(-\beta H)$ may have been an equilibrium distribution, or $\rho(-t_0) = \rho_s \otimes \rho_b$
in the case of a composite involving a system coupled to a bath. It is assumed that
the interactions/correlations/nonequilibrium conditions rendering the situation
non-trivial are slowly switched on at some later time.

4. From our experience with classical probability theory, we know that it is convenient to
work with generating functions. For instance, in equilibrium statistical mechanics,
one might wish to compute the Maxwell-Boltzmann generating functional (symbolic
notation) $G(h) = Z^{-1}\int\exp(-\beta(H - hX))$, where $Z = \int\exp(-\beta H)$ is the partition
function, $\int$ denotes an integration over all states of the system, $H$ is the Hamiltonian
and $X$ an observable of interest. Differentiation with respect to the sources $h$
generates moments of $X$ and we have the normalization $G(0) = 1$.

How can the concept of generating functions be generalized to the quantum
nonequilibrium case? Generalization to “quantum” is relatively obvious: replace
functions by operators, and integrals by traces. But what about “nonequilibrium”?
From what has been said above, it should be evident that we need to endow some
known quantum distribution $\rho(0)$ with a sense of dynamics, in a manner that leads
to a properly normalized, and generally time-dependent generating functional.

In the mid-1960s, Keldysh formulated a theory that solved the problems alluded to above
in one stroke.⁹ At first sight, Keldysh’s conceptual ansatz may look strange, if not
downright silly: suppose we are given an initial density operator $\rho$ which described our
system at some time $t = -t_0 \to -\infty$ in the distant past when the world was nice and easy -
no interactions, stochastic fluctuations, or similar complications. Our goal is to endow the
density operator with a sense of dynamics. To this end, consider an abstract parameter
interval, $[0, S]$. On this interval, we define an operator valued function $H(s)$ according to
the following rules: (i) $H(0) = -H(S) = H_0$, where $H_0$ is the “trivial” Hamiltonian operator
governing the system at time $-t_0$. (ii) As $s$ increases, the non-trivial elements of the
dynamics are gradually (adiabatically) switched on (cf. Fig. 11.2) until the Hamiltonian has
reached its full form $H(s) = H$. (iii) Upon approaching the center of the parameter interval,
we let the Hamiltonian diminish down to $H(S/2) = 0$. (iv) Beyond $s = S/2$, the profile of $H$
is anti-symmetrically continued, i.e. $H(s) = -H(S - s)$ for $s \in [S/2, S]$.


Leonid V. Keldysh 1931-

Former Director of the Lebedev Physical Institute, Moscow. Keldysh has made seminal
contributions to solid state theory from the physics of electron-hole excitations in
semiconductors to the development of techniques to explore quantum systems driven from
equilibrium. Recipient of the Lenin Prize, Lomonosov Prize, and the Hewlett-Packard Prize.


9 For earlier work in the same spirit, see J. Schwinger, Brownian motion of a quantum oscillator, J. Math. Phys.
2, 407-32 (1961), L.P. Kadanoff and G. Baym, Quantum Statistical Mechanics (Benjamin, New York, 1962),
R. P. Feynman and F. L. Vernon, The theory of a general quantum system interacting with a linear dissipative
system, Ann. Phys. 24, 118-73 (1963).

Figure 11.2 Top: parameter dependence of the system Hamiltonian along an “abstract” Keldysh
contour. Bottom: embedding of the contour in the complex time plane.

Now consider the unitary operator $U \equiv T_s\exp(-i\int_0^S ds\, H(s))$, where $T_s$ orders along the
$s$-contour. This operator is but a complicated representation of unity, $U = 1$. Heuristically,
the triviality of $U$ follows from the fact that the dynamic “phases” $\sim \int ds\, H(s)$ accumulated
along the forward branch of the contour cancel against those from the backward branch.
More formally, one may consider the generalization $U(s_2, s_1) \equiv T_s\exp(-i\int_{s_1}^{s_2} ds\, H(s))$ to
arbitrary segments on the contour. These operators satisfy the composition law (proof
is analogous to the one for conventional time ordered operators) $U(s_1, s_2)U(s_2, s_3) =
U(s_1, s_3)$, where $s_1 > s_2 > s_3$. Thus, $U = U(S, 0) = U(S, S/2)U(S/2, 0)$. But $U(S/2, 0) =
U(S, S/2)^{-1}$, by construction of the Hamiltonian. The equality $U = 1$ trivially implies

$$Z \equiv \mathrm{tr}(U\rho) = 1, \qquad (11.8)$$

on account of the normalization of $\rho$. We therefore have an object that is obtained by
dynamical evolution (1) of a known initial density operator (3), is time ordered (2), and
manifestly normalized (4). What is not clear, however, is what the “partition function” $Z$
might be good for!

At the same time, the operator $U\rho$ is not too different from the objects studied in the
previous section. There, we had considered dynamically evolved density operators $U\rho U^\dagger$,
$U \equiv U(t_0, -t_0)$. The trace over these operators, $\mathrm{tr}(U\rho U^\dagger) = \mathrm{tr}(U^\dagger U\rho) = Z$, can be
interpreted as a variant of the above partition function, where the curve $[0, S]$ is parameterized
in terms of a time argument running from $-t_0$ to $t_0$ and back. Pictorially (cf. Fig. 11.1),
the trace operation can be visualized by a closure of the lines emanating from the initial
density operator at a large time $t_0$.
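The triviality of the closed-contour evolution operator, and with it the normalization (11.8), can be checked numerically. The following Python sketch is only an illustration under stated assumptions: the two-level Hamiltonian, the particular switching profile, and all function names are hypothetical choices made for this example. It builds the contour-ordered product of slice propagators for a profile obeying $H(s) = -H(S - s)$ and confirms that the full product is the unit matrix up to rounding, even though the forward and backward parts do not commute slice by slice.

```python
import cmath
import math

def exp_minus_i(h, ds):
    """exp(-i*ds*h) for a 2x2 Hermitian matrix h, written as h = a*1 + b.sigma."""
    a = (h[0][0].real + h[1][1].real) / 2
    bz = (h[0][0].real - h[1][1].real) / 2
    bx, by = h[0][1].real, -h[0][1].imag
    b = math.sqrt(bx * bx + by * by + bz * bz)
    c = math.cos(b * ds)
    s = math.sin(b * ds) / b if b > 0 else ds
    phase = cmath.exp(-1j * a * ds)
    return [[phase * (c - 1j * s * bz), phase * (-1j * s * (bx - 1j * by))],
            [phase * (-1j * s * (bx + 1j * by)), phase * (c + 1j * s * bz)]]

def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

S = 10.0                          # length of the abstract parameter interval (assumed)
H0 = [[0.7, 0.0], [0.0, -0.7]]    # "trivial" Hamiltonian (assumed)
V = [[0.0, 1.3], [1.3, 0.0]]      # part switched on later; does not commute with H0

def hamiltonian(s):
    """H(0) = H0, H(S/2) = 0, and H(s) = -H(S - s) on the second half."""
    if s > S / 2:
        return [[-x for x in row] for row in hamiltonian(S - s)]
    f0 = 1.0 - 2.0 * s / S
    f1 = math.sin(2.0 * math.pi * s / S)
    return [[f0 * H0[i][j] + f1 * V[i][j] for j in range(2)] for i in range(2)]

def evolve(s_from, s_to, slices):
    """Contour-ordered product; later slices multiply from the left."""
    ds = (s_to - s_from) / slices
    u = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(slices):
        u = matmul(exp_minus_i(hamiltonian(s_from + (k + 0.5) * ds), ds), u)
    return u

U_full = evolve(0.0, S, 2000)
rho = [[0.8, 0.0], [0.0, 0.2]]    # normalized initial density matrix (assumed)
Z = sum(matmul(U_full, rho)[i][i] for i in range(2))
deviation = max(abs(U_full[i][j] - (1.0 if i == j else 0.0))
                for i in range(2) for j in range(2))
print(deviation, Z)
```

Calling `evolve(0.0, S / 2, 1000)` instead gives the forward half alone, which is generically a nontrivial unitary; only the closed contour collapses to unity.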

To get a first hint at the usefulness of the closed contour construction, let us add a
perturbation to $H(s)$ that breaks the symmetry between the contours: we generalize
$H(s) \to H(s) + \alpha\delta(s - s_0)X$, where $s_0 \in [0, S/2]$ is in the first half of the contour, $\alpha$ is a
number, and $X$ is some operator. Let $U_{\rm int}$ be the time evolution operator in the “interaction
representation” with respect to this perturbation. This representation is defined in perfect
analogy to the interaction representation of conventional time-dependent quantum mechanics,
only that the “time integrals” extend over the contour $[0, S]$, instead of over an interval on
the real time axis. Specifically, $U(\alpha) = U_0U_{\rm int}(\alpha) = U_{\rm int}(\alpha)$, where $U_0 = 1$ is the unperturbed
evolution operator, and $U_{\rm int}(\alpha) = 1 - i\alpha\int ds\,\delta(s - s_0)X(s) + O(\alpha^2) = 1 - i\alpha X(s_0) + O(\alpha^2)$,
where $X(s_0) = U_0^{-1}(s_0, 0)XU_0(s_0, 0)$. Insertion of this result into the trace leads to the result

$$i\partial_\alpha\big|_{\alpha=0}\,\mathrm{tr}(U(\alpha)\rho) = \mathrm{tr}(U^\dagger(s_0, 0)XU(s_0, 0)\rho) = \mathrm{tr}(X\rho(s_0)),$$

where $\rho(s_0) = U(s_0, 0)\rho U^\dagger(s_0, 0)$ is the density operator evolved to $s_0$ and we omitted
the subscript 0. Thus, differentiation of the trace $\mathrm{tr}(U(\alpha)\rho)$ with respect to the source
generates expectation values of operators in the dynamically evolved state of the system.
Put differently, $Z(\alpha) \equiv \mathrm{tr}(\rho U(\alpha))$ is a normalized ($Z(0) = 1$) generating function for
quantum expectation values of the operator $X$ in the dynamical state of the system -
which need not be in equilibrium. It thus appears that the closed time formalism efficiently
addresses the points (1)-(4) listed above. This is the basic idea behind Keldysh’s formalism.
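As a hedged numerical illustration of the generating-function property, the sketch below takes a two-level system with a time-independent Hamiltonian on the forward branch and its negative on the backward branch. It assumes the source enters as $H(s) \to H(s) + \alpha\delta(s - s_0)X$ inside $T_s\exp(-i\int ds\,H(s))$, so that the $\delta$-kick contributes a factor $\exp(-i\alpha X)$ at $s_0$. The matrices, the value of $s_0$, and the function names are hypothetical. The derivative $i\partial_\alpha Z$ at $\alpha = 0$ is estimated by a central finite difference and compared with $\mathrm{tr}(X\rho(s_0))$.

```python
import cmath
import math

def exp_minus_i(h, ds):
    """exp(-i*ds*h) for a 2x2 Hermitian matrix h, written as h = a*1 + b.sigma."""
    a = (h[0][0].real + h[1][1].real) / 2
    bz = (h[0][0].real - h[1][1].real) / 2
    bx, by = h[0][1].real, -h[0][1].imag
    b = math.sqrt(bx * bx + by * by + bz * bz)
    c = math.cos(b * ds)
    s = math.sin(b * ds) / b if b > 0 else ds
    phase = cmath.exp(-1j * a * ds)
    return [[phase * (c - 1j * s * bz), phase * (-1j * s * (bx - 1j * by))],
            [phase * (-1j * s * (bx + 1j * by)), phase * (c + 1j * s * bz)]]

def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(m):
    return [[m[j][i].conjugate() for j in range(2)] for i in range(2)]

def trace(m):
    return m[0][0] + m[1][1]

H = [[0.3, 0.5], [0.5, -0.3]]     # Hamiltonian on the forward branch (assumed)
X = [[1.0, 0.0], [0.0, -1.0]]     # observable coupled to the source (assumed)
rho = [[0.8, 0.0], [0.0, 0.2]]    # initial density matrix (assumed)
s0 = 1.7                          # position of the source on the forward branch

U_fwd = exp_minus_i(H, s0)        # U(s0, 0)
U_back = dagger(U_fwd)            # U(S, s0) = U(s0, 0)^{-1} on the closed contour

def Z(alpha):
    """Z(alpha) = tr(U(alpha) rho) with the delta-kick exp(-i*alpha*X) at s0."""
    kick = exp_minus_i(X, alpha)
    return trace(matmul(matmul(U_back, matmul(kick, U_fwd)), rho))

h = 1e-5
lhs = 1j * (Z(h) - Z(-h)) / (2 * h)          # i dZ/d(alpha) at alpha = 0
rho_s0 = matmul(matmul(U_fwd, rho), U_back)  # density matrix evolved to s0
rhs = trace(matmul(X, rho_s0))
print(Z(0.0), lhs, rhs)
```

With the opposite sign convention for the source term, the same check goes through with $-i\partial_\alpha$ in place of $i\partial_\alpha$.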


11.2.2 Case study 

As a first step towards the construction of a functional integral implementation of the
Keldysh partition function, we consider a miniature quantum system containing only a single
bosonic state of energy $\omega$. In a rather natural way, the construction of the corresponding
functional integral will introduce most concepts central to the Keldysh formalism, while the
notation remains refreshingly simple.

Consider, then, the partition function

$$Z = \mathrm{tr}\left(T_\gamma\, e^{-i\int_0^{2T} dt\, H(t)}\rho_0\right), \qquad (11.9)$$

where, setting $Z_0 = (1 - e^{-\beta(\omega - \mu)})^{-1}$,

$$\rho_0 = Z_0^{-1} e^{-\beta(H - \mu N)} \qquad (11.10)$$

is the equilibrium density operator, $H = \omega a^\dagger a$ the Hamiltonian operator and $N = a^\dagger a$, as
usual. The operator $T_\gamma$ orders along the contour $\gamma$: $(t = 0) \to (t = T) \to (t = 0)$, and
we denote the initial (final) time of the theory by $t = 0$ ($t = T$) instead of the $t = -t_0$
($t = t_0$) used in the previous section. Finally, the time-dependent Hamiltonian operator
reads $H(t) = H\,\mathrm{sgn}(T - t)$, where we ignore the stages of adiabatic switching on and off
alluded to in the previous section. (The straightforward generalization to a smoothly varying
$H(t)$ will render the notation slightly more complicated but is otherwise inconsequential.)


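EXERCISE Show that

$$\langle\psi|e^{c\,a^\dagger a}|\psi'\rangle = \exp\left(e^{c}\,\bar\psi\psi'\right), \qquad (11.11)$$

where $\langle\psi|$ and $|\psi'\rangle$ are boson coherent states. Hint: differentiate by $c$ to derive a first order
differential equation and use the uniqueness of its solution.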


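Let us now construct a coherent state field integral by the usual recipe - insertion of a large
number $2N$ of coherent state resolutions of unity into a time-slice dissection of $Z$. This
leads to expressions of the form,

$$\langle\psi^-_N|U_{-\epsilon}|\psi^-_{N-1}\rangle\langle\psi^-_{N-1}|U_{-\epsilon}\cdots|\psi^-_1\rangle\langle\psi^-_1|\cdots|\psi^+_1\rangle\langle\psi^+_1|\rho_0|\psi^-_N\rangle,$$

where $\epsilon = T/N$ and $U_\epsilon = e^{-i\epsilon H}$. The functional integral representation then becomes

$$Z = Z_0^{-1}\int D(\bar\psi, \psi)\, e^{-\sum_{j=1}^{N-1}\left[\bar\psi^+_{j+1}(\psi^+_{j+1} - a_+\psi^+_j) + \bar\psi^-_{j+1}(\psi^-_{j+1} - a_-\psi^-_j)\right]}$$
$$\qquad\times\, e^{-\bar\psi^+_1\psi^+_1 - \bar\psi^-_1\psi^-_1 + \kappa\bar\psi^+_1\psi^-_N + \bar\psi^-_1\psi^+_N}, \qquad (11.12)$$

where $\kappa = \exp(-\beta(\omega - \mu))$. At this stage, we would normally take a continuum limit,
$N \to \infty$ at fixed $T$. In the present context, however, it is rewarding to stay for a while
with the discrete representation, and interpret the exponent of the functional integral as a
bilinear form with the $(2N) \times (2N)$ dimensional matrix kernel

$$-iG^{-1} = \begin{pmatrix}
1 & & & & & & & -\kappa\\
-a_+ & 1 & & & & & & \\
 & \ddots & \ddots & & & & & \\
 & & -a_+ & 1 & & & & \\
 & & & -1 & 1 & & & \\
 & & & & -a_- & 1 & & \\
 & & & & & \ddots & \ddots & \\
 & & & & & & -a_- & 1
\end{pmatrix}, \qquad (11.13)$$

where $a_\pm \equiv 1 \mp i\epsilon\omega$. Let us explore a few properties of this matrix. First, it is straightforward
to verify that

$$\det(-iG^{-1}) = 1 - \kappa(a_+a_-)^{N-1}.$$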


EXERCISE Verify this result. Hint: use the identity $\det = \exp\,\mathrm{tr}\,\ln$. Expand the logarithm
in powers of the difference from the unit matrix, $(-iG^{-1} - 1)$ (essentially the side-diagonal
containing the coefficients $a_\pm$, plus the corner element $\kappa$). Take the trace and re-sum the series
into another logarithm. Exponentiation leads to the result.

Taking the limit $N \to \infty$, we obtain

$$\det(-iG^{-1}) = 1 - \kappa\left(1 + (\epsilon\omega)^2\right)^{N-1} = 1 - \kappa\left(1 + \left(\frac{\omega T}{N}\right)^2\right)^{N-1} \to 1 - \kappa = 1 - e^{-\beta(\omega - \mu)} = Z_0^{-1}.$$

Since $Z = Z_0^{-1}\det(-iG^{-1})^{-1}$, this result proves the unit normalization of the functional
Keldysh partition function.
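The determinant formula and its continuum limit can be cross-checked by brute force. The following Python sketch is an illustration, not part of the derivation: it assumes the cyclic bidiagonal structure of the kernel described above (unit diagonal, $-a_\pm$ on the side-diagonal of the two branches, $-1$ at the turning point, $-\kappa$ in the corner), and the parameter values and function names are hypothetical. It compares a Gaussian-elimination determinant at small $N$ with $1 - \kappa(a_+a_-)^{N-1}$ and then evaluates the closed form at large $N$.

```python
import math

def kernel(n, eps_omega, kappa):
    """The (2n)x(2n) matrix -i G^{-1} for n time slices per branch."""
    a_plus, a_minus = 1 - 1j * eps_omega, 1 + 1j * eps_omega
    dim = 2 * n
    m = [[0j] * dim for _ in range(dim)]
    for r in range(dim):
        m[r][r] = 1 + 0j
    for r in range(1, n):
        m[r][r - 1] = -a_plus            # forward branch
        m[n + r][n + r - 1] = -a_minus   # backward branch
    m[n][n - 1] = -1 + 0j                # turning point at t = T
    m[0][dim - 1] = -kappa + 0j          # closure through rho_0
    return m

def determinant(m):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in m]
    n = len(a)
    det = 1 + 0j
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(a[r][c]))
        if abs(a[p][c]) == 0:
            return 0j
        if p != c:
            a[c], a[p] = a[p], a[c]
            det = -det
        det *= a[c][c]
        for r in range(c + 1, n):
            f = a[r][c] / a[c][c]
            if f != 0:
                for k in range(c, n):
                    a[r][k] -= f * a[c][k]
    return det

def closed_form(n, omega, T, kappa):
    """1 - kappa*(a_+ a_-)^(n-1) with a_+ a_- = 1 + (omega*T/n)^2."""
    return 1 - kappa * (1 + (omega * T / n) ** 2) ** (n - 1)

omega, T, beta, mu = 1.0, 2.0, 1.0, 0.0      # illustrative parameters
kappa = math.exp(-beta * (omega - mu))
N = 8
det_numeric = determinant(kernel(N, omega * T / N, kappa))
det_closed = closed_form(N, omega, T, kappa)
print(det_numeric, det_closed, closed_form(4000, omega, T, kappa), 1 - kappa)
```

The last two printed numbers approach each other as $N$ grows, which is the statement $\det(-iG^{-1}) \to Z_0^{-1}$.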

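In later applications we will need to compute expectation values $\langle\psi_i\bar\psi_j\cdots\rangle$
by Wick’s theorem. For this purpose, we need to know the elementary contractions

$$G^{CC'}_{ij} \equiv -i\langle\psi^C_i\bar\psi^{C'}_j\rangle,$$

where $C, C' = \pm$. To this end, let us introduce the block decomposition
$G^{-1} = \begin{pmatrix} M^{++} & M^{+-}\\ M^{-+} & M^{--}\end{pmatrix}$. It is a straightforward (and
worthwhile) exercise to show that

$$\left[(M^{CC})^{-1}\right]_{ij} = -i\,\Theta(i - j)\,a_C^{\,i-j},$$

where $\Theta(n) = 1$ if $n \ge 0$ and zero otherwise. We may now use the general formulae for
the inversion of $2 \times 2$ block matrices to obtain $G^{++} = (M^{++} - \Sigma^{++})^{-1}$, where
$\Sigma^{++} = M^{+-}(M^{--})^{-1}M^{-+}$. Substitution of $\Sigma^{++}_{ij} = i\delta_{i1}\delta_{jN}a_-^{N-1}\kappa$ into this formula followed by a
straightforward series expansion in $\Sigma^{++}$ leads to the result

$$G^{++}_{ij} = -i a_+^{\,i-j}\left(\Theta(i - j) + \frac{\kappa(a_+a_-)^{N-1}}{1 - \kappa(a_+a_-)^{N-1}}\right). \qquad (11.14)$$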

INFO This formula affords an intuitive interpretation (cf. the figure above). The matrix element
$G^{++}_{ij}$ is the amplitude for propagation between two discrete points $j$ and $i$ on the $+$-part of our
closed time contour. To get from $j$ to $i$ we may either go directly, which is possible if $i \ge j$ (the
time ordering). In this case, we pick up $i - j$ hopping amplitudes $a_+$. This is the first term in the
equation. Alternatively, we may go via round-trips through the $-$ part of the contour. In this
case, and no matter what the chronological ordering between $i$ and $j$, we first go from $j$ to $N$
($(N - j)$ amplitudes $a_+$), then proceed from 1 to $N$ on the bottom part ($(N - 1)$ amplitudes $a_-$),
go back to the upper part (a factor $\kappa$), and finally make it from 1 to $i$ ($(i - 1)$ amplitudes $a_+$).
This interprets the first order in $\kappa$ contribution to the second term. Now, the going round can
be repeated infinitely. Each additional round trip gives a factor $\kappa(a_+a_-)^{N-1}$. Summation over
all these processes generates the denominator of the equation.



The other three propagator amplitudes can be computed in the same manner. Alternatively,
one may just derive them by summing over all paths orbiting the closed contour. As a result
we obtain

$$G^{--}_{ij} = -i a_-^{\,i-j}\left(\Theta(i - j) + \frac{\kappa(a_-a_+)^{N-1}}{1 - \kappa(a_-a_+)^{N-1}}\right),$$

$$G^{+-}_{ij} = -i\,\frac{\kappa\, a_+^{\,i-1}a_-^{\,N-j}}{1 - \kappa(a_-a_+)^{N-1}},$$

$$G^{-+}_{ij} = -i\,\frac{a_+^{\,N-j}a_-^{\,i-1}}{1 - \kappa(a_+a_-)^{N-1}}.$$


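Now, let us take the continuum limit. To this end, we assign to lattice points $i$ on the
upper/lower contour a time variable (cf. Fig. 11.2)

$$+:\ t = i\epsilon, \qquad -:\ t = (N - i)\epsilon. \qquad (11.15)$$

Remembering that, in the large $N$ limit, $(a_+a_-)^N \to 1$, and noting that

$$a_+^{\,i} \to e^{-i\omega t}, \qquad a_-^{\,i} \to e^{i\omega(T - t)},$$

we obtain the propagators in a continuous time representation

$$G^{++}_{t,t'} = -i e^{-i\omega(t - t')}\left(\Theta(t - t') + n(\omega)\right),$$
$$G^{--}_{t,t'} = -i e^{-i\omega(t - t')}\left(\Theta(t' - t) + n(\omega)\right),$$
$$G^{+-}_{t,t'} = -i e^{-i\omega(t - t')}\, n(\omega),$$
$$G^{-+}_{t,t'} = -i e^{-i\omega(t - t')}\left(1 + n(\omega)\right), \qquad (11.16)$$

where $n(\omega) = \kappa/(1 - \kappa) = (e^{\beta(\omega - \mu)} - 1)^{-1}$ is the Bose distribution function.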


INFO In the literature, it is customary to denote the Green functions by the alternative notation
$G^{++} \equiv G^{T}$, $G^{--} \equiv G^{\tilde T}$, $G^{+-} \equiv G^{<}$, $G^{-+} \equiv G^{>}$. As, however, this notation is not particularly
easy to memorize, and the Green functions above will soon be replaced by different functions,
we here stick to the contour index notation.


11.2.3 Continuum field theory 


Having derived the continuum propagators, let us now cast the action in Eq. (11.12) in a
continuum form. Using Eq. (11.15), we obtain

$$Z = Z_0^{-1}\int D(\bar\psi, \psi)\, e^{\,i\int_0^T dt\,\bar\psi\,\sigma_3(i\partial_t - \omega)\psi}, \qquad (11.17)$$

where $\psi = (\psi_+, \psi_-)^T$ is a two component field in contour space and $\sigma_i$ are Pauli matrices
in that space. Equation (11.17) is our first prototype of a Keldysh functional integral.
Notice that the continuum representation of the action ignores the boundary terms in
Eq. (11.12) and specifically the information stored in the initial density operator. The proper
way to interpret the sloppiness of the compact (and customary) notation of Eq. (11.17) is
as follows: the field theory is defined by its correlation functions, or propagators. The latter
must be computed with proper boundary conditions, that is, with reference to the boundary
data stored in the density operator $\rho_0$, and the gluing procedure at temporal “infinity”
$t = T$. Although we might encapsulate this information in a boundary contribution to the
continuum action, the operationally safest way is by resorting to the discrete formulation
above. Regardless, the result of such a calculation will be the set of Green functions (11.16).
In other words, as long as we interpret these Green functions as the propagators of the action
(11.17), the boundary conditions have been taken care of. (And of course, the propagators
demonstrate that the upper and lower contour are not independent.)

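One more thing to notice is that the Green function $G$ contains a certain degree of
redundancy. Inspection of Eq. (11.16) shows, for example, that $G^{++} + G^{--} = G^{+-} + G^{-+}$.
Defining the linear transformation

$$U \equiv \frac{1}{\sqrt{2}}\begin{pmatrix} 1 & 1\\ 1 & -1\end{pmatrix}, \qquad (11.18)$$

we obtain the alternative representation of the block Green function

$$G_{t,t'} \to G'_{t,t'} \equiv UG_{t,t'}U^\dagger = -i e^{-i\omega(t - t')}\begin{pmatrix} 1 + 2n(\omega) & \Theta(t - t')\\ -\Theta(t' - t) & 0\end{pmatrix} \equiv \begin{pmatrix} G^K & G^+\\ G^- & 0\end{pmatrix}_{t,t'}, \qquad (11.19)$$

where

$$G^K_{t,t'} = -i\left(1 + 2n(\omega)\right)e^{-i\omega(t - t')}, \qquad G^\pm_{t,t'} = \mp i\,\Theta(\pm(t - t'))\,e^{-i\omega(t - t')}. \qquad (11.20)$$

The denotation $G^\pm$ indicates that we have met with the retarded and advanced Green
function of quantum mechanics, i.e.

$$G^+_{t,t'} = -i\Theta(t - t')\langle[a(t), a^\dagger(t')]\rangle,$$
$$G^-_{t,t'} = +i\Theta(t' - t)\langle[a(t), a^\dagger(t')]\rangle,$$

where $a(t) \equiv e^{iHt}ae^{-iHt} = e^{-i\omega t}a$ are the Heisenberg evolved boson operators of our theory.
The Keldysh Green function, $G^K$, is a new acquaintance. Notice that $G^K$, and only $G^K$,
does the book-keeping of the initial distribution. We obtain a better understanding of the
meaning of $G^K$ as we proceed.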
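The rotation to the retarded, advanced, and Keldysh components is easy to verify numerically. The Python sketch below is a minimal check under stated assumptions: it takes the four continuum propagators in the form $-ie^{-i\omega(t-t')}\times\{\Theta(t-t') + n,\ \Theta(t'-t) + n,\ n,\ 1 + n\}$ for the $++$, $--$, $+-$, $-+$ components, applies $G \to UGU^\dagger$ with $U = \frac{1}{\sqrt 2}\begin{pmatrix}1&1\\1&-1\end{pmatrix}$, and inspects the four entries at unequal times. The numerical values of $\omega$ and $n$ and all function names are hypothetical.

```python
import cmath

def theta(x):
    """Step function; only used at unequal times here."""
    return 1.0 if x > 0 else 0.0

def contour_green(t, tp, omega, n):
    """Continuum propagators G^{CC'}_{t,t'} for the single bosonic level."""
    phase = -1j * cmath.exp(-1j * omega * (t - tp))
    return {
        ("+", "+"): phase * (theta(t - tp) + n),
        ("-", "-"): phase * (theta(tp - t) + n),
        ("+", "-"): phase * n,
        ("-", "+"): phase * (1 + n),
    }

def rotate(g):
    """G' = U G U^dagger with U = [[1, 1], [1, -1]] / sqrt(2)."""
    pp, pm = g[("+", "+")], g[("+", "-")]
    mp, mm = g[("-", "+")], g[("-", "-")]
    return [[(pp + pm + mp + mm) / 2, (pp - pm + mp - mm) / 2],
            [(pp + pm - mp - mm) / 2, (pp - pm - mp + mm) / 2]]

omega, n_bose = 1.3, 0.4                                    # illustrative values
G_later = rotate(contour_green(0.9, 0.2, omega, n_bose))    # t > t'
G_earlier = rotate(contour_green(0.2, 0.9, omega, n_bose))  # t < t'

# Upper-left: Keldysh, upper-right: retarded, lower-left: advanced, lower-right: zero.
print(G_later)
print(G_earlier)
```

For $t > t'$ the advanced entry vanishes and for $t < t'$ the retarded one does, while the lower-right entry is zero in both cases, reflecting the redundancy $G^{++} + G^{--} = G^{+-} + G^{-+}$.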

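The correspondence $G^{CC'}_{t,t'} = -i\langle\psi^C_t\bar\psi^{C'}_{t'}\rangle$ implies that
$G'^{\,CC'}_{t,t'} = (UGU^\dagger)^{CC'}_{t,t'} = -i\langle(U\psi)^C_t(\bar\psi U^\dagger)^{C'}_{t'}\rangle$.
It is customary to denote the transformed fields as

$$\psi' \equiv U\psi = \frac{1}{\sqrt{2}}\begin{pmatrix}\psi_+ + \psi_-\\ \psi_+ - \psi_-\end{pmatrix} \equiv \begin{pmatrix}\psi_c\\ \psi_q\end{pmatrix}, \qquad (11.21)$$

where $\psi_c$ and $\psi_q$ denote the classical and quantum field, respectively. To understand the
origin of this notation, notice that the difference between classical and quantum dynamics
is that the latter is governed by the product of two independent quantum amplitudes.
It is differences between these amplitudes which make quantum dynamics distinct from
classical. This motivates calling $(\psi_+ - \psi_-)/\sqrt{2}$ a quantum field and $(\psi_+ + \psi_-)/\sqrt{2}$ a classical
field.

Footnote: Or, expressed in terms of the block Green functions, $G^\pm = G^{++} - G^{\pm\mp}$ and $G^K = G^{++} + G^{--}$.

With $\psi = U^\dagger\psi'$ and $\bar\psi = \bar\psi'U$, and noting that $U\sigma_3U^\dagger = \sigma_1$, the theory assumes the form
$Z = Z_0^{-1}\int D(\bar\psi, \psi)\, e^{iS[\bar\psi, \psi]}$ where

$$S[\psi_c, \psi_q] = \int dt\,(\bar\psi_c, \bar\psi_q)\begin{pmatrix} 0 & i\partial_t - \omega\\ i\partial_t - \omega & 0\end{pmatrix}\begin{pmatrix}\psi_c\\ \psi_q\end{pmatrix}. \qquad (11.22)$$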

This representation has two undesirable and, in fact, interrelated aspects to it: (i) given
that the exponent $iS$ is purely imaginary, one may wonder whether the functional integral is
properly convergent, and (ii) the action still does not contain explicit information on the
initial distribution. These two deficiencies can be repaired in one sweep. We first notice
that the temporal exponents in Eq. (11.19) must be understood as infinitesimally damped,
$\exp(-it\omega) \to \exp(-it\omega - \delta|t|)$, $\delta > 0$. Fourier transformation of the Green functions then
leads to

$$G^{\pm,K}(\epsilon) = \int dt\, e^{i\epsilon t}\, G^{\pm,K}_{t,0}, \qquad (11.23)$$

with

$$G^\pm(\epsilon) = \frac{1}{\epsilon - \omega \pm i\delta},$$

$$G^K(\epsilon) = -2\pi i F(\epsilon)\delta(\epsilon - \omega) = F(\epsilon)\left[G^+(\epsilon) - G^-(\epsilon)\right].$$

Here we have defined the function

$$F(\epsilon) = 1 + 2n(\epsilon) = \coth\left(\frac{\epsilon - \mu}{2T}\right). \qquad (11.24)$$

Notice that the temporal integration has been extended to infinity, in spite of the fact that
we are living on a contour $[0, T]$ of finite duration. The rationale behind this extension is
that the upper limit $T$ must be chosen large enough that the finiteness of correlation times
$\sim \delta^{-1}$ leads to finite decay before the end of the contour is reached, $T\delta \gg 1$. (The symbol $T$
denotes the temperature in Eq. (11.24) and the contour duration here.)
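Two ingredients of these formulas lend themselves to a quick numerical sanity check: the identity $1 + 2n = \coth$ and the statement that $G^+ - G^-$ becomes $-2\pi i$ times a $\delta$-function of unit weight as $\delta \to 0$. The Python sketch below is an illustration only. It assumes $G^\pm(\epsilon) = (\epsilon - \omega \pm i\delta)^{-1}$ and $n(\epsilon) = (e^{(\epsilon - \mu)/T} - 1)^{-1}$; the parameter values, the integration window, and the function names are hypothetical.

```python
import math

def bose(eps, mu, temp):
    """Bose distribution n(eps) = 1 / (exp((eps - mu)/temp) - 1)."""
    return 1.0 / math.expm1((eps - mu) / temp)

def F(eps, mu, temp):
    """Distribution function F(eps) = coth((eps - mu) / (2*temp))."""
    return 1.0 / math.tanh((eps - mu) / (2.0 * temp))

def g_ret(eps, omega, delta):
    return 1.0 / complex(eps - omega, delta)

def g_adv(eps, omega, delta):
    return 1.0 / complex(eps - omega, -delta)

def g_keldysh(eps, omega, delta, mu, temp):
    """G^K(eps) = F(eps) * (G^+(eps) - G^-(eps))."""
    return F(eps, mu, temp) * (g_ret(eps, omega, delta) - g_adv(eps, omega, delta))

omega, delta, mu, temp = 1.0, 0.05, 0.0, 0.7    # illustrative parameters

# Midpoint-rule integral of G^+ - G^- over a window of half-width L around omega.
L, steps = 200.0, 80000
d_eps = 2.0 * L / steps
spectral_integral = sum(
    g_ret(omega - L + (k + 0.5) * d_eps, omega, delta)
    - g_adv(omega - L + (k + 0.5) * d_eps, omega, delta)
    for k in range(steps)
) * d_eps

print(1 + 2 * bose(1.3, mu, temp), F(1.3, mu, temp))
print(spectral_integral, -2j * math.pi)
print(g_keldysh(omega, omega, delta, mu, temp))
```

The small residual difference in the second line comes from the Lorentzian tails cut off by the finite window, of order $4\delta/L$.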

INFO Equation (11.23) suggests two different representations of the Keldysh Green
function, both of which prove useful below. Noting that $\delta(\epsilon - \omega) = \rho(\epsilon)$ is the single particle
density of states of our one-level system, we may write

$$G^K(\epsilon) = -2\pi i F(\epsilon)\rho(\epsilon). \qquad (11.25)$$

Alternatively, we may notice that $G^K$ is an anti-hermitian operator. (Exercise: trace back the
origin of $G^K$ to the basic propagators on the Keldysh contour to convince yourself that
anti-hermiticity of $G^K$ is a general feature, not dependent on the simplistic nature of our model.)
Equation (11.23) then suggests the ansatz

$$G^K = G^+F - FG^-, \qquad (11.26)$$

where $G^\pm$ are the resolvent operators of the theory and $F$ is an hermitian operator. In the present
context, insertion of the matrix elements $G^\pm_{t,t'} = G^\pm_{t-t'} \leftrightarrow G^\pm(\epsilon)$ and $F_{t-t'} \leftrightarrow F(\epsilon)$ leads back to
Eq. (11.23). In general, we have the block structure

$$G = \begin{pmatrix} G^K & G^+\\ G^- & 0\end{pmatrix} = \begin{pmatrix} G^+F - FG^- & G^+\\ G^- & 0\end{pmatrix}. \qquad (11.27)$$

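Inserting these Fourier transforms, we obtain the energy representation of the action,

$$S[\psi_c, \psi_q] = \int\frac{d\epsilon}{2\pi}\,(\bar\psi_c, \bar\psi_q)_\epsilon\begin{pmatrix} 0 & (G^-)^{-1}\\ (G^+)^{-1} & (G^+)^{-1}F - F(G^-)^{-1}\end{pmatrix}\begin{pmatrix}\psi_c\\ \psi_q\end{pmatrix}_\epsilon, \qquad (11.28)$$

where the matrix kernel defines the inverse of the Green function (11.27). In the specific
context of our toy model where $F$ commutes with $G^\pm$,

$$S[\psi_c, \psi_q] = \int\frac{d\epsilon}{2\pi}\,(\bar\psi_c, \bar\psi_q)_\epsilon\begin{pmatrix} 0 & \epsilon - \omega - i\delta\\ \epsilon - \omega + i\delta & 2i\delta\coth\left(\frac{\epsilon - \mu}{2T}\right)\end{pmatrix}\begin{pmatrix}\psi_c\\ \psi_q\end{pmatrix}_\epsilon. \qquad (11.29)$$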


This formula makes the full significance of the $\delta$-broadening manifest:

▷ The action now contains explicit reference to the distribution function, and we have
arrived at a self-contained representation.

▷ The functional integral is convergent. (Show that the imaginary part of the action is
positive.)

▷ At first sight, keeping track of the infinitesimal $\delta$ may seem to be a book-keeping trick
to get the distribution function into our description. Remember, however, that in physical
systems, initially small convergence generating factors mostly get upgraded to finite
“self energies.” Similarly here. We will see that interactions will act to make the lower
diagonal part of the inverse Green function finite. The upper diagonal part, however, will
categorically remain zero.

▷ Mindful of the discussion in the previous chapter, one may feel reminded of the structure
of the MSRJD action. This coincidence is, of course, not accidental. In a sense discussed
below, Eq. (11.29) defines an elementary quantum generalization of an MSRJD functional
theory.


11.2.4 Generalization 

Before carrying on, it is useful to upgrade Eq. (11.29) to a functional field theory of generic
interacting Bose systems. Nowhere in the derivation have we, in fact, made exclusive
reference to the simplistic nature of the model system. This means that our prototypical field
theory Eq. (11.17) affords the generalization, $Z = Z_0^{-1}\int D(\bar\psi, \psi)\, e^{iS_0[\bar\psi,\psi] + iS_{\rm int}[\bar\psi,\psi]}$, where

$$S_0[\bar\psi, \psi] = \sum_C s_C\int dt\, d^dr\,\bar\psi_C(i\partial_t - H)\psi_C,$$

$$S_{\rm int}[\bar\psi, \psi] = -\frac{1}{2}\sum_C s_C\int dt\, d^dr\, d^dr'\,(\bar\psi_C\psi_C)(\mathbf{r}, t)\,V(\mathbf{r} - \mathbf{r}')\,(\bar\psi_C\psi_C)(\mathbf{r}', t), \qquad (11.30)$$

and $s_C$ is a sign factor assuming the values $+1/-1$ on the upper/lower contour, respectively,
$\psi = \{\psi_C(\mathbf{r}, t)\}$, $H$ is the single particle contribution and $V(\mathbf{r} - \mathbf{r}')$ represents a two-particle
interaction.

We may now introduce classical and quantum fields through the transformation
(11.21). Substitution of the transformed fields into the action leads to the representation

$$S_0[\psi_c, \psi_q] = \int dt\, d^dr\,(\bar\psi_c, \bar\psi_q)\begin{pmatrix} 0 & (G^-)^{-1}\\ (G^+)^{-1} & (G^{-1})^K\end{pmatrix}\begin{pmatrix}\psi_c\\ \psi_q\end{pmatrix},$$

$$S_{\rm int}[\psi_c, \psi_q] = -g\int dt\, d^dr\left(\bar\psi_c\bar\psi_q(\psi_c^2 + \psi_q^2) + \mathrm{c.c.}\right), \qquad (11.31)$$

where, for notational simplicity, we assumed the interaction to be contact, $V(\mathbf{r}) = 2g\delta(\mathbf{r})$,
$G^\pm = (i\partial_t \pm i\delta - H)^{-1}$ are the retarded and the advanced Green functions, and

$$(G^{-1})^K = (G^+)^{-1}F - F(G^-)^{-1}, \qquad (11.32)$$

is the Keldysh sector of the inverse of the Green function. Generalizing the discussion of
the previous section, let us now interpret the meaning of the different constituents of this
action.
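The algebra behind the rotated interaction vertex can be verified pointwise. The Python sketch below is a minimal check under stated assumptions: it takes the contact interaction in contour form as $-g(|\psi_+|^4 - |\psi_-|^4)$ per space-time point (that is, prefactor $-\frac12$, sign factor $s_C = \pm 1$, and $V = 2g\delta$), substitutes $\psi_\pm = (\psi_c \pm \psi_q)/\sqrt 2$, and compares with $-g(\bar\psi_c\bar\psi_q(\psi_c^2 + \psi_q^2) + \mathrm{c.c.})$. The field values and function names are hypothetical.

```python
import math

def contour_vertex(psi_plus, psi_minus, g):
    """-g * (|psi_+|^4 - |psi_-|^4): contact interaction with sign factor s_C."""
    return -g * (abs(psi_plus) ** 4 - abs(psi_minus) ** 4)

def rotated_vertex(psi_c, psi_q, g):
    """-g * (conj(psi_c) conj(psi_q) (psi_c^2 + psi_q^2) + c.c.)."""
    term = psi_c.conjugate() * psi_q.conjugate() * (psi_c ** 2 + psi_q ** 2)
    return -g * 2.0 * term.real

g = 0.8                                               # illustrative coupling
psi_plus, psi_minus = complex(0.3, -1.1), complex(-0.7, 0.4)
psi_c = (psi_plus + psi_minus) / math.sqrt(2)         # classical field
psi_q = (psi_plus - psi_minus) / math.sqrt(2)         # quantum field

v_contour = contour_vertex(psi_plus, psi_minus, g)
v_rotated = rotated_vertex(psi_c, psi_q, g)
v_classical_only = rotated_vertex(psi_c, 0j, g)       # vanishes for psi_q = 0
print(v_contour, v_rotated, v_classical_only)
```

The agreement rests on the factorization $|\psi_+|^4 - |\psi_-|^4 = (\bar\psi_c\psi_q + \bar\psi_q\psi_c)(|\psi_c|^2 + |\psi_q|^2)$.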


Retarded and advanced Green function 

The infinitesimal increments in Eq. (11.31) signify that the retarded and advanced Green
functions obey the causality constraint $G^\pm(t, t') \propto \Theta(\pm(t - t'))$.¹¹ For a time-independent
Hamiltonian,

$$G^\pm(t, t') \equiv G^\pm(t - t') = \mp i\,\Theta(\pm(t - t'))\,e^{-iH(t - t')},$$

$$G^\pm(\omega) = \frac{1}{\omega \pm i\delta - H}. \qquad (11.33)$$

If $H$ contains explicit time-dependence, we need to include a time ordering procedure,

$$G^\pm(t, t') = \mp i\,\Theta(\pm(t - t'))\,T_\pm\, e^{-i\int_{t'}^{t} dt''\, H(t'')},$$

where $T_\pm$ time orders in chronological/anti-chronological direction. Finally, the Green
functions obey the composition law $G^\pm(t, t') = \pm i\,G^\pm(t, t'')G^\pm(t'', t')$ for $t''$ between $t$ and $t'$, and the
limit behavior $G^+(t, t) + G^-(t, t) = 0$.


11 In the discrete representation used in the previous chapter, causality emerged as a consequence of $(G^+)^{-1}$ being
a matrix containing one side diagonal below the main diagonal. This meant that $G^+$ was a lower triangular
matrix. Within the continuum formalism, we use imaginary increments to keep track of causality.

Keldysh Green function

First notice a subtlety in the notation: $(G^{-1})^K$ is the Keldysh block of the inverse of the
Green function, but not the inverse of the Keldysh Green function, $(G^K)^{-1}$. As discussed
in the previous section, the anti-hermiticity of $G^K$ motivates the ansatz (11.32), with an
a priori undetermined hermitian matrix $F$. This object plays a crucial role in the theory; it
is responsible for the book-keeping on the state, or “distribution” of our system.

The task of determining $F$ gets drastically simplified by the ansatz

$$F = \{F(t, t')\},$$

depending on time arguments, but independent of the Hilbert space indices of the problem
($\mathbf{r}$ in our current continuum field theory).

INFO To motivate this ansatz for $F$, we recall its origins in the non-interacting sector of the
theory (cf. Eq. (11.25)) and interpret $F$ as a generalized (quasi-)particle distribution function. As
long as interactions and external time dependencies do not play a role, it is natural to assume that
$F(\epsilon) = F(\epsilon, H)$ depends on the Hamiltonian of the system. The straightforward generalization of
Eq. (11.25) to a higher dimensional Hilbert space, $G^K(\epsilon) = -2\pi i F(H, \epsilon)\delta(\epsilon - H)$, then suggests
the eigenfunction representation $G^K(\epsilon, \omega_a) = -2\pi i F(\epsilon, \omega_a)\delta(\epsilon - \omega_a)$. The important point here
is that the spectral $\delta$-function locks the eigenenergies $\omega_a$ to the energy argument. Since $F$ always
appears in combination with the spectral function, dropping its Hilbert space index,
$F(\epsilon, \omega_a) \to F(\epsilon)$, does not imply a loss of information.


In a time-independent context, $F(t, t') = F(t - t')$ and the Fourier transform $F(\epsilon)$ signifies
the occupation of quasi-particles at energy $\epsilon$. (For instance, $F(\epsilon) = \coth(\epsilon/2T) = 1 +
2n(\epsilon)$ under equilibrium conditions, where $n(\epsilon)$ is the Bose distribution function.) In a
time-dependent environment, it is usually a good idea to represent $F$ by its Wigner transform,

$$F(\epsilon, t) = \int dt'\, e^{i\epsilon t'} F(t + t'/2,\, t - t'/2). \qquad (11.34)$$

The function $F(\,\cdot\,, t)$ then represents the instantaneous particle distribution function at time
$t$ (as an exercise, think why!).

We finally note that retarded, advanced, and Keldysh Green function are obtained by
computing the correlator

$$-i\langle\psi_\alpha(\mathbf{r}, t)\bar\psi_{\alpha'}(\mathbf{r}', t')\rangle = \begin{pmatrix} G^K & G^+\\ G^- & 0\end{pmatrix}_{\alpha\alpha'}(\mathbf{r}, t; \mathbf{r}', t'), \qquad \alpha, \alpha' = c, q. \qquad (11.35)$$

Interaction 

There is not much we can say about the interaction vertex at this stage in generality.
Notice, however, that the interaction vanishes for $\psi_q = 0$, which is a manifestation of the
vanishing of the action of purely classical fields.¹² In later perturbative applications, it will
be convenient to represent the interaction by a graphical code (see Fig. 11.3). Below we
apply this language to explore the influence of interactions on the non-interacting sector of
the theory.

12 The particular two body interaction considered here also vanishes on a pure quantum configuration. However,
this vanishing does not reflect a general structure.

[Figure 11.3 panel labels: fields $\psi_c$, $\psi_q$; propagators $G^+ = -i\langle\psi_c\bar\psi_q\rangle$, $G^- = -i\langle\psi_q\bar\psi_c\rangle$, $G^K = -i\langle\psi_c\bar\psi_c\rangle$.]

Figure 11.3 Building blocks of bosonic Keldysh diagrammatic perturbation theory. Top: field
vertices, middle: propagators, bottom: interaction vertices.


11.2.5 Fluctuation dissipation theorem 

In the case of thermal equilibrium, the Keldysh Green function is fully determined by the
retarded and advanced Green functions, $G^K = G^+F - FG^-$. In thermal equilibrium, the
commutativity of $F = 1 + 2n = \coth(\epsilon/2T)$ with $G^\pm$ implies

$$G^K(\epsilon) = \coth\left(\frac{\epsilon}{2T}\right)\left(G^+(\epsilon) - G^-(\epsilon)\right). \qquad (11.36)$$

The form of this dependence is reminiscent of the quantum fluctuation dissipation
theorem reviewed in the previous chapter.

To make the connection to the FDT explicit, let us proceed as in Section 10.6.4 and discuss
the correlation functions $G^K$ and $G^\pm$ in terms of their formal Lehmann representation.
Beginning with the Keldysh Green function, we turn back to a representation in terms of
contour fields and have

$$G^K(t, 0) = -i\langle\psi_c(t)\bar\psi_c(0)\rangle = -\frac{i}{2}\left\langle(\psi_+(t) + \psi_-(t))(\bar\psi_+(0) + \bar\psi_-(0))\right\rangle,$$

where we have omitted the Hilbert space indices of the fields for notational clarity. Now,
interpret a correlation function

$$\langle\psi_C(t)\bar\psi_{C'}(0)\rangle \leftrightarrow \langle T_\gamma\, a_C(t)a^\dagger_{C'}(0)\rangle,$$

as the coherent state functional integral representation of a contour ordered operator
expectation value. In the latter expression, the angular brackets represent the equilibrium thermal
expectation value $\langle\ldots\rangle = \mathrm{tr}((\ldots)\rho_0)$, and the combination $(C, t)$ of the contour subscript
and the time argument fixes the position of the operator on the closed time contour. We
next employ the Heisenberg representation of the operators as

$$a_C(t) = e^{-i\int_{(C,t)}^{-t_0} dt\, H(t)}\; a\; e^{-i\int_{-t_0}^{(C,t)} dt\, H(t)},$$

where the time integrals in the exponents are along the time contour, and we have dropped
the contour time ordering operator $T_\gamma$, $T_\gamma e^{-i\int_{-t_0}^{(C,t)} dt\, H(t)} = e^{-i\int_{-t_0}^{(C,t)} dt\, H(t)}$, on account of the
presumed time-independence of $H$. (Exercise: Consider why this convention conforms with
the definition of the coherent state field integral, and the contour conventions of the Keldysh
approach in general.) If $(C, t)$ lies on a later point on the contour than $(C', 0)$, this
representation yields

$$\langle a_C(t)a^\dagger_{C'}(0)\rangle = \left\langle e^{-i\int_{(C,t)}^{-t_0} dt\, H(t)}\; a\; e^{-i\int_{(C',0)}^{(C,t)} dt\, H(t)}\; a^\dagger\; e^{-i\int_{-t_0}^{(C',0)} dt\, H(t)}\right\rangle.$$

We may now insert complete sets of eigenfunctions of $H$, $\{|\alpha\rangle\}$, to obtain the Lehmann
representation of these expectation values. For example, for $C = +$, $C' = -$, $t < 0$, we
obtain (cf. the figure)

$$\langle a^\dagger_-(0)a_+(t)\rangle = Z_0^{-1}\rho_\beta|X_{\alpha\beta}|^2 e^{i\Xi_{\alpha\beta}t},$$

where $X_{\alpha\beta} = \langle\alpha|a|\beta\rangle$, $\Xi_\alpha = E_\alpha - \mu N_\alpha$, $\Xi_{\alpha\beta} = \Xi_\alpha - \Xi_\beta$, the thermal weighting factor
$\rho_\beta = e^{-\beta\Xi_\beta}$ (the $\beta$ in the exponent being the inverse temperature), and a summation over
$\alpha, \beta$ is implied. Treating all other combinations in the same manner, we find

$$G^K(t, 0) = -iZ_0^{-1}|X_{\alpha\beta}|^2 e^{i\Xi_{\alpha\beta}t}(\rho_\alpha + \rho_\beta),$$

which Fourier transforms to

$$G^K(\epsilon) = -2\pi i Z_0^{-1}|X_{\alpha\beta}|^2\delta(\epsilon + \Xi_{\alpha\beta})(\rho_\alpha + \rho_\beta).$$

Comparison with the expression above for the time ordered correlation function $G^T(t, 0) =
-i\langle T_t\, a(t)a^\dagger(0)\rangle$ leads us to the important conclusion,

$$G^K(\epsilon) = 2i\,\mathrm{Im}\,G^T(\epsilon). \qquad (11.37)$$


The Keldysh Green function is equal to $2i$ times the imaginary part of the time
ordered correlation function.


In a similar manner, the Lehmann representations of the Green functions $G^\pm$ can be
analyzed to confirm that the latter conform with the general definition of retarded and advanced
response functions. These identifications show that Eq. (11.37) is but a manifestation of the
quantum fluctuation dissipation theorem.

11.2.6 Classical limit I


Readers who went through the previous chapter may have noticed the striking struc- 
tural similarity between the Keldysh diagram language and the graphical code used in 
Section 10.5.1 to analyze the MSRJD functional integral. More generally, the block- 
triangular form of the Green functions, containing retarded and advanced blocks, in con- 
junction with distribution functions, is reminiscent of the structure of the MSRJD theory. In 
this section, we explore the classical limit of the theory within the framework of a stationary 
phase analysis. A more comprehensive discussion of the connection to MSRJD theory is 
given in Section 11.3. 

Prior to taking semiclassical limits, we should let $\hbar$ re-enter the theory. Remembering the
origin of the action as a contour integrated quantum phase, we set $iS \to \frac{i}{\hbar} S$. In the limit
$\hbar \to 0$, the functional will then be governed by the stationary configurations of its action,

\[ \frac{\delta S[\psi_c, \psi_q]}{\delta \bar\psi_q} = \frac{\delta S[\psi_c, \psi_q]}{\delta \bar\psi_c} = 0. \qquad (11.38) \]

These equations may have solutions for arbitrary $(\psi_c, \psi_q)$ about which nothing much can
be said in general terms. But there is always one that reflects the classical physics of the
problem. Recalling that an expansion of the action in $\psi_q$ starts at linear order (the purely
classical action $S[\psi_c, 0] = 0$), we identify this classical saddle point by the equation,

\[ \text{classical saddle point:} \qquad \left. \frac{\delta S[\psi_c, \psi_q]}{\delta \bar\psi_q} \right|_{\psi_q = 0} = 0. \qquad (11.39) \]


This saddle point is “classical” in that it is defined on the contour $\psi_q = \psi_+ - \psi_- = 0$, i.e. in a
limit where the time evolution becomes classical. Specifically, for the interacting Bose action
(11.30), the saddle point equation assumes the form

\[ \left( i\partial_t - H - V |\psi_c|^2 \right) \psi_c = 0. \qquad (11.40) \]

This is the celebrated Gross-Pitaevskii equation, a non-linear Schrödinger equation describing
the classical physics of the evolution of a bosonic order parameter amplitude $\psi_c$ in
self-interaction with its own density $\sim |\psi_c|^2$.

INFO We may think of the Gross-Pitaevskii equation (GPE) as the time-dependent
Ginzburg-Landau equation of the transition into a superfluid state. Alternatively, it may be
interpreted as a nonlinear Schrödinger equation describing the wavefunction of a Bose-Einstein
condensate. Inhomogeneous solutions of this equation describe collective excitations of the
condensate field.
By way of example, let us consider the case of a one-dimensional spatially homogeneous system,
$H \to -\frac{\hbar^2}{2m}\partial_x^2 - \mu$, where we have added a chemical potential term to $H$. Passing to the “energy
representation” of the equation, $i\hbar\partial_t \to E$, we obtain

\[ \left( E + \frac{\hbar^2}{2m}\partial_x^2 + \mu - V|\psi_c|^2 \right) \psi_c = 0. \]

The solution of lowest energy, $E = 0$, is the spatially homogeneous condensate amplitude (cf.
the discussion in Section 6.3)

\[ |\psi_c| = \sqrt{\frac{\mu}{V}}. \]

However, the GPE also contains information on spatially inhomogeneous excitations of the
condensate. For example, it is straightforward to verify that, for any $x_0$, the function

\[ \psi_c = \psi_0 \tanh\left( \frac{x - x_0}{\xi} \right) \qquad (11.41) \]

solves the GPE (cf. the figure above). Here, $\psi_0$ is a complex amplitude with $|\psi_0|^2 = \mu/V$, and
$\xi = \hbar/\sqrt{m\mu}$ is the coherence length. The wave function (11.41) is called a dark soliton. The
attribute “soliton” hints at the topological nature of the solution. The sign change at its origin
$x_0$ means that the phase of the soliton changes by $\pi$. This jump cannot be removed by continuous
deformation, and in this sense the soliton is a topological defect.
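
The claim that the tanh profile solves the stationary GPE can be checked numerically. The following is a minimal sketch under stated assumptions: it takes the $E = 0$ equation in the form $\frac{\hbar^2}{2m}\partial_x^2\psi + \mu\psi - V|\psi|^2\psi = 0$ with $|\psi_0|^2 = \mu/V$ and width $\xi = \hbar/\sqrt{m\mu}$, and uses a real amplitude $\psi_0$. The function name, the parameter values, and the `xi_scale` knob are hypothetical and serve only to show that a wrong width fails the test.

```python
import math


def gpe_residual(x, x0=0.0, mu=1.0, V=0.5, m=1.0, hbar=1.0, xi_scale=1.0, h=1e-3):
    """Residual of the E = 0 GPE for psi(x) = psi0 * tanh((x - x0) / xi).

    The second derivative is approximated by a central finite difference
    with step h. xi_scale = 1 corresponds to xi = hbar / sqrt(m * mu).
    """
    xi = xi_scale * hbar / math.sqrt(m * mu)
    psi0 = math.sqrt(mu / V)

    def psi(y):
        return psi0 * math.tanh((y - x0) / xi)

    d2psi = (psi(x + h) - 2.0 * psi(x) + psi(x - h)) / h ** 2
    return hbar ** 2 / (2.0 * m) * d2psi + mu * psi(x) - V * abs(psi(x)) ** 2 * psi(x)


if __name__ == "__main__":
    for x in (-3.0, -0.5, 0.0, 0.8, 4.0):
        print(x, gpe_residual(x), gpe_residual(x, xi_scale=2.0))
```

With `xi_scale=1.0` the residual vanishes up to discretization error, while a profile of the wrong width leaves a residual of order one.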


EXERCISE For a system with attractive interactions, V < 0, try to identify the bright soliton. 


One may then expand around the “classical” saddle point to account for quantum and clas- 
sical fluctuations. However, we find it more instructive to discuss these types of fluctuations 
in a slightly different context below. 

This concludes our preliminary discussion of the generalities of the (bosonic) Keldysh 
formalism. Before carrying on, it may be worthwhile to recapitulate once more a few struc- 
tures of the theory: 


▷ We start at large negative times with a non-interacting system whose state is described
by some initial distribution $F_0$. In this non-interacting limit, the Keldysh sector of the
inverse Green function, $(G^{-1})^K$, is infinitesimally small, and related to the retarded and
advanced sector by the FDT.

▷ As interactions are switched on, the system will aim to identify its new distribution. The
initial proportionality of $(G^{-1})^K$ to the initial imaginary damping terms $i\delta$ suggests that
the Keldysh sector of the inverse Green function will no longer be infinitesimally small;
the emergence of finite “self energies”, $i\delta \to i\,\mathrm{Im}\,\Sigma$, will render it finite. Put differently,
the self energies of the theory, and in particular the Keldysh component of the self-energy,
are expected to contain essential information on the state of the system.

In the following, we explore the interaction-mediated formation of distribution functions
using an instructive and important example.
11.3 Particle coupled to an environment


Let us go back to the general paradigm formulated in Section 11.1.1, a quantum system 
coupled to a bath. For concreteness, we will assume the system to be a single quantum par- 
ticle, while the bath is an ensemble of harmonic oscillators. In the particular case where the 
dynamics of the particle is harmonic, this gets us back to the oscillator-oscillator coupling 
studied in Section 11.1.2. 


Keldysh theory of a quantum particle 

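Consider the quantum mechanics of a single point particle in one-dimensional space. In the
distant past, we prepared the particle in some state (pure or mixed, it does not matter) and
then let it evolve under the dynamics of its Hamiltonian operator $\hat H = \frac{\hat p^2}{2m} + V(\hat q)$. This
situation can be described by an obvious modification of our Keldysh functional theory.
All we need to do is encapsulate the initial value information in a density operator $\hat\rho_0$, and
replace the coherent state field integral by a Feynman path integral. As a result, we obtain
the Keldysh partition function

\[ \begin{aligned}
\mathcal{Z}_s &= Z_0^{-1} \int D(q,p)\, e^{i \sum_c s_c \int dt\, \left( p_c \partial_t q_c - H(q_c, p_c) \right)} \\
&= Z_0^{-1} \int Dq\, e^{i \sum_c s_c \int dt\, \left( \frac{m}{2} (\partial_t q_c)^2 - V(q_c) \right)} \\
&= Z_0^{-1} \int Dq\, e^{i \int dt\, \left( -m\, q_q \partial_t^2 q_c - V\left( \frac{q_c + q_q}{\sqrt 2} \right) + V\left( \frac{q_c - q_q}{\sqrt 2} \right) \right)}. \qquad (11.42)
\end{aligned} \]

In the first two lines, $c = \pm$ labels the two branches of the contour with $s_\pm = \pm 1$; in the last
line, we have passed to the classical and quantum components, $q_\pm = (q_c \pm q_q)/\sqrt 2$.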

To explore the classical limit of this expression, we re-introduce $\hbar$ as $iS \to i\hbar^{-1} S$. Alluding
to the discussion of the quantum nature of configuration differences $q_+ - q_- \sim q_q$ above,
we also rescale $q_q \to \hbar q_q$. A first order expansion of the action in $\hbar q_q$ then identifies the
$\hbar$-independent (classical) sector of the action

\[ S_c[q_c, q_q] = \int dt\, q_q \left( -m \partial_t^2 q_c - \sqrt 2\, V'(q_c/\sqrt 2) \right). \]

Integrating over the quantum component, we obtain the constraint $-m\partial_t^2 q_c - \sqrt 2\, V'(q_c/\sqrt 2) = 0$
or, upon rescaling coordinates $q_c/\sqrt 2 = q$, Newton's equations of motion

\[ m \partial_t^2 q = -\partial_q V(q). \qquad (11.43) \]
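
The first order expansion used above can be checked numerically. The sketch below is an illustration under assumptions: the quartic test potential and the function names are hypothetical, and the rotation $q_\pm = (q_c \pm q_q)/\sqrt 2$ is assumed. It compares the difference of potential terms on the two contour branches, $-V(q_+) + V(q_-)$, with its linearization $-\sqrt 2\, q_q V'(q_c/\sqrt 2)$.

```python
import math

SQRT2 = math.sqrt(2.0)


def V(q):
    """Illustrative double-well potential (an arbitrary test choice)."""
    return 0.25 * q ** 4 - 0.5 * q ** 2


def dV(q):
    """Derivative V'(q) of the test potential."""
    return q ** 3 - q


def potential_difference(qc, qq):
    """Exact potential part of the Keldysh action density: -V(q_+) + V(q_-)."""
    return -V((qc + qq) / SQRT2) + V((qc - qq) / SQRT2)


def linearized(qc, qq):
    """First order term in the quantum component q_q."""
    return -SQRT2 * qq * dV(qc / SQRT2)


if __name__ == "__main__":
    qc = 0.7
    for qq in (1e-1, 1e-2, 1e-3):
        print(qq, potential_difference(qc, qq) - linearized(qc, qq))
```

Because the difference is odd in $q_q$, the printed error shrinks like $q_q^3$, consistent with the statement that the classical sector is linear in the quantum field.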

But of course, the formalism can do much more than recover deterministic classical motion. 
This is seen the moment we couple the system to its bath: 

Coupling to an oscillator bath 

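The bath as such is described by a partition function such as Eq. (11.31). Specifically,
we assume the bath degrees of freedom to be non-interacting, $V = 0$, and oscillator-like,
$H = \sum_k \omega_k \bar\psi_k \psi_k$, with a wave-like dispersion $\omega_k = c|k|$. Thus,

\[ S_b[\bar\psi, \psi] = \sum_k \int \frac{d\omega}{2\pi}\, \bar\psi_{k,\omega}\, G^{-1}_{k,\omega}\, \psi_{k,\omega}, \qquad (11.44) \]

with $\psi = (\psi_c, \psi_q)^T$, and the oscillator Green function,

\[ G^{-1}_{k,\omega} = \begin{pmatrix} 0 & \omega - i\delta - \omega_k \\ \omega + i\delta - \omega_k & 2i\delta \coth(\omega/2T) \end{pmatrix}. \]

The coupling between system and bath is modelled by the action

\[ \begin{aligned}
S_{sb}[\bar\psi, \psi, q] &= \int dt \sum_k \sum_c s_c\, q_c(t) \left( \gamma_k \psi_{c,k}(t) + \mathrm{c.c.} \right) = \int dt \sum_k \left( q^T(t)\, \sigma_1 \gamma_k \psi_k(t) + \mathrm{c.c.} \right) \\
&= \sum_k \int \frac{d\omega}{2\pi} \left( \gamma_k\, q^T_\omega \sigma_1 \psi_{k,-\omega} + \bar\gamma_k\, \bar\psi_{k,\omega} \sigma_1 q_{-\omega} \right), \qquad (11.45)
\end{aligned} \]

where $\gamma_k$ is a set of complex coupling constants, and in the second equality we switched to
the Keldysh representation $q^T = (q_c, q_q)$.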


Integration over oscillator modes 

At this stage, the oscillators may be integrated out. Using the fact that $-i\langle \psi \bar\psi^T \rangle = G$ (cf.
Eq. (11.35)), we obtain the effective system action, $S_{\rm eff} = S_s + S_{\rm diss}$, where the dissipative
action is given by

\[ S_{\rm diss}[q] = -\sum_k \int \frac{d\omega}{2\pi}\, |\gamma_k|^2\, q^T_\omega\, \sigma_1 G_{k,\omega} \sigma_1\, q_{-\omega}, \]

and the Green function has the form

\[ G_{k,\omega} = \begin{pmatrix} -2\pi i \coth\left( \frac{\omega}{2T} \right) \delta(\omega - \omega_k) & (\omega + i\delta - \omega_k)^{-1} \\ (\omega - i\delta - \omega_k)^{-1} & 0 \end{pmatrix}. \]

To make progress with this expression, we assume that the coupling constant exhibits power
law dependence on $|k|$,

\[ |\gamma_k| = \lambda |k|^\alpha, \]

where $\lambda > 0$ is a real coefficient. Noting that the Green function enters in a frequency-symmetrized
form, $\sum_\omega q_\omega G_\omega q_{-\omega} = \frac12 \sum_\omega q_\omega (G_\omega + G_{-\omega}) q_{-\omega}$, the summation over the $k$-modes
then produces the expression (exercise)

\[ S_{\rm diss}[q] = \int \frac{d\omega}{2\pi}\, q^T_{-\omega}\, K_\omega\, q_\omega, \qquad (11.46) \]

where the dissipation kernel $K$ is given by

\[ K_\omega = \pi \nu \lambda^2 \omega^{2\alpha} \begin{pmatrix} 0 & -i \\ i & 2i \coth\left( \frac{\omega}{2T} \right) \end{pmatrix}. \qquad (11.47) \]

Here we have introduced the density of modes, $\nu = \sum_k \delta(\omega - \omega_k)$, and omitted a
frequency-independent contribution to $K$ (which can be absorbed in the chemical potential). We may
think of $K_\omega$ as a “self-energy” of the particle due to its coupling to the oscillator bath. The
frequency dependence of $K_\omega$ signals that $S_{\rm diss}$ is a time-nonlocal contribution to the action.
The temporal profile of the dissipation kernel depends on the coupling $\gamma$. In the following,
we concentrate on the case of Ohmic dissipation, where $\alpha = 1/2$ and

\[ K_\omega = g \begin{pmatrix} 0 & -i\omega \\ i\omega & 2i\omega \coth\left( \frac{\omega}{2T} \right) \end{pmatrix}, \qquad (11.48) \]

with $g \equiv \pi\nu\lambda^2$.


INFO The power law $\alpha = 1/2$ is not quite as arbitrary as it may seem. Specifically, the choice
$\gamma_k = i\lambda\, \mathrm{sgn}(k) \sqrt{|k|}$ describes the frequent situation of dissipation due to coupling to an elastic
medium. To see this, recall the representation of the coherent state amplitudes in terms of
oscillator coordinates and momenta,

\[ \psi_k = \frac{1}{\sqrt 2} \left( \sqrt{\omega_k}\, q_k + \frac{i}{\sqrt{\omega_k}}\, p_k \right). \]

Substitution into the coupling term generates the expression

\[ \sum_k (\gamma_k \psi_k + \mathrm{c.c.}) \sim \sum_k \mathrm{sgn}(k) \sqrt{|k|} \left( i\sqrt{\omega_k}\, q_k - \frac{1}{\sqrt{\omega_k}}\, p_k + \mathrm{c.c.} \right) \sim \partial_x q(x) \big|_{x=0}, \]

where, noting that functions $q(x)$ and $p(x)$ are real, we have made use of the identities $\bar q_k = q_{-k}$
and $\bar p_k = p_{-k}$. The last term describes the local coupling of the system coordinate to the stress
$\partial_x q(x)$ acting on an elastic string. In many dissipative environments, macroscopic degrees of
freedom are coupled to approximately harmonic microscopic modes in this way. Prominent examples
include the coupling of the electric field applied to a resistor to harmonic electromagnetic modes
in the medium. This coupling (cf. our discussion of Johnson noise in the previous chapter) is
responsible for the denotation “Ohmic” dissipation.


Langevin equation 

We next explore the classical limit of the effective theory of the particle coupled to the bath.
To this end, we let $\hbar$ enter at three different places in the functional,

\[ \begin{aligned}
&\text{(i)}: && iS \to \frac{i}{\hbar} S, \\
&\text{(ii)}: && \coth(\omega/2T) \to \coth(\hbar\omega/2T) \to \frac{2T}{\hbar\omega}, \\
&\text{(iii)}: && q_q \to \hbar q_q. \qquad (11.49)
\end{aligned} \]

In the semiclassical limit, and at fixed characteristic frequency $\omega$, the “coth” can then be
linearized as indicated in (ii). The extra power of $\hbar$ appearing in the denominator of this
expression compensates for the additional power in $\hbar$ due to two factors $q_q$ in the
quantum-quantum sector of the action. Summarizing, the effective classical action is given by

\[ S_c[q_c, q_q] = \int dt \left[ q_q \left( -m\partial_t^2 q_c - \sqrt 2\, V'\left( \frac{q_c}{\sqrt 2} \right) - 2g\, \partial_t q_c \right) + 4igT q_q^2 \right]. \]

This has the characteristic structure of an MSRJD action. Comparison with the discussion
of MSRJD functionals in the previous chapter shows that the (scaled) quantum field $2g q_q$
plays a role identical to the MSRJD “momentum.” Specifically, a “Langevin” equation¹³
can be derived by Hubbard-Stratonovich decoupling of the $q_q^2$ term:

\[ S_c[q_c, q_q] \to S_c[q_c, q_q, \xi] = \int dt \left[ q_q \left( -m\partial_t^2 q_c - \sqrt 2\, V'\left( \frac{q_c}{\sqrt 2} \right) - 2g\, \partial_t q_c + \sqrt 2\, \xi \right) + \frac{i}{8gT} \xi^2 \right]. \]

Introducing the rescaled variable $q = q_c/\sqrt 2$ and integrating over $q_q$, we obtain the Langevin
equation

\[ m\partial_t^2 q + V'(q) + 2g\, \partial_t q = \xi, \qquad (11.50) \]

where the noise correlator

\[ \langle \xi(t) \xi(t') \rangle = 4Tg\, \delta(t - t') \qquad (11.51) \]

conforms with the FDT. To summarize, the coupling of the system to a bath renders the
classical limit noisy. The effective equation of motion assumes the form of a Langevin
equation, at a noise level that will drive the particle to an equilibrium distribution at the
effective temperature of the bath.
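
The statement that noise of strength $4Tg$ drives the particle to thermal equilibrium can be illustrated by direct simulation. The following is a rough sketch, not a production integrator: it assumes a harmonic potential $V(q) = kq^2/2$ and a simple Euler-Maruyama discretization, and all names and parameter values are hypothetical. It integrates Eq. (11.50) with the correlator (11.51) and estimates the kinetic and potential energy averages, which equipartition predicts to be $m\langle \dot q^2 \rangle = k\langle q^2 \rangle = T$.

```python
import math
import random


def simulate_langevin(T=1.0, g=0.5, m=1.0, k=1.0, dt=0.01, steps=300_000, seed=0):
    """Integrate m q'' + k q + 2 g q' = xi with <xi xi> = 4 g T delta(t - t').

    Returns (m <v^2>, k <q^2>), time-averaged after a short burn-in.
    """
    rng = random.Random(seed)
    q, v = 0.0, 0.0
    # Integral of the white noise over one step has variance 4 g T dt.
    noise_amp = math.sqrt(4.0 * g * T * dt)
    burn_in = steps // 10
    sum_v2 = sum_q2 = 0.0
    count = 0
    for step in range(steps):
        force = -k * q - 2.0 * g * v
        v += (force * dt + noise_amp * rng.gauss(0.0, 1.0)) / m
        q += v * dt
        if step >= burn_in:
            sum_v2 += v * v
            sum_q2 += q * q
            count += 1
    return m * sum_v2 / count, k * sum_q2 / count


if __name__ == "__main__":
    kinetic, potential = simulate_langevin()
    print("m<v^2> =", kinetic, " k<q^2> =", potential, " (expected: T = 1.0)")
```

Both averages should come out close to $T$, up to statistical and time-step errors of a few percent. Changing the prefactor of the noise variance away from $4gT$ shifts the stationary averages accordingly, which is the content of the FDT in this setting.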

INFO Notice that the present theory solves a problem that plagued the purely classical
theory discussed in the previous chapter. Discussing dissipative damping of voltage fluctuations
in a resistor network, we had seen that Johnson noise is problematic inasmuch as the white noise
correlator $\langle \xi(t) \xi(t') \rangle = 2RT\delta(t - t')$ will predict singularly strong noise levels $\langle \xi(t)^2 \rangle$. We had
cited the expectation that these singularities will be cut off if we take into account the fact that
the actual coupling is to a bath of quantum, rather than classical oscillators.

This expectation is confirmed by our present theory. Comparison with Eq. (11.48) and the
subsequent steps of the derivation shows that the frequency representation of the actual noise
correlator reads

\[ \langle \xi_\omega \xi_{\omega'} \rangle = 2\pi\delta(\omega + \omega')\, 2g\hbar\omega \coth\left( \frac{\hbar\omega}{2T} \right). \]

In the classical limit $\hbar \to 0$, this reduces to the white noise limit $4gT\, 2\pi\delta(\omega + \omega')$, which Fourier
transforms to Eq. (11.51). However, at large frequencies $\hbar\omega \gg T$, we obtain
$\langle \xi_\omega \xi_{\omega'} \rangle \simeq 2\pi\delta(\omega + \omega')\, 2g\hbar|\omega|$.
At first sight, it seems that the situation has become worse: the increase $\sim |\omega|$ implies ultraviolet
behavior (short time correlations) even more malicious than in the white noise case. The
resolution to this problem lies in a more careful interpretation of what is actually meant by the
expression $\langle \xi_t \xi_{t'} \rangle$. The decomposition $\omega \coth(\hbar\omega/2T) \sim (1 + n(\omega)) + n(\omega)$ suggests that the voltage
fluctuations are due to absorption from ($n$) and emission into ($n + 1$) the field of quantal resonator
modes of the oscillator system. Specifically, there is response to its vacuum fluctuations (the “1”
in $1 + n$). At low temperatures, $T/\hbar\omega \to 0$, it is these fluctuations that cause our ultraviolet
problems. On the other hand, it seems implausible that vacuum fluctuations will be probed in
an actual measurement of noisy voltage fluctuations in a resistor. (The vacuum is a definite
state, while the generation of fluctuations requires real transitions in the environment.) Closer
inspection (cf., e.g., the pedagogical article G.B. Lesovik and R. Loosen, On the detection of
finite-frequency current fluctuations, JETP Lett. 65, 295-99 (1997)) shows that (i) the expression
$\langle \xi_t \xi_{t'} \rangle \sim \frac12 \langle \hat V_t \hat V_{t'} + \hat V_{t'} \hat V_t \rangle$ corresponds to a symmetrized correlation function of the quantum
operators representing voltage fluctuations (exercise: to make this statement plausible, interpret
$\langle \xi_t \xi_{t'} \rangle \sim G^K$ as a Keldysh correlation function and use the connection between $G^K$ and time
ordered correlation functions discussed in Section 11.2.4). However, (ii) a concrete measurement
will not correspond to this ordering. Specifically, it will not couple to the fluctuations of the
vacuum. Once the vacuum contribution is removed, the ultraviolet problems disappear. For further
discussion of this point, we refer to the literature.

13 We put quotes around “Langevin” because the stochastic differential equations derived here contain second
order time derivatives stemming from the acceleration terms in Newton's equations. (We are not working in
the purely dissipative, overdamped limit.) The term “Langevin equation” is commonly reserved for first order
equations, but this difference is of no relevance to the present discussion.
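
The two limits of the quantum noise spectrum discussed in the INFO block can be made concrete with a few lines of code. This is a sketch under an explicit assumption: the spectral weight is taken to be $S(\omega) = 2g\hbar\omega \coth(\hbar\omega/2T)$, i.e. the Ohmic kernel with $\hbar$ restored. The function names and default parameters are illustrative. The code also checks the decomposition $\coth(\hbar\omega/2T) = (1 + n(\omega)) + n(\omega)$ into emission and absorption weights, with $n$ the Bose function.

```python
import math


def bose(omega, T, hbar=1.0):
    """Bose distribution n(omega) = 1 / (exp(hbar * omega / T) - 1)."""
    return 1.0 / math.expm1(hbar * omega / T)


def noise_spectrum(omega, T, g=1.0, hbar=1.0):
    """Assumed spectral weight S(omega) = 2 g hbar omega coth(hbar omega / 2T)."""
    if omega == 0.0:
        return 4.0 * g * T  # limiting value of x * coth(x) as x -> 0
    return 2.0 * g * hbar * omega / math.tanh(hbar * omega / (2.0 * T))


if __name__ == "__main__":
    T = 1.0
    for omega in (1e-3, 0.5, 5.0, 50.0):
        print(omega, noise_spectrum(omega, T), 4.0 * T, 2.0 * abs(omega))
```

For $\hbar\omega \ll T$ the first printed value approaches the white noise level $4gT$, while for $\hbar\omega \gg T$ it approaches $2g\hbar|\omega|$, the vacuum-dominated growth that causes the ultraviolet problem described above.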

This concludes our discussion of the backbones of Keldysh field theory. We have intro- 
duced its fundamental generating functional, discussed the meaning of the propagators, and 
exposed the connections to the classical theories discussed in the previous chapter. However, 
to make the picture complete, we need to introduce the fermion variant of the Keldysh 
functional. This will be as necessary as it is uninspiring. As one may expect, the formal 
differences from the bosonic theory amount to a few sign factors here and there. For this 
reason, we will try to keep the discussion as concise as possible and restrict ourselves to 
listing the main differences from the bosonic theory. 


11.4 Fermion Keldysh theory (a list of changes) 

11.4.1 Single level

As in the bosonic case, let us begin by considering a single level Hamiltonian $\hat H = \omega a^\dagger a$,
where $a$ and $a^\dagger$ are fermion operators. The definitions of the Keldysh partition function
(11.9), and of the equilibrium density operator (11.10) remain unchanged, only that the
normalizing factor is now given by $Z_0 = 1 + e^{-\beta(\omega - \mu)}$. Again, we will want to represent the
partition function in terms of (now fermionic) coherent states.

It is straightforward to verify that the auxiliary identity Eq. (11.11) generalizes to the
fermionic case, which means that the discrete functional Eq. (11.12) remains as it is but for
a sign change $\kappa \to -\kappa$. As in the construction of equilibrium (Matsubara) functionals, this
change represents the sign we pick up when we sweep one of the $2N$ coherent state amplitudes
through all others, as we do in the construction of the functional integral. Consequently, the
inverse of the discrete fermionic Green function is given by Eq. (11.13), again up to a sign
change $\kappa \to -\kappa$. Up to this sign factor, all remaining formulae of the discrete discussion
of Section 11.2.2 generalize to the fermionic case. Specifically, $\det(-iG^{-1}) = 1 + \kappa =
1 + e^{-\beta(\omega - \mu)} = Z_0$, which proves normalization of the functional integral. Taking the limit
$N \to \infty$, and switching to a continuum representation, we obtain the fermionic Green
functions

\[ \begin{aligned}
G^{++}_{t,t'} &= -i e^{-i\omega(t - t')} \left( \Theta(t - t') - n(\omega) \right), \\
G^{--}_{t,t'} &= -i e^{-i\omega(t - t')} \left( \Theta(t' - t) - n(\omega) \right), \\
G^{+-}_{t,t'} &= +i e^{-i\omega(t - t')} n(\omega), \\
G^{-+}_{t,t'} &= -i e^{-i\omega(t - t')} \left( 1 - n(\omega) \right), \qquad (11.52)
\end{aligned} \]

where

\[ n(\omega) = \frac{1}{1 + e^{\beta(\omega - \mu)}} \]

is the Fermi distribution. (The global plus sign appearing in $G^{+-}$ reflects the sign factor
picked up upon passage of the terminal point of the contour.) We may now pass to a
continuum representation of the functional integral whose unregularized form (no account
for infinitesimal damping and boundary conditions) is given by the Grassmann version
of Eq. (11.17). A glance at Eq. (11.52) shows that the redundancy in the theory which
motivated the “Keldysh rotation” now assumes the form $G^{++} + G^{--} = G^{+-} + G^{-+}$. In
the case of Grassmann variables, it is customary¹⁴ to account for this redundancy by way
of a field transformation slightly different from the bosonic Keldysh rotation above: using
the independence of the Grassmann variables $\bar\psi$ and $\psi$, we introduce new fields by

\[ \begin{aligned}
\psi_1 &= \frac{1}{\sqrt 2} (\psi_+ + \psi_-), & \psi_2 &= \frac{1}{\sqrt 2} (\psi_+ - \psi_-), \\
\bar\psi_1 &= \frac{1}{\sqrt 2} (\bar\psi_+ - \bar\psi_-), & \bar\psi_2 &= \frac{1}{\sqrt 2} (\bar\psi_+ + \bar\psi_-). \qquad (11.53)
\end{aligned} \]

Introducing the notation $\psi = (\psi_+, \psi_-)^T$, $\psi' = (\psi_1, \psi_2)^T$, $\bar\psi = (\bar\psi_+, \bar\psi_-)$, $\bar\psi' = (\bar\psi_1, \bar\psi_2)$, this assumes
the form of a non-unitary transformation

\[ \psi' = U\psi, \qquad \bar\psi' = \bar\psi\, \sigma_3 U^\dagger, \]

where $U$ has been defined in Eq. (11.18). (Due to the non-unitarity of the transformation,
and the fact that Grassmann fields are never “classical,” we do not speak of classical or
quantum components.) The form of the above transformation is motivated by the transformed
contraction rule

\[ -i\langle \psi' \bar\psi' \rangle = -i\, U \langle \psi \bar\psi \rangle \sigma_3 U^\dagger = U G \sigma_3 U^\dagger = G', \]

where $G = \begin{pmatrix} G^{++} & G^{+-} \\ G^{-+} & G^{--} \end{pmatrix}$, and the transformed Green function assumes the form

\[ G' = \begin{pmatrix} G^+ & G^K \\ 0 & G^- \end{pmatrix}, \qquad (11.54) \]

and the retarded, advanced, and Keldysh Green function are defined in analogy to
Eq. (11.20),

\[ \begin{aligned}
G^+_{t,t'} &= -i\Theta(t - t')\, e^{-i\omega(t - t')}, \\
G^-_{t,t'} &= +i\Theta(t' - t)\, e^{-i\omega(t - t')}, \\
G^K_{t,t'} &= -i\left( 1 - 2n(\omega) \right) e^{-i\omega(t - t')}. \qquad (11.55)
\end{aligned} \]

As we will mostly work with the transformed Green function $G'$, we omit the prime throughout,
$G' \to G$.

14 A. I. Larkin and Yu. N. Ovchinnikov, Vortex motion in superconductors, in Nonequilibrium Superconductivity,
eds. D. N. Langenberg and A. I. Larkin (Elsevier, 1986).
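
The triangular structure of the rotated Green function can be verified numerically for the single level. The sketch below is an illustration under assumptions: it takes $U = \frac{1}{\sqrt 2}\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}$ for the matrix of Eq. (11.18), which is not reproduced in this section, and evaluates the rotation as $U G \sigma_3 U^\dagger$ on the four contour Green functions (11.52). All function names are hypothetical.

```python
import cmath
import math


def contour_green_functions(t, tp, omega, n):
    """2x2 matrix [[G++, G+-], [G-+, G--]] of Eq. (11.52) for a single level."""
    phase = cmath.exp(-1j * omega * (t - tp))

    def theta(s):
        return 1.0 if s > 0 else 0.0

    g_pp = -1j * phase * (theta(t - tp) - n)
    g_mm = -1j * phase * (theta(tp - t) - n)
    g_pm = 1j * phase * n
    g_mp = -1j * phase * (1.0 - n)
    return [[g_pp, g_pm], [g_mp, g_mm]]


def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def rotate(G):
    """Fermionic Keldysh rotation G' = U G sigma_3 U^dagger (U real symmetric)."""
    s = 1.0 / math.sqrt(2.0)
    U = [[s, s], [s, -s]]          # assumed form of U from Eq. (11.18)
    sigma3 = [[1.0, 0.0], [0.0, -1.0]]
    return matmul(matmul(matmul(U, G), sigma3), U)


if __name__ == "__main__":
    Gp = rotate(contour_green_functions(t=0.7, tp=0.2, omega=1.3, n=0.3))
    print("G+ :", Gp[0][0])
    print("GK :", Gp[0][1])
    print("0  :", Gp[1][0])
    print("G- :", Gp[1][1])
```

For $t > t'$ the output reproduces $G^+ = -ie^{-i\omega(t - t')}$, $G^- = 0$, $G^K = -i(1 - 2n)e^{-i\omega(t - t')}$, and a vanishing lower left entry, in line with Eqs. (11.54) and (11.55).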


11.4.2 Generalization

As with the bosonic theory, the form of the Green function Eq. (11.55) suggests the
introduction of a fermionic distribution operator $\hat F = \{F(\epsilon)\}$, with

\[ F(\epsilon) = 1 - 2n(\epsilon). \qquad (11.56) \]

Expressed as a Fourier transform, the Keldysh Green function then assumes the form

\[ G^K = G^+ F - F G^-. \qquad (11.57) \]

The generalization of the toy model above to a fully interacting theory is described by 
the Grassmann version of the functional Eq. (11.30). (The optional addition of spin-indices 
should be straightforward.) In contrast to the bosonic case, where mean field approaches to 
the native interacting theory lead to immediate results, most fermionic theories call for a 
Hubbard-Stratonovich decoupling of the interaction term. Prior to passing to the Keldysh 
rotated theory, we therefore introduce an auxiliary bosonic field to decouple the interaction 
as 

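\[ e^{iS_{\rm int}[\bar\psi, \psi]} = \int D\phi\; e^{\frac{i}{2} \int dt\, d^dr\, d^dr' \sum_c s_c\, \phi_c(\mathbf r, t) V^{-1}(\mathbf r - \mathbf r') \phi_c(\mathbf r', t)} \times e^{+i \int dt\, d^dr \sum_c s_c\, \bar\psi_c(\mathbf r, t) \phi_c(\mathbf r, t) \psi_c(\mathbf r, t)}. \]

We may now implement the Keldysh rotation to arrive at the effective action $S[\bar\psi, \psi, \phi] =
S[\phi] + S[\bar\psi, \psi] + S_{\rm int}[\bar\psi, \psi, \phi]$, where

\[ \begin{aligned}
S[\phi] &= \int d^dr\, d^dr'\, dt\; \phi_c(\mathbf r, t) V^{-1}(\mathbf r - \mathbf r') \phi_q(\mathbf r', t), \\
S[\bar\psi, \psi] &= \int d^dr\, dt\; \bar\psi\, G_0^{-1} \psi, \\
S_{\rm int}[\bar\psi, \psi, \phi] &= -\frac{1}{\sqrt 2} \int d^dr\, dt\; \bar\psi \left( \phi_c \sigma_0 + \phi_q \sigma_1 \right) \psi. \qquad (11.58)
\end{aligned} \]

Here

\[ G_0^{-1} = \begin{pmatrix} (G_0^+)^{-1} & (G_0^{-1})^K \\ 0 & (G_0^-)^{-1} \end{pmatrix} \qquad (11.59) \]

is the inverse of the free Green function¹⁵ and $(G_0^\pm)^{-1} = (i\partial_t \pm i\delta - H)$. We finally note
that elements of the Green function are generated by the fermion variant of Eq. (11.35),

\[ -i\langle \psi_a(\mathbf r, t) \bar\psi_{a'}(\mathbf r', t') \rangle = G_{aa'}(\mathbf r, t; \mathbf r', t'). \qquad (11.60) \]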


11.5 Kinetic equation 

The action Eq. (11.58) may serve as the starting point of numerous applications. Here, we 
restrict ourselves to the discussion of one interaction phenomenon that is particularly rele- 
vant to the central theme of the chapter: the interaction mediated formation of effective 
particle distributions. As a byproduct, we also introduce elements of Keldysh pertur- 
bation theory, and compare to the “equilibrium perturbation theory” discussed in earlier 
chapters. The concepts and techniques introduced in this section are very important and 
are routinely applied in the analysis of out of equilibrium quantum systems. 

In carrying out the perturbative analysis, we are guided by two principles: first, we assume 
that the system (e.g. by way of high electron density) admits an effective RPA approach. 
Second, we think of the Hubbard-Stratonovich field, $\phi$, as an independent, bosonic degree
of freedom. That analogy becomes more substantial as we proceed and $\phi$ acquires its own
dynamics. Specifically, the boson-fermion analogy suggests not to integrate over the fermion 
degrees of freedom just yet (which would be an option as a result of the quadratic nature 
of the action Eq. (11.58)), but to study boson and fermion propagators separately. 

In Eq. (11.58), quasiparticles interact by the two-fermion one-boson interaction vertices contained
in $S_{\rm int}$. Our intermediate goal will be to obtain the emerging fermion and boson self energies $\Sigma_f$
and $\Sigma_b$, respectively, in the RPA approximation, i.e. an approximation void of intersecting
propagator lines. The topology of $\Sigma_f$ is given by the first diagram shown in the figure, with
external legs removed, and all propagators representing full propagators, with self-consistent
account of the self energies. In a similar manner, $\Sigma_b$ is obtained from the
lower diagram. According to the general rules discussed above, the fermion self energies are
obtained by contraction of interaction operators at fixed external field indices. Specifically,
the fermion self-energy is defined as¹⁶

\[ \int dx\, dx'\; \bar\psi'(x) \Sigma_f(x, x') \psi'(x') \equiv -\frac{i}{2} \int dx\, dx'\; \bar\psi'(x) \left\langle \hat\phi(x) \psi(x) \bar\psi(x') \hat\phi(x') \right\rangle \psi'(x'), \]

where $\hat\phi = \phi_c \sigma_0 + \phi_q \sigma_1$, the angular brackets indicate the contraction of the enclosed fields, and we
have switched to the space-time abbreviation $x = (\mathbf r, t)$.
The self-consistency inherent to the RPA scheme requires that the contractions must be
interpreted to produce the full Green functions of the theory. Comparison of the two sides
of the equation and usage of Eq. (11.60) leads to the identification,

\[ \begin{aligned}
\Sigma_f(x, x') &= -\frac{i}{2} \left\langle \hat\phi(x) \psi(x) \bar\psi(x') \hat\phi(x') \right\rangle = -\frac{1}{2} \left\langle \hat\phi(x) G(x, x') \hat\phi(x') \right\rangle \\
&= -\frac{i}{2} \left( G(x, x') D^K(x, x') + G(x, x') \sigma_1 D^+(x, x') + \sigma_1 G(x, x') D^-(x, x') \right),
\end{aligned} \]

where we have adopted the common convention to designate bosonic Green functions by
$D^{+,-,K}$. Using this expression, the components of the self-energy operator

\[ \Sigma_f = \begin{pmatrix} \Sigma^+ & \Sigma^K \\ 0 & \Sigma^- \end{pmatrix} \]

are identified as

\[ \Sigma^\pm(x, x') = -\frac{i}{2} \left( G^\pm(x, x') D^K(x, x') + G^K(x, x') D^\pm(x, x') \right), \qquad (11.61) \]

\[ \begin{aligned}
\Sigma^K(x, x') &= -\frac{i}{2} \left( G^K(x, x') D^K(x, x') + G^+(x, x') D^+(x, x') + G^-(x, x') D^-(x, x') \right) \\
&= -\frac{i}{2} \left( G^K(x, x') D^K(x, x') + (G^+(x, x') - G^-(x, x'))(D^+(x, x') - D^-(x, x')) \right),
\end{aligned} \]

where we have made use of the fact that causality implies $G^+(x, x') D^-(x, x') = 0$, etc. Also
notice that the causality of retarded and advanced Green functions, $D^\pm(x, x') G^\mp(x, x') = 0$,
causes the vanishing of the lower left-hand block of the self-energy operator.

15 The off-diagonal block $(G^{-1})^K$ is given by $(G^{-1})^K = -(G^+)^{-1} G^K (G^-)^{-1} = (G^+)^{-1} F - F (G^-)^{-1} = 2i\delta F$,
where the last identity holds if $F$ commutes with $(G^\pm)^{-1}$.

16 Should you feel uncertain about this result, it is worthwhile investing some time to recapitulate the perturbative
construction of self energies.


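EXERCISE Show that the bosonic self-energy

\[ \Sigma_b = \begin{pmatrix} 0 & \Sigma_b^- \\ \Sigma_b^+ & \Sigma_b^K \end{pmatrix} \]

is given by

\[ \Sigma_b^\pm(x, x') = -\frac{i}{2} \left( G^\mp(x', x) G^K(x, x') + G^\pm(x, x') G^K(x', x) \right), \qquad (11.62) \]

\[ \begin{aligned}
\Sigma_b^K(x, x') &= -\frac{i}{2} \left( G^K(x', x) G^K(x, x') + G^+(x', x) G^-(x, x') + G^-(x', x) G^+(x, x') \right) \\
&= -\frac{i}{2} \left( G^K(x', x) G^K(x, x') - (G^+ - G^-)(x', x)\, (G^+ - G^-)(x, x') \right).
\end{aligned} \]

In the present context, where the boson field actually represents fluctuations in a scalar
potential, we may think of the self-energy as an effective polarization operator screening field
fluctuations.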


11.5.1 Quasiclassical theory 

We now have everything in place to address our main task, the computation of the fermion 
(or boson) distributions which develop in the interacting environment. The picture we should 
have in mind is that a system of fermions prepared in some initial distribution gets exposed 
to interactions. We wish to find out how the latter drive the system to a new state of 
equilibrium. The understanding of this mechanism is the basis for the description of out-of- 
equilibrium distributions forming, e.g. if the system is exposed to strong external fields. 
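Wigner transform

In a time-independent situation, all constituents of the theory depend only on time differences,
$\Delta t$, and vary at time scales $\Delta t \sim \epsilon^{-1}$ comparable to their characteristic energies. In
cases where systematic time-dependent changes occur (e.g. in response to the formation of
a nonequilibrium initial state) on time scales $t \gg \Delta t$, it is convenient to pass to a Wigner
transformed representation, $G(t_1, t_2) \to G(\epsilon, t)$, where $t = (t_1 + t_2)/2$ and $\epsilon$ is conjugate to
$\Delta t = t_1 - t_2$. Technically, the Wigner representation of an arbitrary operator $\hat g = \{g(t, t')\}$
is defined by the transform

\[ \begin{aligned}
g(\epsilon, t) &= \int d\Delta t\; e^{i\Delta t\, \epsilon}\, g(t + \Delta t/2, t - \Delta t/2), \\
g(t_1, t_2) &= \int \frac{d\epsilon}{2\pi}\; e^{-i(t_1 - t_2)\epsilon}\, g(\epsilon, t). \qquad (11.63)
\end{aligned} \]

In applications, we often need to know the Wigner representation of the product of two
operators, $\hat f \hat g$. Using the definition above, it is straightforward to derive the Moyal product
identity

\[ \begin{aligned}
(fg)(\epsilon, t) &= \left. e^{-\frac{i}{2} \left( \partial_{t_1} \partial_{\epsilon_2} - \partial_{\epsilon_1} \partial_{t_2} \right)} f(\epsilon_1, t_1)\, g(\epsilon_2, t_2) \right|_{\epsilon_1 = \epsilon_2 = \epsilon,\; t_1 = t_2 = t} \\
&= f(\epsilon, t) g(\epsilon, t) - \frac{i}{2} \left( \partial_t f(\epsilon, t)\, \partial_\epsilon g(\epsilon, t) - \partial_\epsilon f(\epsilon, t)\, \partial_t g(\epsilon, t) \right) + \dots \\
&\simeq \left( fg - \frac{i}{2} \{f, g\} \right)(\epsilon, t), \qquad (11.64)
\end{aligned} \]

where in the third line we have introduced the Poisson bracket

\[ \{f, g\} = \partial_t f\, \partial_\epsilon g - \partial_\epsilon f\, \partial_t g. \qquad (11.65) \]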
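
As a quick consistency check of the sign conventions (a worked example added for illustration, assuming the identification of the energy operator with $i\partial_t$), take $\hat f = i\partial_t$ with Wigner symbol $f = \epsilon$, and a time-local operator $\hat g = g(t)$. The exact commutator and the leading Moyal term, $[\hat f, \hat g] \to -i\{f, g\}$, then agree:

```latex
\begin{aligned}
[\, i\partial_t ,\, g(t) \,]\, \psi(t)
  &= i\partial_t \bigl( g(t)\psi(t) \bigr) - g(t)\, i\partial_t \psi(t)
   = i\, \dot g(t)\, \psi(t), \\
-i\{\epsilon, g\}
  &= -i \left( \partial_t \epsilon\; \partial_\epsilon g - \partial_\epsilon \epsilon\; \partial_t g \right)
   = -i \left( 0 - \partial_t g \right)
   = i\, \partial_t g(t).
\end{aligned}
```

In this example the expansion terminates at first order, since $f$ is linear in $\epsilon$ and $g$ is independent of $\epsilon$.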

The scope of the Wigner transformation actually extends beyond the temporal variables:
in general, the distribution functions may exhibit explicit space dependence $F \to F(\mathbf r)$.
(This will occur, say, if a system is connected to two reservoirs kept at different temperatures,
voltages, etc. In this case, gradients in the particle distributions may form.) We
expect the characteristic scale of these variations to be much larger than the microscopic
particle wavelengths. In this case, it is convenient to pass to the spatial Wigner transform,
$G(\mathbf r_1, \mathbf r_2) \to G(\mathbf r, \mathbf p)$, where $\mathbf r = (\mathbf r_1 + \mathbf r_2)/2$ and $\mathbf p$ is conjugate to $\Delta\mathbf r = \mathbf r_1 - \mathbf r_2$, i.e.

\[ \begin{aligned}
g(\mathbf r, \mathbf p) &= \int d\Delta\mathbf r\; e^{-i\Delta\mathbf r \cdot \mathbf p}\, g(\mathbf r + \Delta\mathbf r/2, \mathbf r - \Delta\mathbf r/2), \\
g(\mathbf r_1, \mathbf r_2) &= \int \frac{d^dp}{(2\pi)^d}\; e^{i(\mathbf r_1 - \mathbf r_2) \cdot \mathbf p}\, g(\mathbf r, \mathbf p), \qquad (11.66)
\end{aligned} \]

where the different sign reflects the conventions standard in spatial and temporal Fourier
transforms, respectively.

For later reference, a number of Wigner transform identities are summarized in table
11.1, where the spatial analogue of Eq. (11.65) is defined as $\{f, g\} = \partial_{\mathbf r} f \cdot \partial_{\mathbf p} g - \partial_{\mathbf p} f \cdot \partial_{\mathbf r} g$.
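Table 11.1 Wigner transform identities

Temporal                                                              Spatial
Operator                          Wigner representation               Operator                                   Wigner representation
$\hat g(\hat t)$                  $g(t)$                              $\hat g(\hat{\mathbf r})$                  $g(\mathbf r)$
$\hat g(\hat\epsilon) = g(i\partial_t)$    $g(\epsilon)$              $\hat g(\hat{\mathbf p}) = g(-i\partial_{\mathbf r})$    $g(\mathbf p)$
$[\hat f, \hat g]$                $-i\{f, g\} + \dots$                $[\hat f, \hat g]$                         $i\{f, g\} + \dots$
$\mathrm{tr}(\hat g)$             $\int \frac{dt\, d\epsilon}{2\pi}\, g(\epsilon, t)$    $\mathrm{tr}(\hat g)$    $\int \frac{d^dr\, d^dp}{(2\pi)^d}\, g(\mathbf r, \mathbf p)$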

In the following, it is convenient to bundle temporal and spatial transforms into a
spatio-temporal Wigner transform,

\[ \begin{aligned}
g(x, p) &= \int d\Delta x\; e^{i\Delta x^T \eta\, p}\, g(x + \Delta x/2, x - \Delta x/2), \\
g(x_1, x_2) &= \int dp\; e^{-i(x_1 - x_2)^T \eta\, p}\, g(x, p), \qquad (11.67)
\end{aligned} \]

where $x = (t, \mathbf x)^T = (x_1 + x_2)/2$, $p = (\epsilon, \mathbf p)^T$, the Minkowski metric $\eta = \mathrm{diag}(1, -\mathbf 1_d)$, and
$dx = dt\, d^dx$, $dp = d\epsilon\, d^dp/(2\pi)^{d+1}$.

Derivation of the kinetic equation

We now set out to derive a differential equation describing the evolution of the distribution
function. In view of the time scale separations alluded to above, it is convenient to formulate
that equation in its Wigner representation. To this end, we consider the formal operator
equation

\[ \begin{pmatrix} \epsilon - H - \Sigma^+ & -\Sigma^K \\ 0 & \epsilon - H - \Sigma^- \end{pmatrix} \begin{pmatrix} G^+ & G^K \\ 0 & G^- \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}. \]

Writing $G^K = G^+ F - F G^-$, this matrix equation is equivalent to the set

\[ \begin{aligned}
(\epsilon - H - \Sigma^\pm)\, G^\pm &= 1, \\
(\epsilon - H - \Sigma^+)(G^+ F - F G^-) - \Sigma^K G^- &= 0.
\end{aligned} \]

Multiplying the second equation by $(G^-)^{-1}$ and using the first equation, we obtain

\[ F(\epsilon - H - \Sigma^-) - (\epsilon - H - \Sigma^+) F - \Sigma^K = 0. \]

At this stage, it is convenient to pass to the Wigner transform. Making use of the identities
summarized in table 11.1, we obtain the kinetic equation

\[ \left( \partial_t + \partial_t H\, \partial_\epsilon + \nabla_{\mathbf p} H \cdot \nabla_{\mathbf r} - \nabla_{\mathbf r} H \cdot \nabla_{\mathbf p} \right) F = i\Sigma^K - i\left( \Sigma^+ F - F \Sigma^- \right), \qquad (11.68) \]

where the equality is approximate in that higher order contributions to the Moyal product
expansion have been neglected. In Eq. (11.68), $F = F(\epsilon, t, \mathbf r, \mathbf p)$ is a function of spatial and
temporal variables. The left-hand side of the kinetic equation describes the evolution of the
distribution function $F$ under the single particle dynamics.
The effect of particle interactions is encapsulated in the right-hand side, which defines
the collision term. To lowest order in the Moyal expansion, the operator products appearing
in the collision term may be understood as products of Wigner functions, $(\Sigma^+ F)(\epsilon, t, \mathbf r, \mathbf p) =
\Sigma^+(\epsilon, t, \mathbf r, \mathbf p) F(\epsilon, t, \mathbf r, \mathbf p)$, etc. As we will see in a moment, the term $\Sigma^K$ ($\Sigma^+ F - F \Sigma^-$) represents
an in-term (out-term), i.e. an interaction-mediated gain (loss) of particle occupancy.

EXERCISE Compare the structure of the quantum kinetic equation to that of the classical 
Boltzmann equation discussed in Section 10.3. Try to anticipate the explicit structure of the 
collision term in our system of interacting electrons. 


Collision term 

So far, our discussion of the kinetic equation has been absolutely general; no reference to 
the specific mechanisms of self-energy generation has been made. To better understand the 
physics of the collision term, we now return to our example of interacting electrons. 

EXERCISE Show that the Wigner transform of composite operators with space-time dependence
$g(x, x') f(x, x')$ is obtained as

\[ g(x, x') f(x, x') \to \int dp'\; g(x, p')\, f(x, p - p'). \qquad (11.69) \]

Also show that

\[ g(x, x') f(x', x) \to \int dp'\; g(x, p')\, f(x, p' - p). \qquad (11.70) \]


The collision term governs the 
rates of occupation number 
changes due to the collisions into 
and out of other states. This 
being so, we expect it to be 
determined by (a) the occupation 
numbers of the collision partners, 
and (b) the density of states 
participating in the interactions. 
We thus aim for a description in terms of occupation functions F or n/, and spectral 
functions A = — 21m G + . Finally, to expose the fermion propagators contained in the 
self-energy of the boson propagator D, we will aim to express the latter in terms of E &. To 
this end, we note that the relation D -1 = V~ 1 o-[ — E 6 = ( p-i° g+ V f.K b j implies that 

T)± = ( V - 1 - tf ) -1 = D+iV- 1 - E 6 )D~, 
d k = b + t^b~. 
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We now have everything we need to compute the quasiclassical representation of the collision 
term in Eq. (11.68): 

J coi [F](p) (11 - 61) i n ' 69) * J dq^G K (p + q)D K (q) + (G + - G~)(p + q){D + - D~)(q) 

- ((G+ - G~)(P + q)D K (q) + G K (p + q)(D+ - D~)(q))F(p)] 

{1 = 7) ~l J dqD+(q)D-(q)A(p + q) 

x [(F(p + q)- F(p))E?(q) + (E+ - ^)(q)( 1 - F(p + q)F(p))] 

( 11 . 62 ^( 11 . 70 ) 1 J dqdp ' D +^ D -^ A ^ + q J A ^ + q ^ A ^ 

x [(F(p + q) - F(p))( 1 - F(p’ + q)F{p')) - ( F(p ' + q) - F(p'))( 1 - F(p + q)F(p )] . 
We now assume that $F(p) = 1 - 2n_\epsilon$ depends only on energy, define the transition probability
T(e,e',w) = J ^^^^D+(q)D-(q)A(p + q)A(p' + q)A(p'), 
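and add and subtract a term $n_\epsilon n_{\epsilon+\omega} n_{\epsilon'} n_{\epsilon'+\omega}$ to arrive at the result

\[ I_{\rm col}[F](p) = \int d\omega\, d\epsilon'\; T(\epsilon,\epsilon';\omega)\Big( n_\epsilon (1 - n_{\epsilon+\omega})\, n_{\epsilon'+\omega} (1 - n_{\epsilon'}) - (1 - n_\epsilon)\, n_{\epsilon+\omega} (1 - n_{\epsilon'+\omega})\, n_{\epsilon'} \Big). \tag{11.71} \]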


Let us discuss the meaning of this expression: 


> The structure of the collision term resembles that of the classical Boltzmann collision
term discussed in Section 10.3. Quantum mechanics enters through the Pauli blocking
factors $1 - n_\epsilon$, etc.

> Much as the collision term of Boltzmann theory was annihilated by the Maxwell-Boltzmann
distribution, the collision term Eq. (11.71) vanishes on the Fermi-Dirac distribution:
interactions serve to install local equilibrium configurations, at a temperature $T$ and
chemical potential $\mu$ which remain a priori unspecified. Departures from equilibrium
result from the competition between the left-hand side and the right-hand side of the
kinetic equation. In many instances the system may end up in a compromise situation
governed by a Fermi-Dirac distribution with spatially varying $T(q)$ and $\mu(q)$.

> Notice that the collision term vanishes in an approximation where the effects of screening
in $D \simeq \sigma_1 V$ are ignored (technically, by ignoring the RPA self-energy of the boson line).
Diagrammatically, the building blocks $D^+ - D^- = D^+(\Sigma^+ - \Sigma^-)D^-$ and $D^K = D^+\Sigma^K D^-$
are described by the boson insertion to the self-energy diagram shown in the bottom part
of the figure above. (Identify the two wavy lines with the self-consistent RPA $D^R$ and $D^A$,
respectively, and the center bubble with the self-energy $\Sigma$.) Heuristically, we may interpret
this structure as the modulus square of the fundamental RPA decay amplitudes shown
in the upper part of the figure: the out-term is due to the decay of a particle at energy
$\epsilon$ into a particle-hole excitation (energies $\epsilon'$ and $\epsilon' + \omega$, respectively) and a particle at
energy $\epsilon + \omega$. This decay is mediated by the (now self-consistently screened) interaction
line at frequency $\omega$. In an analogous manner, the collapse of a particle-hole excitation
governs the “in” rate. These processes conserve energy and momentum, up to the smearing
introduced by higher order decay processes. (Technically, the spectral functions $A$ contain
$\delta$-distributions $\delta_\Sigma(\epsilon - \epsilon(\mathbf{p}))$, where the subscript $\Sigma$ indicates their broadening by retarded
self energies. As an exercise, consider how these self energies encapsulate the generation
of higher order particle-hole decay processes, as described by the RPA approximation.)
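The statement that the collision term is annihilated by the Fermi-Dirac distribution can be checked directly on the occupation-number bracket of Eq. (11.71). The following is a minimal sketch: the transition probability is left out (it multiplies the bracket as a whole), and the function names, the sample energies and the "double-step" test distribution are illustrative assumptions rather than part of the text's derivation.

```python
import math

def fermi(e, mu=0.0, T=1.0):
    """Fermi-Dirac occupation at chemical potential mu and temperature T."""
    return 1.0 / (math.exp((e - mu) / T) + 1.0)

def collision_bracket(n, e, e_prime, w):
    """Out-term minus in-term of Eq. (11.71) for an occupation function n(energy)."""
    out_term = n(e) * (1 - n(e + w)) * n(e_prime + w) * (1 - n(e_prime))
    in_term = (1 - n(e)) * n(e + w) * (1 - n(e_prime + w)) * n(e_prime)
    return out_term - in_term

def double_step(e, V=2.0, T=0.1):
    """A non-equilibrium example: equal-weight mixture of two shifted Fermi functions."""
    return 0.5 * (fermi(e, mu=V / 2, T=T) + fermi(e, mu=-V / 2, T=T))

# Equilibrium: the bracket vanishes for any energies and any (mu, T).
equilibrium_value = collision_bracket(lambda e: fermi(e, mu=0.3, T=0.7), 0.3, -0.4, 0.7)
# Out of equilibrium: the bracket is generically non-zero.
nonequilibrium_value = collision_bracket(double_step, 0.3, -0.4, 0.7)
```

The cancellation in equilibrium follows from $n_\epsilon/(1 - n_\epsilon) = e^{-(\epsilon-\mu)/T}$: the energies in the ratio of out-term to in-term add up to zero in the exponent.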


11.6 A mesoscopic application 

In the previous sections we have set up the essential machinery to study quantum systems 
beyond thermal equilibrium. We have understood how interaction effects aim to drive a 
system back to a (local) equilibrium configuration and how departures from that configura- 
tion can, in principle, be described. However, we have not yet explored the actual physical 
effects a nonequilibrium environment has on the quantum phenomenology of a system. Of 
course, this latter question has many facets to it and cannot be addressed in general terms. 
Rather, we focus on one specific aspect, and we study it within the framework of a concrete 
application: quantum mechanics is largely about the coherent propagation of matter waves, 
and the resulting interference phenomena. We know that coherence is averse to any kind 
of noisy fluctuations in environmental degrees of freedom. At the same time, noise is an 
inevitable companion of any coupling to a dissipative medium, and it is likely to increase if 
that coupling generates a nonequilibrium state. 

In this section, we explore the formation of noise-levels in a quantum system coupled to 
a nonequilibrium environment, and its destructive influence on quantum coherence. Ques- 
tions of this type are attracting a lot of current attention: advances in experimentation 
make it possible to explore the physics of nanoscopic systems that are strongly influenced 
by quantum interference phenomena - from ultracold atom physics, quantum information 
devices, and spintronics, to single molecule electronics, and beyond. At the same time, the 
very smallness of these systems implies that they are easily driven out of equilibrium, or 
are in an out of equilibrium state per se. Studying the influence of nonequilibrium noise on 
quantum coherence has, therefore, emerged as a mainstream topic. 
11.6.1 Out-of-equilibrium quantum dot


In this section,$^{17}$ we investigate the interplay between noise and coherence on a specific
setup, the out-of-equilibrium metallic quantum dot. Consider a metallic or semiconducting
island of mesoscopic size.$^{18}$ The island is connected by tunnel electrodes to two leads, which
are kept at a voltage difference $V$. This voltage gradient will be the primary instance by
which the system is driven out of equilibrium. Finally, we assume capacitive coupling to a
gate electrode (indicated by the horizontal plate in the figure) which determines the
electrostatically-preferred number of electrons on the dot.

We describe this system by the Hamiltonian $H = \sum_{x=L,R}(H_x + H_{T,x}) + H_d + H_c$, where

\[ \begin{aligned}
H_x &= \sum_{ab} c^\dagger_{x,a} H_{x,ab}\, c_{x,b}, \\
H_{T,x} &= \sum_{a\mu} c^\dagger_{x,a} T_{x,a\mu}\, d_\mu + \mathrm{h.c.}, \\
H_d &= \sum_{\mu\nu} d^\dagger_\mu H_{d,\mu\nu}\, d_\nu, \\
H_c &= E_c (\hat N - N_0)^2.
\end{aligned} \tag{11.72} \]

Here $c^\dagger_{x=L/R,a}$ create fermions in state $|a\rangle$ of the left/right lead, respectively, $d^\dagger_\mu$ creates
fermions in state $|\mu\rangle$ of the dot, and $H_x$ and $H_d$ are the corresponding single-particle
Hamiltonians. The operators $T_{x=L/R}$ describe tunneling between the leads and the dot. Finally,
$E_c$ is the electrostatic charging energy on the dot, and $N_0$ defines its electrostatically-
preferred charge. (Notice that $N_0$ need not be integer.) The voltage bias between the leads
will be introduced momentarily through a chemical potential difference.


EXERCISE Recapitulate the discussion of the equilibrium physics of this system in Problems 6.7 
and 6.7. 


17 This section is technical in nature. A few unanswered problems aimed at developing computational fluency
in Keldysh field theory have been included in the text. (It is not necessary to tackle these problems at first
reading.)

18 By “mesoscopic,” we mean that the temperature-dependent dephasing times over which quantum phase
coherence is lost due to the interaction with environmental degrees of freedom exceed all time scales relevant to
transport in the system. In practice, this means that we are studying systems of $\mathcal{O}(1\,\mu\mathrm{m})$ or less at
temperatures of $\mathcal{O}(\mathrm{mK})$. System sizes/temperature ranges of this order are easily accessible to modern device
technology.






Generalizing the theoretical formulation introduced in Problems 6.7 and 6.7, we next aim 
to derive a Keldysh theory of the nonequilibrium system. Our starting point is the fermion 
action $S[\bar\psi,\psi,V] = \sum_{x=L,R}\big(S_x[\bar\psi_x,\psi_x] + S_{T,x}[\bar\psi_x,\psi_x,\bar\psi_d,\psi_d]\big) + S_d[\bar\psi_d,\psi_d,V] + S_c[V]$, where
contain the information on the biasing of the system and $V_{c,q}$ are the classical and quantum
components of a Hubbard-Stratonovich field decoupling the interaction.


EXERCISE Adapt the Hubbard-Stratonovich transformation introduced in Section 11.4.2
to the charging interaction in Eq. (11.72) to reproduce the action above. In Eq. (11.73), the
Hubbard-Stratonovich classical and quantum fields $V_c$ and $V_q$ are defined as
$V_c = \frac{1}{2}(V_+ + V_-)$, $V_q = V_+ - V_-$, in terms of the fields $V_\pm$ decoupling the interaction on the
contours. This differs by a factor of $\sqrt{2}$ from earlier conventions and facilitates the
interpretation of $V_c$ as a classical degree of freedom.

As with the equilibrium situation studied in Problem 6.7, it will be convenient to remove
the fluctuating “voltage” $V$ by a gauge transformation on the dot: introducing the change
of variables $\psi_d \to e^{-i\phi}\psi_d$, $\bar\psi_d \to \bar\psi_d e^{i\phi}$, where $\phi = \phi_c + \phi_q\sigma_1/2$ and the phases are defined
by $\partial_t\phi_{c,q} = V_{c,q}$, $V$ disappears from the bulk action while the tunneling matrix becomes
dynamical,$^{19}$

f x -i f x e~^ = f x . 

At this point, the fermions can be integrated out to generate the familiar “tr In.” The rest 
of the derivation then proceeds as in problem 6.7: 

EXERCISE Adapt the derivation of Problem 6.7 to the present context: Expand the “tr In” to 
second order in the tunneling matrix elements (recall what sort of assumptions may justify the 
second order truncation) to generate the tunneling action 

\[ S_{\rm tun}[\phi] = i\sum_x \mathrm{tr}\big(G_x T_x G_d T_x^\dagger\big). \]

19 The gauge transformation also alters the distribution function Fd (exercise: how?). 
Evaluate this expression under the following simplifying assumptions: (i) the Hamiltonian
operators $H_x$ and $H_d$ are diagonal in the bases $|a\rangle$ and $|\mu\rangle$, respectively, (ii) in the range of relevant
energies, the modulus $|T_{x,a\mu}|^2 \equiv |T_x|^2$ does not significantly depend on Hilbert space indices,
and (iii) the spectral functions $A_{x,d}(\epsilon) = -2\,\mathrm{Im}\,G^+_{x,d}(\epsilon) = 2\pi\nu_{x,d}(\epsilon)$ are approximately energy
independent. (Here $\nu_{x,d}$ are the single particle density of states of leads and dot, respectively.)
Show that under these conditions, the tunneling action assumes the form Eq. (11.75). Hint: you
may find it convenient to represent the Green functions as G ± (e) = / _ \ ■ 

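The derivation sketched in the formulation of the exercise above then leads to the tunneling
action

\[ S_{\rm tun}[\phi] = -\frac{i}{4}\sum_x g_x\,\mathrm{tr}\big(\Lambda_x e^{-i\phi}\Lambda_d e^{i\phi}\big), \tag{11.75} \]

where $\Lambda_{x,d}$ is a matrix with energy dependence,

\[ \Lambda_{x,d} = 2i\begin{pmatrix} g^+ & g^K_{x,d} \\ 0 & g^- \end{pmatrix}, \qquad g^K_{x,d} = (g^+ - g^-)F_{x,d}, \tag{11.76} \]

the unit spectral density auxiliary “Green function” is defined by
$g^\pm_\epsilon = \int \frac{d\epsilon'}{2\pi}\frac{1}{\epsilon^\pm - \epsilon'}$, and

\[ g_x = 4\pi^2\nu_d\nu_x|T_x|^2 \tag{11.77} \]

defines the tunneling conductance between dot and lead $x = L/R$. Adding to this the
phase representation of the charging action in Eq. (11.73),

\[ S_c[\phi] = \int dt\,\Big(\frac{1}{2E_c}\,\partial_t\phi_c\,\partial_t\phi_q + N_0\,\partial_t\phi_q\Big), \tag{11.78} \]

we obtain a prototypical variant of the effective action in the phase representation,
$S[\phi] = S_c[\phi] + S_{\rm tun}[\phi]$.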

In the case of strong interactions, phase fluctuations are largely controlled by the tunneling
action. Specifically, in nearly insulated dots, $g_x \ll 1$, the phases fluctuate wildly, and no
longer represent suitable degrees of freedom. (Exercise: interpret this phenomenon in terms
of the voltages $V_{c,q} = \partial_t\phi_{c,q}$: why do voltage fluctuations in the nearly decoupled dot become
unimpeded?) As is usual in cases where a variable fluctuates out of control, this limit is better
described in terms of the canonically conjugate partner variable, which in the present context
carries the significance of a charge degree of freedom, $n$. (Indeed, the dot charge will fluctuate
only moderately in the limit of small $g_x$.)

Technically, the charge variable is introduced by a Hubbard-Stratonovich decoupling of
the charging term. This leads to the action in the phase-charge representation,

\[ S[n,\phi] = \int dt\,\big(n_c\,\partial_t\phi_q + n_q\,\partial_t\phi_c - 2E_c(n_c + N_0)\,n_q\big) + S_{\rm tun}[\phi]. \tag{11.79} \]

One may think of the above transformation as passing from a Lagrangian variant of the
theory - one variable, $\phi$, governed by an action with “velocity square” term $\dot\phi^2$ - to a
Hamiltonian action $\int (n\dot\phi - H(n,\phi))$ defined in the phase space of variables $(n,\phi)$. Notice
that the classical component $n_c$ is conjugate to the quantum $\phi_q$, and vice versa.
But what about charge quantization? We should expect charge in the isolated dot
$g_x \to 0$ to be quantized to integer values, but our action above does not appear to know
of this constraint. As in many other instances before, information on the quantization of $n$
resides in the boundary conditions of its conjugate variable $\phi$. So far, we have ignored
these boundary conditions. But as we are dealing with phases, we should be alert and expect
windings by multiples of $2\pi$ as $\phi_c$ and/or $\phi_q$ propagate along the Keldysh contour.

EXERCISE To explore the topology of the phase field, it is best to go back to the framework
of the closed Keldysh contour $s \in [0, S]$. It is on this contour that the Hubbard-Stratonovich
field $\phi(s)$ is defined in the first place. Being a phase field, it is defined only up to multiples of $2\pi$,
i.e. we have the boundary conditions $\phi(S) = \phi(0) + 2\pi W$, where the integer $W$ has to be summed
over (cf. our discussion of functional integrals over $S^1$-valued fields in Chapter 3). Implement the
condition above by the ansatz $\phi(s) = \tilde\phi(s) + \frac{2\pi W}{S}(s - T)$, where $\tilde\phi$ is periodic on the $s$-contour,
and we assume the real time contour to be parameterized by $t \in [0, T]$. Show that this condition
translates to an unconstrained field $\phi_c = \tilde\phi_c$, and to

+ (n-80) 

with Dirichlet conditions $\tilde\phi_q(0) = \tilde\phi_q(T) = 0$.

In the exercise above, it is shown that the information on phase windings resides in the
quantum field, $\phi_q$. Heuristically, quantization conditions for the field $n$ may then be deduced
as follows. Assuming that the winding number dependence of the tunneling action is negligible
(the rationale being that in the limit $T \to \infty$ the $W$-contribution to Eq. (11.80) is a
nearly constant phase shift which is likely to cancel out in a term probing the local tunneling
dynamics), we observe that $W$ enters the action through the contribution $2\pi W\,\frac{1}{T}\int_0^T dt\, n_c$.
Summation over $W$ then generates the condition $\frac{1}{T}\int_0^T dt\, n_c \in \mathbb{Z}$: the temporal average of
classical charge on the dot is integer. But since the operators $\sim \exp(\pm i\phi)$ in the action that
change the charge do so in discrete steps, $n_c \to n_c \pm 1/2$, and the numbers of positive
($\exp(i\phi)$) and negative ($\exp(-i\phi)$) jumps are equal, this condition calls for an initial charge
value $n_c(0) \in \mathbb{Z}$. In the course of time, this initial value may change, but it does so in integer
steps, i.e. the classical charge $n_c(t)$ is quantized.

EXERCISE If you find the argument above too vague, you may derive charge quantization
in rigorous terms. Do not ignore the $W$-dependence of the tunneling term. Rather, expand
$\exp(iS)$ schematically to $n$th order in $S_{\rm tun}$. The winding numbers then enter as a pure phase
$\exp(2\pi i\Gamma W)$, where $\Gamma$ is an expression involving $n_c$, and the discrete set of times at which
the phases of the tunneling terms are evaluated. Summation over $W$ generates the quantization
condition $\Gamma \in \mathbb{Z}$. Analyse $\Gamma$ to show that this condition is equivalent to the initial value
quantization $n_c(0) \in \mathbb{Z}$. In combination with the integer steps taken by the tunneling dynamics, this
rigorously proves charge quantization.

In effect, we may account for the windings of the phase fields by limiting the integration
path $n_c(t)$ to integer configurations, and thence forget about the winding numbers.
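The mechanism by which a sum over winding numbers enforces an integer constraint can be made concrete with a truncated sum. The sketch below evaluates $\sum_{W=-M}^{M} e^{2\pi i W x}$ for a real number $x$ (playing the role of the time-averaged charge); the cutoff `M` and the function name are assumptions made purely for illustration.

```python
import cmath
import math

def winding_sum(x, M):
    """Truncated winding-number sum: sum over W = -M..M of exp(2 pi i W x)."""
    total = sum(cmath.exp(2j * math.pi * W * x) for W in range(-M, M + 1))
    return total.real  # imaginary parts cancel pairwise between W and -W

at_integer = winding_sum(3.0, 50)       # all 2M+1 terms add coherently
at_half_integer = winding_sum(0.5, 50)  # terms alternate in sign and largely cancel
```

As `M` grows, the sum develops ever sharper peaks of height $2M+1$ at integer $x$ while staying of order one elsewhere, which is the discrete precursor of the constraint that the averaged charge be integer.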
11.6.2 Dot distribution function


Hidden in the action (11.79) sits the distribution function of the dot (cf. Eq. (11.75) and 
(11.76)). In principle, we should think about the distribution function as an observable of 
the theory, a quantity that determines the distribution of the particles on the dot after 
all degrees of freedom (including the interaction) have been integrated out. However, the 
results of that integration actually depend on the dot distribution function, i.e. the rigorous 
determination of the latter poses a recursive problem whose solution is likely out of reach. In 
practice, one will rather work with a trial distribution, which should of course be reasonably 
close to the actual distribution. 

In the present context, we will approximate the dot distribution by the distribution function
of the non-interacting dot. The rationale behind this ansatz is the assumption that the coupling to
the leads will adjust an effective distribution on the dot, at time scales $\sim 1/|T|^2$, faster than
the relaxation times due to interactions. The quality of such assumptions must of course be
checked in retrospect.


Trial dot distribution function 

But what will the distribution function on the dot, in the absence of interactions, look like?
There are various ways to answer this question. Heuristically, one may argue that all single
particle states on the dot will be occupied up to energy $-V/2$, i.e. up to the threshold energy
above which the band of the right lead begins to dry up (cf. the figure). At larger energies,
states on the dot are hybridized with isoenergetic filled states in the left lead, and empty
states in the right lead. These coupling mechanisms are mutually independent, and occur at a
strength $g_{L,R}$, respectively. This suggests that the distribution on the dot

\[ F_d = \frac{g_L F_L + g_R F_R}{g_L + g_R} \tag{11.81} \]

will be an additive superposition of the lead distributions. The corresponding distribution
function $n_d = (1 - F_d)/2$ assumes the form of a double step distribution (cf. the figure).

EXERCISE Substantiate the argument above: To this end, consider the tunneling operators T x 
as a “perturbation”, and compute the corresponding self energies. Then consider the kinetic 
equation (11.68). For a stationary solution F , the left-hand side of the equation vanishes. Show 
that the vanishing of the right-hand side enforces the double step distribution. 
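The double-step form is easy to tabulate. The sketch below evaluates Eq. (11.81) with thermal lead distributions $F_{L,R}(\epsilon) = \tanh((\epsilon \mp V/2)/2T)$, placing the left lead at chemical potential $+V/2$ and the right lead at $-V/2$ as in the heuristic argument above. The function names and the numerical parameter values are assumptions chosen for illustration only.

```python
import math

def lead_F(e, mu, T):
    """Equilibrium lead distribution F = 1 - 2 n_F = tanh((e - mu) / 2T)."""
    return math.tanh((e - mu) / (2.0 * T))

def dot_F(e, V, T, gL=1.0, gR=1.0):
    """Eq. (11.81): conductance-weighted superposition of the lead distributions."""
    return (gL * lead_F(e, V / 2, T) + gR * lead_F(e, -V / 2, T)) / (gL + gR)

def dot_n(e, V, T, gL=1.0, gR=1.0):
    """Occupation on the dot, n_d = (1 - F_d) / 2."""
    return 0.5 * (1.0 - dot_F(e, V, T, gL, gR))

V, T = 1.0, 0.01
n_below = dot_n(-2.0, V, T)                      # deep below both steps: filled
n_plateau = dot_n(0.0, V, T)                     # between the steps, symmetric contacts
n_plateau_asym = dot_n(0.0, V, T, gL=3.0, gR=1.0)  # asymmetric contacts
n_above = dot_n(2.0, V, T)                       # far above both steps: empty
```

Between the two steps the occupation sits at the plateau value $g_L/(g_L + g_R)$, which reduces to $1/2$ for contacts of equal transparency.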




[Figure: double-step distribution function on the dot, with steps at energies $-V/2$ and $V/2$.]
Readers who feel comfortable enough with these ways of fixing the distribution function
may advance to the next section. For the benefit of all others, we devote the rest of this
section to showing how the (non-interacting) distribution function may be obtained directly
from the effective action. To this end, we vary the tunneling action (11.75) in $\phi$ and adjust
$\Lambda_d$ such that the non-interacting configuration $\phi = 0$ is a stationary point. In the exercise
below, we show the equivalence of this procedure to the kinetic equation approach above.

We need to determine $\Lambda_d$ such that the linear expansion of the action in $\phi = \phi_c + \phi_q\sigma_1/2$
vanishes. It is straightforward to verify that variations in $\phi_c$ vanish regardless of
the distribution function. As for the quantum component, we generalize
$\phi_q\sigma_1 \to \phi_q \equiv \mathrm{Re}\,\phi_q\,\sigma_1 - \mathrm{Im}\,\phi_q\,\sigma_2$, which may be interpreted as a shift of the generator $\phi_q$ into the complex
plane. It is then a straightforward exercise to verify that the variation of the action $\delta_{\phi_q}S = 0$
generates equation (11.81). But what might be the logic behind this variational approach?
The answer is given in the following exercise:

EXERCISE The distribution function on the dot can be found by solution of an appropriate
kinetic equation, $(G_0^{-1} - \Sigma)G = 1$. In this exercise, we want to show that the component of
this matrix equation relevant to the determination of $F_d$ can be found by introduction of a
suitable source. To this end, let us go back to the microscopic variant of the theory, Eq. (11.73),
and generalize 1803 — » ^(5e ^ ^‘ ^ <73e -l '* > ' , in the dot action, Sd ■ Verify that the expansion of the 
action to first order in <j) q generates the Keldysh component of the kinetic equation. Under the 
assumption of constant spectral density (also underlying the derivation of the tunneling action), 
the vanishing of the linear variation becomes equivalent to the fulfilment of the kinetic equation. 
The stationary solution is given by Eq. (11.81). 

For notational simplicity, we assume contacts of equal transparency, 


\[ g_L = g_R \equiv g, \]


throughout the rest of the section. 


Tunneling action 

We may use the result Eq. (11.81) to bring the tunneling action into a form more manageable 
than the abstract representation Eq. (11.75). To this end, let us introduce the notation 

\[ c = e^{i\phi_c}\cos(\phi_q/2), \qquad s = e^{i\phi_c}\sin(\phi_q/2). \]

We substitute e l(t> = c. — iso\ and = c — isoi into Eq. (11.75) and obtain 
where (notice the similarity to the self-energy of a boson coupled to fermions Eq. (11.62)),


^ E + 9d,t-t'9t'-t) * 

%tLtt = \ E (9x,t'-t9d,t-t> ~ ( 9 + ~ 9~)t-t'(g + - g~)t’-t ) ■ 


Like so often, we focus on the damping contribution to the self energy, E + — E . Using 
( g + — g~) e = — i, we obtain 


(£+-E-)„ = ^ 


(F d , e -F x , e _ u ) = i-, 


(11.83) 


S - = | E/ ^ ~ = £ E ( w W + (w + sV)F b (u> + sV )) , 


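where we used Eq. (11.74) and the relations $\int \frac{d\epsilon}{2\pi}\,(F(\epsilon+\omega) - F(\epsilon)) = \frac{\omega}{\pi}$,
$\int \frac{d\epsilon}{2\pi}\,(1 - F(\epsilon-\omega)F(\epsilon)) = \frac{\omega}{\pi}F_b(\omega)$, with $F(\epsilon) = \tanh(\epsilon/2T)$, and $F_b(\omega) = \coth(\omega/2T)$.
Combining everything, we obtain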

J / S C aJ&j) (fa). ’ < u 84 > 

where $K(\omega) = \frac{1}{2}\,\omega F_b(\omega) + \frac{1}{4}\sum_{s=\pm}(\omega + sV)F_b(\omega + sV)$. The action Eq. (11.79) with the
tunneling contribution Eq. (11.84) is the basis of all our further discussion.
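The limiting forms of the noise kernel can be explored numerically. The sketch below assumes the form $K(\omega) = \frac{1}{2}\omega\coth(\omega/2T) + \frac{1}{4}\sum_{s=\pm}(\omega + sV)\coth((\omega + sV)/2T)$ for symmetric contacts, normalized such that $K(\omega) = \omega\coth(\omega/2T)$ at zero bias; the normalization, the function names and the test values are assumptions made for this illustration. It also checks the elementary identity $\omega(n(\omega) + 1/2) = \frac{\omega}{2}\coth(\omega/2T)$ for the Bose function $n(\omega)$.

```python
import math

def x_coth(x, T):
    """x * coth(x / 2T), including the limits T -> 0 (|x|) and x -> 0 (2T)."""
    if T == 0.0:
        return abs(x)
    if x == 0.0:
        return 2.0 * T
    return x / math.tanh(x / (2.0 * T))

def kernel_K(w, V, T):
    """Assumed noise kernel for symmetric contacts (see lead-in for the normalization)."""
    return 0.5 * x_coth(w, T) + 0.25 * (x_coth(w + V, T) + x_coth(w - V, T))

def oscillator_energy(w, T):
    """Mean energy w * (n_B(w) + 1/2) of a quantum oscillator of frequency w > 0."""
    n_bose = 1.0 / math.expm1(w / T)
    return w * (n_bose + 0.5)

equilibrium_zero_T = kernel_K(3.0, 0.0, 0.0)   # |w| at V = 0, T = 0
classical_limit = kernel_K(0.0, 0.0, 1.5)      # 2T at V = 0, w -> 0
shot_noise_limit = kernel_K(0.0, 2.0, 0.0)     # |V|/2 at T = 0, |w| << |V|
high_frequency = kernel_K(100.0, 2.0, 0.0)     # |w| at T = 0, |w| >> |V|
```

At zero temperature the kernel is piecewise linear in $\omega$, with kinks at $\omega = 0$ and $\omega = \pm V$; the bias only matters for $|\omega| < |V|$.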


11.6.3 Observables 

Before proceeding, we should choose an observable in terms of which the system’s behav- 
ior will be described. While the system’s total conductance (as opposed to the tunneling 
conductances of its individual dot-lead interfaces) might be the first option coming to one’s 
mind, we here focus on the dot tunneling density of states (TDoS). The latter is
defined as$^{20}$

\[ \nu(\epsilon) = -\frac{1}{\pi}\,\mathrm{Im}\,\mathrm{tr}\,(G^+(\epsilon)) = -\frac{1}{\pi}\,\mathrm{Im}\int dt\, e^{i\epsilon t}\,\mathrm{tr}\,(G^+(t)). \tag{11.85} \]

What makes the TDoS an interesting observable to consider? 

> The TDoS can be measured by tunneling spectroscopy. See Fig. 11.4 for an example of 
local probes of the TDoS in a mesoscopic superconductor-normal-superconductor (SNS) 
wire. (Barely discernible in the figure is the straight normal wire, adjacent to a horseshoe 
shaped superconductor.) In this system, the TDoS is suppressed due to the superconduc- 
tor proximity effect, the effect that a superconductor tends to induce pair correlations 
leading to a spectral gap in the adjacent normal metallic regions. The size of the gap as 
a function of voltage (energy) diminishes with the distance from the superconductor. 


20 The notation assumes a steady state configuration. In principle, $\nu(\epsilon,t)$ may be explicitly time dependent, via
temporal changes in distribution functions.
Figure 11.4 Tunneling density of states as a function of voltage (energy) in a mesoscopic SNS
structure. The arrows mark the position where, in the normal wire, the TDoS was recorded. Fig- 
ure taken from H. le Sueur, P. Joyez, H. Pothier, C. Urbina, and D. Esteve, Phase controlled 
superconducting proximity effect probed by tunneling spectroscopy, Phys. Rev. Lett. 100 , 197002-6 
(2008). 


> At the same time, the TDoS is an observable of considerable conceptual interest. As
is obvious from its definition, the function $\nu(\epsilon)$ probes the amplitude of quasiparticle
propagation in the system at time scales $t \sim \epsilon^{-1}$. This propagation amplitude comes with
a quantum dynamical phase. Out of thermal equilibrium, we expect this quantum phase
to be susceptible to all kinds of nonequilibrium fluctuations. In this sense, the TDoS
is an observable very close to the central theme of this chapter.

More generally, the TDoS probes the general mechanism of the orthogonality catas- 
trophe : the quantum state “bare quasiparticle plus unperturbed system” forming right 
after the tunneling event will typically be very different (“orthogonal”) from the asymp- 
totic stationary state in which the incoming particle has been accommodated into the 
system. The formation of that final state generally involves the re-adjustment of a large 
number of particles and may take much more time than, say, the tunneling process as 
such. This means that in strongly interacting systems, the TDoS is expected to show 
significant structure at small energy scales - the zero bias anomaly. 

In fact, the Coulomb blockade discussed in Problem 6.7 can be viewed as an extreme
manifestation of the zero bias anomaly: tunneling onto an isolated quantum dot creates a
state that is energetically forbidden, or “sub-barrier,” by an amount $E_c$. This means that
the TDoS vanishes over an interval from $\epsilon = 0$ to $\epsilon = E_c$. (Recall that energy is measured
from the Fermi energy.) Equivalently, we may say that tunneling processes onto the dot
are limited to short duration $t \sim E_c^{-1}$. Tunneling into a metallic system leads to more
intricate behavior. The tunneling process generates an initial charge distribution which
is energetically unfavourable (sub-barrier). In the course of time, that initial distribution
will relax in what typically will be a diffusive spreading process. The action associated
with that process weights the quasiparticle propagator and hence the TDoS. For example,
in a two-dimensional setting, the long time relaxation action diverges,$^{21}$ which implies
singular behavior of the TDoS at low energies.

The open quantum dot discussed in this section forms a compromise between a closed dot
and a bulk metallic system. Below we try to understand the TDoS of this system, and
the ramifications of nonequilibrium fluctuations. However, before doing so, we first need to
express $\nu(\epsilon)$ in terms of our effective degrees of freedom. Fortunately, this is a straightforward
task. It turns out to be best to work in the representation of the theory prior to the Keldysh
rotation. Comparison with the formulae in Section 11.4.1 shows that

m = - ^ / dte iet ( G"+(t ) - G+~(t )) . 

To give the Green functions G ± some meaning, we just need to note two things: in the pres- 
ence of interactions G“+ = -f § = e iW-(t)-*+(t'))<3-+ , 

where Go is the non-interacting Green function. For a single level, its form is shown in 
Eq. (11.52). For our many level dot this generalizes to 

G o,w = -* / dev(e)e- ie{t ~ t '\l -n d (e)) 

~ — 2'Kiv d (l — n d )(t — t'), 

where $n_d(t)$ is the Fourier transform of the function $n_d(\epsilon)$ and we assumed constant
non-interacting density of states $\nu_d$ as before.$^{22}$ Combining this with the analogous construction
for the Green function $G^{+-}$, we obtain $\nu(\epsilon) = \nu_e(\epsilon) + \nu_h(\epsilon)$, where

v e (e) =u d Re J dte i(t e^° ^ ^^(l - n d )(t), 

u h (e) = u d Re j ° W) n d (t). (11.86) 

Here $\nu_{e/h}$ are to be interpreted as the contributions of electrons (holes) tunneling onto the
dot, and $t$ is an arbitrary reference time marking the beginning of the tunneling process.
The hole contribution, $\nu_h$, differs from $\nu_e$ in the replacement $(1 - n_d) \to n_d$, and a sign
change in the phase of the oscillatory exponential.


11.6.4 Open quantum dot 

In this section, we study the TDoS at large values of the tunneling conductance, $g \gg 1$.
In this limit, fluctuations of the phase $\phi$ are largely quenched. Conversely, the conjugate
degree of freedom, the charge $n$, fluctuates strongly - the open quantum dot admits large
variations of charges and currents. The discussion of the open system is rewarding in that
21 L.S. Levitov and A. V. Shytov, Semiclassical theory of the Coulomb anomaly, Pisma Zh. Eksp. Teor. Fiz. 66, 
200-205 (1997) [JETP Lett. 66, 214-221 (1997)]. 

22 Notice that $(1 - n_d)(t) = \delta(t) - n_d(t)$ is to be understood as the Fourier transform of $1 - n_d(\epsilon)$.
it enables us to bridge between classical and quantum phenomena. On the one hand, the
smallness of (quantum) fluctuations of the phase means that we stay close to the classical
limit; in many respects, our system behaves like a classical RC-resistor unit, kept at voltage
$\sim \dot\phi$. On the other hand, Eq. (11.86) demonstrates that the same phase determines the
quantum mechanics of the observable $\nu$. Indeed, we discover that the physics of the
orthogonality catastrophe in the system can largely be understood in terms of classical voltage
fluctuations.

At large $g$, the tunneling action may be expanded to second order in $\phi$ around its
stationary configuration $\phi = 0$.$^{23}$ As a result of this expansion we obtain


\[ S^{(2)}[\phi] = \frac{1}{2E_c}\int dt\,\partial_t\phi_c\,\partial_t\phi_q + \frac{g}{2\pi}\int \frac{d\omega}{2\pi}\,(\phi_c, \phi_q)_{-\omega}\begin{pmatrix} 0 & -i\omega \\ i\omega & iK(\omega) \end{pmatrix}\begin{pmatrix} \phi_c \\ \phi_q \end{pmatrix}_{\omega}, \tag{11.87} \]

where the kernel $K$ is defined in Eq. (11.84) and we used the phase representation of the
charging action Eq. (11.78). We have also neglected the $N_0$-contribution to the charging
action, the formal reason being that for large $g$, topologically excited (finite winding numbers
$W$) configurations of the phase field do not play a role. However, for $W = 0$, $N_0\int dt\,\partial_t\phi_q = 0$.
(Exercise: what is the physical reason for the irrelevancy of the $N_0$-term in the widely
open quantum dot?)

Except for the structure of the kernel $K$, this action is reminiscent of the standard
MSRJD-actions describing dissipative classical systems. To understand the ramifications of
this connection, we go one step backwards in the direction of classical physics and decouple
the $\phi_q\phi_q$-term to obtain a Langevin-type system:

\[ S[\phi,\xi] = \int dt\,\phi_q\Big[\Big(-\frac{1}{2E_c}\partial_t^2 - \frac{g}{\pi}\partial_t\Big)\phi_c + \xi\Big]. \tag{11.88} \]


Classical resistor network 


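[Figure: classical resistor network equivalent to the quantum dot; each of the two resistors is labelled $2\pi/g$.]

Integration over $\phi_q$ in Eq. (11.88) freezes the classical field to the configuration solving
the equation

\[ \Big(\frac{1}{2E_c}\partial_t^2 + \frac{2g}{2\pi}\partial_t\Big)\phi_c = \xi, \]

where the noise field $\xi$ is correlated according to

\[ \langle\xi(t)\xi(t')\rangle = \frac{g}{\pi}K(t - t'), \]

and $K(t) = \int \frac{d\omega}{2\pi}\,e^{-i\omega t}K(\omega)$ is the Fourier transform of the kernel $K$. This is the Langevin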
equation for voltage fluctuations on the classical resistor network equivalent to our quantum 
dot. The network is depicted in the figure where the central node represents the quantum 
dot, coupled by two resistors of resistance R = g~ l to a bias voltage source. The dot is also 


23 Anharmonic fluctuations are suppressed in $g^{-1}$, but weakly singular in the infrared time scales of the theory.
This indicates scaling of the coupling constants ( g , E c ) of the theory, which, however, we do not discuss in this 
text. 
coupled to a capacitor of capacitance $C = 1/2E_c$. That capacitor is kept at a gate voltage
$V_g = N_0/C$ (which, however, we saw is irrelevant to the physics of the open dot).

To make the connection to classical circuit dynamics more explicit, we change notation
in the Langevin equation above according to$^{24}$

\[ g = \frac{2\pi}{R}, \qquad E_c = \frac{1}{2C}, \qquad U = \partial_t\phi_c, \qquad \xi = C\eta. \]

The Langevin equation describing voltage fluctuations off the static value $V/2$ then
assumes the form

\[ \partial_t U + \gamma U = \eta, \tag{11.89} \]

where $\langle\eta(t)\eta(t')\rangle = \frac{\gamma}{C}K(t - t')$ and we have introduced the parameter $\gamma = 2/RC$. The
meaning of this equation is not difficult to understand: in the absence of fluctuations, $\eta = 0$,
the voltage on the dot relaxes to its stationary value, $V/2$, at the RC-time scale
characteristic for a capacitive shunt. (The relaxation rate is actually $2/RC$, because what
matters is the resistance $R/2$ of a dot shunted by two parallel resistors.) In contrast to our
previous studies of dissipative systems, the coupling to two distinct leads (“baths”) implies
that the voltage distribution on the dot cannot relax to an equilibrium configuration.
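To get a feeling for the Langevin equation $\partial_t U + \gamma U = \eta$, one may discretize it with a simple Euler-Maruyama step and follow the mean and the variance of $U$ deterministically. The sketch below assumes the unbiased, classical white-noise limit $\langle\eta(t)\eta(t')\rangle = (2\gamma T/C)\,\delta(t - t')$, for which the stationary variance should approach $T/C$; this noise strength, the step size and all names are assumptions made for the illustration, not a treatment of the full kernel $K$.

```python
import math

def relaxed_mean(U0, R, C, t, dt=1e-3):
    """Noise-averaged voltage after time t under the Euler step U -> U - gamma*U*dt."""
    gamma = 2.0 / (R * C)  # two resistors R in parallel shunt the capacitor C
    U = U0
    for _ in range(int(round(t / dt))):
        U -= gamma * U * dt
    return U

def stationary_variance(R, C, T, dt=1e-3, steps=20000):
    """Variance of U under Euler-Maruyama with <eta eta> = 2*D*delta, D = gamma*T/C."""
    gamma = 2.0 / (R * C)
    D = gamma * T / C
    var = 0.0
    for _ in range(steps):
        # One Euler-Maruyama step: the deterministic part contracts the variance,
        # the white-noise increment adds 2*D*dt.
        var = (1.0 - gamma * dt) ** 2 * var + 2.0 * D * dt
    return var

mean_after_one_rc = relaxed_mean(1.0, R=2.0, C=1.0, t=1.0)  # gamma = 1: expect ~ exp(-1)
variance = stationary_variance(R=2.0, C=1.0, T=0.5)         # expect ~ T / C = 0.5
```

The stationary variance $T/C$ corresponds to a mean capacitive energy $C\langle U^2\rangle/2 = T/2$, as required by equipartition.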

To explore this point, let us take a close look at the fluctuation correlator, K: 

t> In the absence of biasing, V = 0, and in the classical limit {K)ui — > 0, we have 
K(u>) = wcoth(u;/2T) 2 T, and ( rj(t)ri(t ')) = — t'). Comparison with our 

discussion of Langevin equations in the previous chapter then shows that the steady state 
distribution controlling the voltage on the dot takes the form 

P(V) =Afe . 

This is the Maxwell— Boltzmann distribution for a system with capacitive energy 
U 2 C/ 2. 

> Still assuming zero bias, $V = 0$, let us now explore the zero temperature limit. In
this case, $K(\omega) = \omega\coth(\omega/2T) \to |\omega|$. To understand the meaning of this limit, we
may interpret the voltage fluctuations as due to the fluctuations of a large assembly
of oscillators of arbitrary frequency $\omega$. (In "reality" the fluctuations represent particle-hole
excitations in the Fermi liquids constituting the system. However, for the sake of
the argument, that concrete realization is not important.) In the classical limit $T \gg \omega$,
each spectral component stores fluctuation energy $T$ (the equipartition theorem), which
explains the above classical correlator. However, in quantum mechanics, the energy reads
$\omega(n(\omega) + 1/2) = \frac{\omega}{2}\coth(\omega/2T)$, where $n(\omega)$ is the Bose distribution. This means that at small temperatures $T$, it gets replaced
by $|\omega|/2$. Also notice that for $\omega > 0$ the zero temperature energy is due to the factor
1/2 in the energy balance: the observable fluctuations are due to vacuum oscillations
of the quantum baths. As discussed in the previous chapter, the equilibrium oscillator
fluctuations leading to the statistics above are called Nyquist-Johnson noise.

24 The factor of $2\pi$ in the definition of $R$ is explained as follows: $g$ is the dimensionless conductance of the tunnel
barriers. The physical conductance is $G = R^{-1} = g e^2/h = (g/2\pi)\, e^2/\hbar$. In our units $e^2 = \hbar = 1$ and this reduces to
$g = 2\pi/R$.
> We next consider the consequences of finite biasing, $V > 0$. For simplicity we keep the
temperature at zero, $T = 0$. Inspection of Eq. (11.84) then reveals a crossover,

$$ K(\omega) = \begin{cases} |\omega|, & |\omega| \gg |V|, \\ |V|/2, & |\omega| \ll |V|. \end{cases} $$

The interpretation of this crossover is as follows: At large frequencies, $|\omega| \gg |V|$, noise is
dominated by the equilibrium fluctuations of high frequency oscillator modes. However, 
at lower frequencies, we start sensing the distortion out of equilibrium, due to the biasing 
of the dot. The latter causes a finite mean current flow through the system. This in turn 
generates noise, which will be predominantly shot noise. 
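The two equilibrium limits of the noise kernel discussed above can be made tangible with a few lines of code. The following is a minimal sketch, assuming units $\hbar = k_B = 1$; the function name `equilibrium_noise_kernel` is ours and purely illustrative. It evaluates $K(\omega) = \omega\coth(\omega/2T)$ and treats the limits $T = 0$ and $\omega = 0$ separately.

```python
import math


def equilibrium_noise_kernel(omega: float, temperature: float) -> float:
    """Unbiased (V = 0) noise kernel K(omega) = omega * coth(omega / 2T).

    Units hbar = k_B = 1 are assumed. The special cases avoid 0/0:
    T = 0 leaves only zero-point fluctuations, K = |omega|, and
    omega = 0 gives the classical (Nyquist-Johnson) value K = 2T.
    """
    if temperature == 0.0:
        return abs(omega)
    if omega == 0.0:
        return 2.0 * temperature
    return omega / math.tanh(omega / (2.0 * temperature))


if __name__ == "__main__":
    # Sweep from the classical regime (omega << T) to the quantum regime (omega >> T).
    for omega in (1e-3, 0.1, 1.0, 10.0, 100.0):
        print(omega, equilibrium_noise_kernel(omega, temperature=1.0))
```

For $\omega \ll T$ the printed values approach $2T$, for $\omega \gg T$ they approach $|\omega|$, in line with the classical and the zero temperature limit of the text.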

In the previous chapter we argued that if we define the current passing through an
elementary resistor in a time window $[t, t + \Delta t]$ as $I_{\Delta t} = n/\Delta t$ in terms of the number of
charges, $n$, passing through in a time $\Delta t$, the statistical variance of the current (incident
charges assumed Poisson distributed) will be given by $\mathrm{var}(I_{\Delta t}) = \bar I/\Delta t$, where $\bar I = \langle I_{\Delta t} \rangle$
is the statistical average of the current. We may write the instantaneous current through
the resistor as $I = \bar I + \delta I$, where $\langle \delta I(t)\delta I(t') \rangle = f\,\delta(t - t')$ describes short range correlated
current fluctuations. Computing the variance,

$$ \mathrm{var}(I_{\Delta t}) = \frac{1}{\Delta t^2}\int_t^{t+\Delta t} dt'\,dt''\, \langle \delta I(t')\delta I(t'') \rangle = \frac{f}{\Delta t}, $$

we find that $f = \bar I$ establishes compatibility with Poissonian shot noise.
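The relation $\mathrm{var}(I_{\Delta t}) = \bar I/\Delta t$ can be checked directly from the Poisson distribution of the charge count. The sketch below is an illustration under the stated assumption of Poissonian counting (unit charges, so that the mean count is $\bar I\,\Delta t$); the helper names are hypothetical and not taken from the text.

```python
import math


def poisson_pmf(n: int, mean: float) -> float:
    """Poisson probability p(n), evaluated in log space to avoid overflow."""
    return math.exp(-mean + n * math.log(mean) - math.lgamma(n + 1))


def windowed_current_statistics(mean_current: float, window: float, n_max: int = 400):
    """Mean and variance of I = n / window for Poisson-distributed counts n.

    The mean count is mean_current * window; the sum over n is truncated
    at n_max, which must lie far in the tail of the distribution.
    """
    mean_count = mean_current * window
    probs = [poisson_pmf(n, mean_count) for n in range(n_max + 1)]
    avg_n = sum(n * p for n, p in enumerate(probs))
    var_n = sum((n - avg_n) ** 2 * p for n, p in enumerate(probs))
    return avg_n / window, var_n / window ** 2


if __name__ == "__main__":
    mean_i, var_i = windowed_current_statistics(mean_current=5.0, window=2.0)
    print(mean_i, var_i)  # the variance should equal mean_current / window
```

Doubling the window halves the variance of the windowed current while leaving its mean unchanged, which is the statement $f = \bar I$ in the language of the correlator.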

We now relate these findings to the Langevin equation describing our more complex 
two-resistor system. To this end we rewrite the latter as 

$$ C d_t U = -\frac{2}{R} U + C\eta, \qquad (11.90) $$

i.e. an equation that relates the rate of change of the charge on the dot (left-hand side)
to current flow (right-hand side). Specifically, $C\eta = \delta I_L + \delta I_R$ is to be interpreted as
the sum of current fluctuations through the left and right tunnel resistor. Each of the
two contributions $\delta I_x$, $x = L, R$, is expected to exhibit Poissonian shot noise statistics,
$\langle \delta I_x(t)\delta I_x(t') \rangle = \bar I\,\delta(t - t') = \frac{V}{2R}\delta(t - t')$. Compatibility with our considerations above
then requires

$$ \langle \eta(t)\eta(t') \rangle = \frac{1}{C^2}\left( \langle \delta I_L(t)\delta I_L(t') \rangle + \langle \delta I_R(t)\delta I_R(t') \rangle \right) = \frac{V}{RC^2}\,\delta(t - t'), $$

in agreement with Eq. (11.89) and $K(t - t') = (V/2)\delta(t - t')$: the noise in our system is
obtained by superposition of the shot noise generated by the two resistive tunnel barriers.

INFO The analysis of noise levels in compounds comprising several elementary Poissonian 
noise sources plays an important role in nonequilibrium mesoscopic physics. It is custom- 
ary to quantify the noise in a composite system by a parameter known as the Fano factor. 
The Fano factor relates the DC-noise power in the system,

$$ S = 2\int dt\, \langle \delta I(0)\delta I(t) \rangle, \qquad (11.91) $$

to the noise power, $S_0$, of an elementary resistor with Poissonian statistics,

$$ F \equiv \frac{S}{S_0}. $$

Here $\delta I$ is the fluctuation contribution to the current in the system. To make these definitions
more concrete, consider the elementary RC-unit shown in Fig. 10.2, first in the absence of
external biasing, $V = 0$, and at temperature $T$. The Langevin equation controlling current
flow in the system reads $C d_t U = -R^{-1}U + \delta I$. The FDT requires that $\langle \delta I(t)\delta I(t') \rangle = \frac{2T}{R}\delta(t - t')$,
which means that, in equilibrium, $S_{eq} = 4T/R$. Now consider the complementary case of zero
temperature, $T = 0$, but finite bias $V$ across the resistor. In this case, the Poissonian statistics
of charge transmission requires (cf. the discussion above) $\langle \delta I(t)\delta I(t') \rangle = \bar I\,\delta(t - t') = \frac{U}{R}\delta(t - t')$. This means
that

$$ S_0 = 2\bar I. $$

What is the Fano factor of the double barrier quantum dot discussed above? We define $I = U/R$
as the current whose statistics we wish to characterize. Comparison with Eq. (11.90) shows
that $\delta I = C\eta/2$. From this identification and Eq. (11.89) we find that, in equilibrium, the noise
power is $S = 2T/R$. This is the noise of a unit with total (series) resistance $R_{tot} = 2R$. Out of
equilibrium, the correlator $\langle \delta I(t)\delta I(t') \rangle = \frac{V}{4R}\delta(t - t')$ implies $S = V/2R = V/R_{tot} = \bar I = S_0/2$,
or $F = 1/2$. The system exhibits fluctuations lower than those of a single Poissonian resistor.
The reason for this reduction is that the current fluctuations $\delta I = (\delta I_L + \delta I_R)/2$ are obtained
by superposition of two statistically independent noise sources.
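The arithmetic behind the value $F = 1/2$ is short enough to spell out in code. The following sketch only restates the bookkeeping of the text: each barrier is assumed to contribute white Poissonian noise of weight $\bar I = V/2R$, the two sources are assumed independent, and $\delta I = (\delta I_L + \delta I_R)/2$. The function names are illustrative.

```python
def dc_noise_power(correlator_weight: float) -> float:
    """S = 2 * int dt <dI(0) dI(t)> for a white correlator weight * delta(t)."""
    return 2.0 * correlator_weight


def fano_factor_double_barrier(voltage: float, resistance: float) -> float:
    """Fano factor of two identical tunnel barriers (resistance R each) in series."""
    mean_current = voltage / (2.0 * resistance)   # total series resistance is 2R
    barrier_weight = mean_current                 # Poissonian: <dI_x dI_x> = I delta(t - t')
    # dI = (dI_L + dI_R) / 2: independent sources add, the prefactor enters squared.
    weight = 0.25 * (barrier_weight + barrier_weight)
    s = dc_noise_power(weight)
    s0 = dc_noise_power(mean_current)             # single Poissonian resistor, same current
    return s / s0


if __name__ == "__main__":
    print(fano_factor_double_barrier(voltage=1.0, resistance=3.0))
```

The result is independent of $V$ and $R$, as it must be for a ratio of two noise powers that both scale linearly with the mean current.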


Zero bias anomaly 

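We next wish to explore the ramifications of the resistor voltage fluctuations in the quantum
mechanics of the TDoS. To this end, consider the electron contribution $\nu_e$ to the density of
states in Eq. (11.86). Using the fact that $\phi_c(\bar t + t) - \phi_c(\bar t) = \int_{\bar t}^{\bar t + t} dt'\, U(t')$, we can rewrite
this expression as

$$ \nu_e(\epsilon) = \nu_d\, \mathrm{Re} \int dt\, e^{i\epsilon t} \left\langle e^{i\int_{\bar t}^{\bar t + t} dt'\, U(t')}\, e^{\frac{i}{2}(\phi_q(\bar t + t) + \phi_q(\bar t))} \right\rangle (1 - n_d)(t) $$
$$ \qquad = \nu_d\, \mathrm{Re} \int dt\, e^{i\epsilon t} \left\langle e^{i\int_{\bar t}^{\bar t + t} dt'\, U_\eta(t')} \right\rangle_\eta (1 - n_d)(t), $$

where $U_\eta$ is the solution to the differential equation

$$ (d_{t'} + \gamma)\, U_\eta(t') = \eta + \frac{1}{2C}\left( \delta(t' - (\bar t + t)) + \delta(t' - \bar t) \right). $$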

Once more, this equation is easy to interpret. What is new in comparison to the Langevin
equation (11.89) describing the source-free action is the presence of the two $\delta$-functions on
the right-hand side. These $\delta$-functions reflect a unit-jump in the classical charge upon the
entry of a particle (amplitude) in the dot. The voltage on the dot is thus driven by the
superposition of the noise $\eta$, and the entry peaks. In response it relaxes, as described by the
left-hand side of the Langevin equation. There is no reason to be puzzled by the appearance
of two $\delta$-functions; depending on the sign of $t$, only one of them contributes to the profile
of $U_\eta$ in the time window $[\bar t, \bar t + t]$ (or $[\bar t + t, \bar t]$ for negative $t$). The reason is that the time
evolution of $U$ is retarded, i.e. the equation describes the forward evolution of $U$ in causal
response to driving sources. For example, for $t > 0$, only $\delta(t' - \bar t)$ affects the evolution in
the time window $[\bar t, \bar t + t]$.

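We may interpret the phase $e^{i\int_{\bar t}^{\bar t + t} dt'\, U_\eta(t')}$ as the "Feynman propagator" of the tunneling
amplitude. Our dot is largely structureless, i.e. the only contribution to the action of the
particle is the fluctuating voltage $U_\eta$. To understand its effect, we need to average the phase
over noise fluctuations. This is most efficiently done by substitution of the Fourier transform
of $U_\eta$,

$$ U_\eta(\omega) = \frac{1}{-i\omega + \gamma}\left( \eta(\omega) + \frac{1}{2C}\left( e^{i\omega(\bar t + t)} + e^{i\omega \bar t} \right) \right), $$

and performing the Gaussian integral over $\eta(\omega)$ using

$$ \langle \eta(\omega)\eta(\omega') \rangle = 2\pi\delta(\omega + \omega')\, \frac{2}{RC^2}\, K(\omega). $$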

As a result of a straightforward but not particularly illuminating calculation, we then obtain

$$ \nu_e(\epsilon) = \nu_d\, \mathrm{Re} \int dt\, e^{i\epsilon t}\, e^{\frac{i}{2\gamma C}\,\mathrm{sgn}(t)\,(1 - e^{-\gamma |t|})}\, e^{-S(t)}\, (1 - n_d)(t), $$

where the noise action

$$ S(t) = \frac{\gamma}{C} \int \frac{d\omega}{2\pi}\, \frac{1 - \cos(\omega t)}{\omega^2(\omega^2 + \gamma^2)}\, K(\omega), \qquad (11.93) $$

and the oscillatory exponential describes the decay of the entry voltage peak on the RC-time
scale. The integral form of $S(t)$ reflects the finite time interval over which the voltage
fluctuations are monitored (the factor $\omega^{-2}(1 - \cos(\omega t))$), the RC-retarded nature of the voltage
response to fluctuations (the factor $(\omega^2 + \gamma^2)^{-1}$), and the statistics of the fluctuations
themselves, $K(\omega)$. The hole contribution, $\nu_h$, differs from $\nu_e$ by a sign change in the oscillatory
phase, and in the replacement $1 - n_d \to n_d$. Adding the two contributions and using
the fact that, in the open limit, the largeness of the dimensionless parameter $\gamma C = 2/R \gg 1$
permits a linearization in the exponential phase, we obtain


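$$ \nu(\epsilon) = \nu_d \left[ 1 - \frac{1}{2\pi\gamma C} \sum_{\sigma = \pm} \int_0^\infty dt\, \frac{\cos((\epsilon + \sigma V/2)t)}{t} \left( 1 - e^{-\gamma t} \right) e^{-S(t)} \right], \qquad (11.94) $$

where $\sum_{\sigma} e^{i\sigma V t/2}/(4\pi i t)$ is the Fourier transform of the dot distribution function Eq. (11.81)
at zero temperature and equal barrier heights.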

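In the unbiased limit, $V = 0$, the noise action can be estimated as

$$ S(t) \overset{V=0}{\simeq} \frac{1}{2\pi\gamma C} \ln\left( 1 + (\gamma t)^2 \right). $$

This result states that at large time scales, a conspiracy of RC-relaxation and noise fluctuations
leads to a logarithmically diverging action for particle tunneling processes. The
consequences become evident once we substitute $S(t)$ into Eq. (11.94) for the tunneling
DoS. Noting that the factors $(1 - e^{-\gamma t})$ and $\cos(\epsilon t)$ act as short-time (ultraviolet) and long-time (infrared) cutoffs,
respectively, we obtain

$$ \nu(\epsilon) \simeq \nu_d \left[ 1 - \frac{1}{\pi\gamma C} \int_{\gamma^{-1}}^{|\epsilon|^{-1}} \frac{dt}{t}\, (\gamma t)^{-1/\pi\gamma C} \right], $$

or, using the definition of $\gamma$ and $R$ (i.e. $\pi\gamma C = 2\pi/R = g$),

$$ \nu(\epsilon) \overset{V=0,\ \epsilon \ll 1/RC}{\simeq} \nu_d \left( \frac{\pi C |\epsilon|}{g} \right)^{1/g} = \nu_d \left( \frac{|\epsilon|}{\gamma} \right)^{1/g}. \qquad (11.95) $$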


This is the celebrated zero bias anomaly. The accumulation 
of tunneling action at large time scales leads to a complete 
suppression of the tunneling DoS at small energies. Notice 
that the vanishing of the DoS at zero energy is a result of the 
weak accumulation of tunneling action, a phenomenon we may 
interpret as the long term maintenance of quantum coherence 
in the system. 
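The step from the logarithmic action to the power law can be checked numerically. The sketch below rests on assumptions that should be read as such: the action is taken in its asymptotic form, $e^{-S(t)} \simeq (\gamma t)^{-1/g}$ with $g = \pi\gamma C$, the cutoff factors are replaced by hard limits $1/\gamma$ and $1/|\epsilon|$, and the ratio $\nu/\nu_d = 1 - g^{-1}\int_{1/\gamma}^{1/|\epsilon|} (dt/t)\,(\gamma t)^{-1/g}$ is evaluated by a midpoint rule in the variable $u = \ln(\gamma t)$. The function name is hypothetical.

```python
import math


def tunneling_dos_ratio(energy: float, gamma: float, g: float, steps: int = 20000) -> float:
    """nu / nu_d from the hard-cutoff integral, for 0 < |energy| < gamma.

    With u = ln(gamma * t) the integral becomes int_0^{ln(gamma/|energy|)} du exp(-u / g),
    which is evaluated here with the midpoint rule.
    """
    upper = math.log(gamma / abs(energy))
    du = upper / steps
    integral = sum(math.exp(-(k + 0.5) * du / g) for k in range(steps)) * du
    return 1.0 - integral / g


if __name__ == "__main__":
    gamma, g = 1.0, 5.0
    for energy in (1e-1, 1e-2, 1e-3, 1e-4):
        numeric = tunneling_dos_ratio(energy, gamma, g)
        power_law = (energy / gamma) ** (1.0 / g)
        print(energy, numeric, power_law)
```

Within these assumptions the numerical integral and the power law $(|\epsilon|/\gamma)^{1/g}$ agree to the accuracy of the quadrature; for large $g$ the suppression sets in only at exponentially small energies.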

How will this picture change as we turn on a finite bias 
voltage to drive the system out of equilibrium? A glance at 
Eq. (11.94) shows that the voltage intervenes in two different ways. First, it splits the centre
energy of the zero bias anomaly into two, $0 \to \pm V/2$. This reflects the shift in energy of
the step distribution on the dot, relative to the Fermi distributions in the leads. Second,
the voltage enhances the tunneling action. For example, at low energies, $\epsilon \ll V$, the action
acquires a contribution $\propto (\gamma/C) \int_{|t|^{-1}} d\omega\, V/(\omega^2(\omega^2 + \gamma^2)) \propto RV|t|$. This increases much more
case. As a consequence, the coherent suppression of the DoS gets reduced and the zero bias 
anomaly peaks become less pronounced. In the figure above, the resulting profile for the 
DoS is shown for a few values of the bias voltage. 

The discussion of this suppressive mechanism can be carried quite a bit further. For 
instance, the rounding of the zero bias anomaly peaks can be used as a means to define 
a nonequilibrium dephasing rate, or one may study the role of the voltage biasing in 
the complementary regime of low tunneling conductances $g \ll 1$ (where it turns out to
be considerably less pronounced, cf. Problem 11.9.3). Here, however, we will not enter this
more detailed discussion. Instead, and in view of the relatively technical character of this 
section, let us take a step back and summarize what we have found: 

> The TDoS is a very “quantum mechanical observable.” Its low energy profile is determined 
by long time matter wave coherence, as described by the appropriate Feynman path 
integral. 

> Nonetheless, the TDoS is susceptible to classical fluctuations in the environment. The 
latter affect the classical action of the tunneling particle and, thus, the phase of the 
quantum mechanical propagator. 

> In equilibrium and at zero temperature, these fluctuations (then, zero point fluctuations) 
do not prevent the build-up of long range coherence, a phenomenon that reflects, e.g., in 
the build-up of the zero bias anomaly and a complete suppression of the TDoS. However,
the installment of nonequilibrium conditions, even steady state nonequilibrium, goes along 
with increased noise levels, and this tends to suppress quantum coherence. 

> Technically, the noise (Langevin noise $\eta$, in our above discussion) couples through a variable
(voltage $U_\eta$) in a manner that encapsulates dynamical aspects of the system, and may
involve system specific retardation effects. In the case study above, we did not introduce 
this coupling mechanism by postulation of an external bath. Rather, both the collective 
variable of interest (voltage), and the noise sources emerged out of one microscopically 
defined system. Such noise generation mechanisms are typical for interacting many par- 
ticle systems that are projected onto the dynamics of a few collective variables. 


11.7 Full counting statistics 

Previously, we have seen that the statistics of currents may strongly affect the behavior 
of physical observables. It may also tell us about the internal structure of a system. For 
example, in the case study above, the noise level of currents (the Fano factor) signaled the 
presence of two tunneling barriers instead of just one. Statistical analysis of current fluctu- 
ations - both experimental and theoretical - has become a prominent tool of diagnostics to 
explore the microscopic nature of condensed matter systems through observable transport 
coefficients. The cumulative keyword for these types of analysis is full counting statistics 
(FCS). In this section we introduce the notion of full counting statistics and apply it to 
elementary examples. A brief survey of more advanced applications is given at the end of
the section. 25

INFO The concept of full (photon) counting statistics was introduced in quantum optics (for
a review, see e.g. L. Mandel and E. Wolf, Optical Coherence and Quantum Optics (Cambridge
University Press, 1995)). Only much later were these ideas transferred to the problem
of transport in condensed matter (cf. L. S. Levitov and G. B. Lesovik, Charge distribution
in quantum shot noise, Pis'ma Zh. Eksp. Teor. Fiz. 58, 225-30 (1993) [JETP Lett. 58, 230-5
(1993)]). The ramifications of this concept are included in the brief review at the end of the
section.


11.7.1 Generalities 


Suppose we wish to characterize transport through a quantum system in terms of the charge
transmitted in a time interval $[\bar t, \bar t + \Delta t]$,

$$ \hat N = \int_{\bar t}^{\bar t + \Delta t} dt\, \hat I(t), \qquad (11.96) $$


25 For more comprehensive reviews, see W. Belzig, Full counting statistics in quantum contacts, Proceedings of the
Summer School/Conference on Functional Nanostructures, Karlsruhe, 2003; L. S. Levitov, The statistical theory
of mesoscopic noise, Quantum Noise in Mesoscopic Physics: Proceedings of the NATO Advanced Research
Workshop, Delft, the Netherlands, 2002; and M. Kindermann and Yu. V. Nazarov, Full counting statistics in
electric circuits, in: Quantum Noise in Mesoscopic Physics, ed. by Yu. V. Nazarov, Kluwer (Dordrecht), p. 403
(2002).
where $\hat I$ is the operator measuring current flow through a section of the system. We may
generate the expectation value of this operator by differentiation with respect to a suitably
constructed source field. To this end, recall that the current density, $j(x)$, is obtained by
differentiation of the action of the system with respect to its vector potential, $A(x)$ (as
usual $x = (t, \mathbf{x})$ comprises space and time coordinates). Specifically, in the context of the
Keldysh field theory

$$ j(x) = -i \frac{\delta}{\delta A(x)}\Big|_{A=0} Z[A \otimes \sigma_3/2], $$

where the notation $Z[A \otimes \sigma_3/2]$ indicates that the action is minimally coupled to a purely
quantum vector potential (opposite signs on the two Keldysh contours). According to our
general discussion of the Keldysh contour in Section 11.2.1, the coupling of $A$ to $\sigma_3$ makes
it a suitable source variable. 26 Building on this definition, we may introduce a source
variable for currents by defining the vector potential,

$$ A(\mathbf{x}, t) = \chi(t)\, \mathbf{e}_\perp \int_S d^{d-1}x'\, \delta(\mathbf{x} - \mathbf{x}'), \qquad (11.97) $$

where the surface integral confines the support of $A$ to a planar section $S$ of the system, $\mathbf{e}_\perp$
is normal to $S$ (the generalization to curved sections is straightforward), and the definition
of the counting field,

$$ \chi(t) = \chi\, \Theta(t - \bar t)\, \Theta(\bar t + \Delta t - t), $$

implies a projection onto the counting time interval. Differentiation of $Z[A \otimes \sigma_3/2] \equiv Z(\chi)$
with respect to the source parameter $\chi$ then generates the relation

$$ -i \frac{\partial}{\partial \chi}\Big|_{\chi=0} Z(\chi) = \int_{\bar t}^{\bar t + \Delta t} dt \int_S d\mathbf{S} \cdot \langle j(\mathbf{x}, t) \rangle = \langle \hat N \rangle. $$

Further differentiation with respect to the source variable (and that is the prime advantage
of the above construction) readily generates moments of the current:

$$ \langle \hat N^n \rangle = (-i)^n\, \partial_\chi^n \big|_{\chi=0} Z(\chi). \qquad (11.98) $$

This formula identifies $Z(\chi)$ as the moment generating function and its logarithm

$$ \ln g(\chi) \equiv \ln Z(\chi), $$

as the cumulant generating function.
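How the generating function hands out moments and cumulants is easily illustrated for a discrete toy distribution. In the sketch below, the generating function is taken to be $Z(\chi) = \sum_n p(n)\, e^{i\chi n}$, which is an assumption consistent with Eq. (11.98), and the derivatives at $\chi = 0$ are approximated by central finite differences. All names are illustrative.

```python
import cmath


def generating_function(probabilities: dict, chi: float) -> complex:
    """Z(chi) = sum_n p(n) exp(i chi n) for a distribution given as {n: p(n)}."""
    return sum(p * cmath.exp(1j * chi * n) for n, p in probabilities.items())


def first_two_cumulants(probabilities: dict, h: float = 1e-4):
    """Mean and variance from ln Z by central finite differences at chi = 0.

    mean = -i d/dchi ln Z, variance = (-i)^2 d^2/dchi^2 ln Z = -d^2/dchi^2 ln Z.
    """
    def log_z(chi: float) -> complex:
        return cmath.log(generating_function(probabilities, chi))

    first = (log_z(h) - log_z(-h)) / (2.0 * h)
    second = (log_z(h) - 2.0 * log_z(0.0) + log_z(-h)) / h ** 2
    return (-1j * first).real, (-second).real


if __name__ == "__main__":
    toy = {0: 0.5, 1: 0.3, 2: 0.2}
    print(first_two_cumulants(toy))
```

For the toy distribution in the usage example the direct evaluation gives mean 0.7 and variance 0.61, which the finite differences reproduce up to discretization error.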

INFO Why would one want to know moments higher than the second of a current distribution?
Suppose an experimentalist recorded the outcome of many runs of a charge transmission
measurement. They might decide to communicate their results in a histogram "count number
vs. events," i.e. data whose continuous idealization we interpret as a probability distribution. To
a first approximation, the shape of this distribution is characterized by the first four cumulative
moments, $\mu_{1,\dots,4}$ (see Fig. 11.5):


26 To be more accurate, in Section 11.2.1 we introduced a source on the upper Keldysh contour. The sign inverted
source on the lower contour generates the same observable. Coupling to $\sigma_3$ adds the two contributions which
are compensated for by the factor of 1/2 in the formula above.
Figure 11.5 Hypothetical current distribution interpolating through an (equally hypothetical) discrete
data set and its characterization in terms of mean, width, skewness, and kurtosis.


1. moment, $\mu_1$: the average,

2. moment, $\mu_2$: the width,

3. moment, $\mu_3$: the skewness of the distribution,

4. moment, $\mu_4$: the kurtosis. High kurtosis implies a sharply peaked distribution with fat
tails. Low kurtosis indicates a mollified distribution, with broader shoulders.

The central limit theorem ensures that moments higher than the fourth are typically very small. 
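For a recorded set of counts, the four shape parameters can be estimated as follows. This is a minimal sketch using the conventional normalized definitions (skewness as the third central moment divided by the width cubed, kurtosis quoted as excess over the Gaussian value 3); these normalizations are a common convention and not taken from the text, and the function name is hypothetical.

```python
def distribution_shape(counts):
    """Mean, variance, skewness and excess kurtosis of a list of recorded counts."""
    n = len(counts)
    mean = sum(counts) / n

    def central(k):
        # k-th central moment of the empirical distribution
        return sum((c - mean) ** k for c in counts) / n

    variance = central(2)
    skewness = central(3) / variance ** 1.5
    excess_kurtosis = central(4) / variance ** 2 - 3.0
    return mean, variance, skewness, excess_kurtosis


if __name__ == "__main__":
    # Hypothetical outcome of nine runs of a charge transmission measurement.
    runs = [1, 2, 2, 3, 3, 3, 4, 4, 5]
    print(distribution_shape(runs))
```

The sample in the usage example is symmetric about its mean, so its skewness vanishes; its negative excess kurtosis signals a distribution with broader shoulders than a Gaussian of the same width.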


11.7.2 Realizations of current noise 

In this section we introduce two frequently encountered realizations of noisy current distri- 
butions in a schematic manner. We then explore how these prototypical forms relate to the 
“real-life” example of our resistively shunted quantum dot. 

Quantum point contact: consider a "quantum point contact,"
i.e. an isolated scatterer embedded into a single-channel
quantum conductor. This setup is not quite as academic as it
may seem. By standard techniques of current device technology
it is possible to manufacture few-channel quantum wires,
and to introduce artificial imperfections or tunneling bridges
(see the picture [courtesy of Nanocenter Basel] on the right for
an example). Charge carriers which encounter the point contact get transmitted with probability
$T$ and reflected with probability $1 - T$. This means that the probability to transmit
$n$ charges in $N$ events is given by the binomial (Bernoulli) distribution

$$ P(n) = \binom{N}{n} T^n (1 - T)^{N - n}. $$
The cumulant generating function of this distribution reads

$$ \ln g(\chi) = \ln \sum_n P(n)\, e^{i\chi n} = N \ln\left( 1 + T(e^{i\chi} - 1) \right). $$

A perfect single channel conductor has dimensionless conductance $g = e^2/h = 1/2\pi$ (in
our usual units $e = \hbar = 1$). If the system is biased, $N = I\Delta t = V\Delta t/2\pi$ charges will
be incoming, i.e. the statistics of current in the quantum point contact (at zero
temperature 27) is described by

$$ \ln g(\chi) = \frac{V \Delta t}{2\pi} \ln\left( 1 + T(e^{i\chi} - 1) \right). \qquad (11.99) $$
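The closed form of the binomial generating function is readily confirmed by direct summation. The following sketch compares $g(\chi) = \sum_n P(n)\,e^{i\chi n}$ with $(1 + T(e^{i\chi} - 1))^N$ and quotes the first two cumulants of the binomial distribution, $NT$ and $NT(1-T)$, for a point contact with $N = V\Delta t/2\pi$ attempts. The function names are ours; working with $g$ rather than $\ln g$ avoids any ambiguity in the branch of the complex logarithm.

```python
import cmath
import math


def binomial_g_direct(chi: float, attempts: int, transmission: float) -> complex:
    """g(chi) = sum_n P(n) exp(i chi n) with the binomial weights P(n)."""
    return sum(
        math.comb(attempts, n)
        * transmission ** n
        * (1.0 - transmission) ** (attempts - n)
        * cmath.exp(1j * chi * n)
        for n in range(attempts + 1)
    )


def binomial_g_closed(chi: float, attempts: int, transmission: float) -> complex:
    """Closed form g(chi) = (1 + T (exp(i chi) - 1))^N."""
    return (1.0 + transmission * (cmath.exp(1j * chi) - 1.0)) ** attempts


def point_contact_mean_and_variance(voltage: float, window: float, transmission: float):
    """First two cumulants N T and N T (1 - T) for N = V dt / 2 pi attempts."""
    attempts = voltage * window / (2.0 * math.pi)
    return attempts * transmission, attempts * transmission * (1.0 - transmission)


if __name__ == "__main__":
    print(binomial_g_direct(0.7, 12, 0.3), binomial_g_closed(0.7, 12, 0.3))
    print(point_contact_mean_and_variance(2.0 * math.pi, 10.0, 0.3))
```

The ratio of variance to mean is $1 - T$: a nearly transparent contact is almost noiseless, while a weakly transmitting one approaches Poissonian statistics.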


Ohmic resistor: in an Ohmic resistor the situation is different. Incidents of charge transmission
are still uncorrelated, but we may not think of them as the result of a single scattering
event. This suggests a description of charge transfer in terms of a Poisson distribution
(cf. discussion on p. 605),

$$ p(n) = e^{-\nu} \frac{\nu^n}{n!} \quad \Longrightarrow \quad \ln g(\chi) = \nu \left( e^{i\chi} - 1 \right). \qquad (11.100) $$


Bidirectional distribution: The current through a conductor
connected to two terminals (for the generalization to
multi-terminal geometries, see 25) is generally obtained by
additive superposition of two counter-propagating current
flows (see figure). For simplicity, let us assume the two distributions $p_i(n_i)$, $i = 1, 2$, of
counter-propagating charges to be statistically independent. The distribution $p(n)$ of the
total number of transmitted charges, $n = n_1 - n_2$, can then be computed as follows:

$$ p(n) = \sum_{n_1, n_2} \delta_{n, n_1 - n_2}\, p_1(n_1)\, p_2(n_2) $$
$$ \qquad = \sum_{n_1, n_2} \delta_{n, n_1 - n_2} \int \frac{d\chi_1}{2\pi}\, e^{-i\chi_1 n_1} g_1(\chi_1) \int \frac{d\chi_2}{2\pi}\, e^{-i\chi_2 n_2} g_2(\chi_2) $$
$$ \qquad = \int \frac{d\chi}{2\pi}\, e^{-i\chi n}\, g_1(\chi)\, g_2(-\chi). $$

This means that the generating function of the transmitted charge is the product $g(\chi) =
g_1(\chi) g_2(-\chi)$ of the generating functions of the partial distributions.

For example, in the case where the two constituent distributions are Poissonian, we find

$$ \ln g(\chi) = \nu_1 \left( e^{i\chi} - 1 \right) + \nu_2 \left( e^{-i\chi} - 1 \right), \qquad (11.101) $$

where $\nu_1$ and $\nu_2$ determine the average rates. These rates will typically be set by the applied
voltage bias, barrier transparencies, temperature, and the like. In the following, we return 
to the biased quantum dot studied in previous sections and explore the relevance of the 
general structures discussed above to this physical example. 
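The product rule for counter-propagating flows can be tested by brute force. In the sketch below, two independent Poisson distributions with (illustrative) rates `nu1` and `nu2` are combined into the distribution of $n = n_1 - n_2$, and the resulting generating function is compared with the closed form of the bidirectional Poisson distribution. The truncation `n_max` and all function names are assumptions of the example.

```python
import cmath
import math


def poisson(n: int, nu: float) -> float:
    """Poisson weight exp(-nu) nu^n / n!, evaluated in log space."""
    return math.exp(-nu + n * math.log(nu) - math.lgamma(n + 1))


def bidirectional_distribution(nu1: float, nu2: float, n_max: int = 60) -> dict:
    """p(n) = sum over n1, n2 with n = n1 - n2 of p1(n1) p2(n2), truncated at n_max."""
    p = {}
    for n1 in range(n_max + 1):
        for n2 in range(n_max + 1):
            p[n1 - n2] = p.get(n1 - n2, 0.0) + poisson(n1, nu1) * poisson(n2, nu2)
    return p


def generating_function(p: dict, chi: float) -> complex:
    """g(chi) = sum_n p(n) exp(i chi n)."""
    return sum(weight * cmath.exp(1j * chi * n) for n, weight in p.items())


def closed_form(chi: float, nu1: float, nu2: float) -> complex:
    """g(chi) = exp(nu1 (e^{i chi} - 1) + nu2 (e^{-i chi} - 1))."""
    return cmath.exp(nu1 * (cmath.exp(1j * chi) - 1.0) + nu2 * (cmath.exp(-1j * chi) - 1.0))


if __name__ == "__main__":
    p = bidirectional_distribution(3.0, 1.2)
    print(generating_function(p, 0.5), closed_form(0.5, 3.0, 1.2))
```

Note that the mean of the net charge is $\nu_1 - \nu_2$ while its variance is $\nu_1 + \nu_2$: the counter-flow reduces the current but adds to the noise.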


27 For the generalization to finite temperatures, cf. Refs. 25.




11.7.3 Full counting statistics of the double barrier quantum dot 

Let us now return to consider the quantum dot system introduced in Section 11.6.1. Our 
goal is to better understand the statistics of current flow. To this end, we will monitor the 
current through, say, the tunnel barriers connecting the dot to the left lead. The first thing 
we need is an appropriate counting field. Phenomenologically, we may argue that, in the 
presence of a vector potential $A\mathbf{e}_\perp$ normal to the lead axis, the tunneling matrix elements
utilized in Section 11.6.1 will generalize to $T_{L,a\mu} \to T_{L,a\mu} \exp(i \int dx\, A)$, where the
integral runs over the transverse extension of the barrier. The definition Eq. (11.97) then
means that the counting field couples to the tunneling matrix element on the Keldysh
contour as

$$ T_{L,a\mu} \to T_{L,a\mu} \exp(i\chi(t)\sigma_1/2). $$

Notice that, as with the quantum component of the interaction field, the counting field couples
to the tunneling matrix element. This observation is sufficient to determine the coupling
of the counting field to the effective action without much further calculation. For the sake of
a more symmetric notation, let us define generalized matrix elements $T_{x,a\mu} \exp(i\chi_x(t)\sigma_1/2)$,
$x = L, R$. (Later we will set $\chi_L = \chi$, $\chi_R = 0$.) The tunneling action Eq. (11.75) then
assumes the form

$$ S_{tun}[\chi] = -\frac{i}{4} \sum_{x = L,R} g_x\, \mathrm{tr}\left( \Lambda_x(\chi_x)\, e^{-i\phi} \Lambda_d\, e^{i\phi} \right), \qquad \Lambda_x(\chi_x) = e^{\frac{i}{2}\chi_x \sigma_1} \Lambda_x\, e^{-\frac{i}{2}\chi_x \sigma_1}, $$

where we have generalized to barriers of different tunnel conductance, $g_x$. Following the
logic of Section 11.6.2, our next step will be to determine the matrix $\Lambda_d$. As before, we do
this by requiring stationarity under variations of $\phi$. (Exercise: think why this condition is
equivalent to the condition of zero stationary current flow onto/off the dot.) Variation of
the action in the limit of vanishingly weak interaction, $\phi \simeq 0$, generates the condition

$$ \Big[ \Lambda_d, \sum_x g_x \Lambda_x(\chi_x) \Big] = 0. \qquad (11.102) $$

Now, the transformations $\Lambda \to \exp(i\chi\sigma_1/2)\, \Lambda \exp(-i\chi\sigma_1/2)$ render the matrices $\Lambda_x(\chi_x)$
non-triangular. This means that stationary configurations $\Lambda_d$ will in general no longer be
of the form Eq. (11.76). However, the transformation by the counting field does not alter
the "normalization condition": $\Lambda_d^2 = \sigma_0$ remains valid. It is straightforward to verify that
the solution of Eq. (11.102) obeying this normalization is given by 28

$$ \Lambda_d = \frac{g_L \Lambda_L(\chi_L) + g_R \Lambda_R(\chi_R)}{\left( g_L^2 + g_R^2 + g_L g_R [\Lambda_L(\chi_L), \Lambda_R(\chi_R)]_+ \right)^{1/2}}. $$

(Notice that the matrix in the denominator commutes with the numerator, i.e. the relative
ordering of numerator and denominator is not an issue. Also notice that in the limit $\chi = 0$,


28 In computing this result, we approximate the diagonal elements of the matrices $\Lambda$ in Eq. (11.76) by their
imaginary parts, ±1. Diligent readers may wish to trace the fate of the real parts and check that they regularize 
the superficially divergent constant C in Eq. (11.103) below. 
the solution reduces to that discussed in Section 11.6.2.) Substitution of this configuration
into the action leads to

$$ S_{tun}[\chi] = -\frac{i}{4}\, \mathrm{tr} \left( g_L^2 + g_R^2 + g_L g_R [\Lambda_L(\chi_L), \Lambda_R(\chi_R)]_+ \right)^{1/2}. $$

Assuming that the inverse of the counting time window $(\Delta t)^{-1}$ is small in comparison to
the energy scales relevant to the distribution functions, we will evaluate the trace over
energy/time indices within the leading order Wigner approximation (cf. Section 11.5.1),
$\mathrm{tr}(\dots) \to \int \frac{dt\, d\epsilon}{2\pi} (\dots)$, $F(\hat\epsilon) \to F(\epsilon)$, $\chi(\hat t) \to \chi(t)$, where $F(\epsilon)$ and $\chi(t)$ are ordinary functions
of energy and time, respectively. As a result of a straightforward (if somewhat tedious)
calculation, one finds that the matrix $[\Lambda_L(\chi_L), \Lambda_R(\chi_R)]_+$ is proportional to the unit matrix,
and that the trace evaluates to

$$ S_{tun}[\chi] = -\frac{i}{2} \int \frac{dt\, d\epsilon}{2\pi} \Big( (g_L + g_R)^2 + 4 g_L g_R \big( (e^{i(\chi_L - \chi_R)} - 1)\, n_L (1 - n_R) + (e^{i(\chi_R - \chi_L)} - 1)\, n_R (1 - n_L) \big) \Big)^{1/2} $$
$$ \qquad = -\frac{i \Delta t}{2} \int \frac{d\epsilon}{2\pi} \Big( (g_L + g_R)^2 + 4 g_L g_R \big( (e^{i\chi} - 1)\, n_L (1 - n_R) + (e^{-i\chi} - 1)\, n_R (1 - n_L) \big) \Big)^{1/2} + C, \qquad (11.103) $$

where in the last equality we used $\chi_L(t) = \chi(t) \equiv \chi\, \Theta(t - \bar t)\, \Theta(\bar t + \Delta t - t)$, and $C$ is a
constant. From this result we find the cumulant generating function $\ln g(\chi) = \ln \exp(iS[\chi])$,

$$ \ln g(\chi) = \frac{\Delta t}{2} \int \frac{d\epsilon}{2\pi} \Big( (g_L + g_R)^2 + 4 g_L g_R \big( (e^{i\chi} - 1)\, n_L (1 - n_R) + (e^{-i\chi} - 1)\, n_R (1 - n_L) \big) \Big)^{1/2}. \qquad (11.104) $$


Now, let us try to make sense of this expression. Comparison with Eq. (11.101) shows that
$\ln g$ contains the generating functions of two Poisson distributions as building blocks. To
explore the meaning of this result, let us first consider the limit of zero temperature and
voltage bias $V$, $n_{L/R}(\epsilon) = \Theta(\pm V/2 - \epsilon)$. The generating function then reduces to

$$ \ln g(\chi) = \frac{\Delta t\, V}{4\pi} \left( (g_L + g_R)^2 + 4 g_L g_R (e^{i\chi} - 1) \right)^{1/2} + C. $$

According to Eq. (11.98), the first moment of the transmitted charge through the system
is given by

$$ \langle \hat N \rangle = -i \partial_\chi \big|_{\chi = 0} \ln g(\chi) = \frac{\Delta t\, V}{2\pi} \frac{g_L g_R}{g_L + g_R}. $$

Comparison with the definition of the conductance, $G = \langle I \rangle / V = \langle \hat N \rangle / V \Delta t$, gives $G =
g_L g_R/(g_L + g_R)$, which we recognize as the conductance of two tunnel conductances connected
in series (in units of the conductance quantum $e^2/h = 1/2\pi$). Turning to current statistics,
let us consider the limit $g_L \ll g_R$. We may then expand the square root to first order in
the ratio $g_L/g_R$ to obtain a bi-directional Poisson distribution Eq. (11.101), with (time-integrated)
characteristic rates identified as


$$ \nu_1 = \Delta t\, g_L \int \frac{d\epsilon}{2\pi}\, n_L (1 - n_R), \qquad \nu_2 = \Delta t\, g_L \int \frac{d\epsilon}{2\pi}\, n_R (1 - n_L). $$

These coefficients may be interpreted as the integrated rate at which filled states in the
left lead scatter into empty states in the right lead, and vice versa. As one may expect,
the statistics of the current is dominantly caused by the "bottleneck" in the system, i.e.
the conductance of the weaker tunnel barrier, $g_L$. At finite temperature $T \ll V$, $\nu_1 \simeq \Delta t\, g_L V/2\pi$,
while $\nu_2 \simeq \exp(-V/T)\, \Delta t\, g_L V/2\pi$ shows that thermal activation is necessary to push charges
against the voltage gradient.
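The zero temperature generating function quoted above lends itself to a quick numerical exploration. The sketch below makes two assumptions that are ours: the constant is fixed as $C = -\Delta t\, V (g_L + g_R)/4\pi$, so that $\ln g(0) = 0$, and the first two cumulants are obtained by central finite differences. The ratio of variance to mean is the Fano factor in the sense of Eq. (11.91) with $S_0 = 2\bar I$.

```python
import cmath
import math


def log_g_zero_temperature(chi, g_left, g_right, voltage, window):
    """ln g(chi) at T = 0, normalized (by assumption) such that ln g(0) = 0."""
    total = g_left + g_right
    root = cmath.sqrt(total * total + 4.0 * g_left * g_right * (cmath.exp(1j * chi) - 1.0))
    return window * voltage / (4.0 * math.pi) * (root - total)


def mean_and_fano(g_left, g_right, voltage, window, h=1e-4):
    """Mean transmitted charge and Fano factor var(N) / <N> by finite differences."""
    def f(chi):
        return log_g_zero_temperature(chi, g_left, g_right, voltage, window)

    mean = (-1j * (f(h) - f(-h)) / (2.0 * h)).real
    variance = (-(f(h) - 2.0 * f(0.0) + f(-h)) / h ** 2).real
    return mean, variance / mean


if __name__ == "__main__":
    # Symmetric barriers, then a strongly asymmetric ("bottleneck") configuration.
    print(mean_and_fano(1.0, 1.0, voltage=2.0 * math.pi, window=1.0))
    print(mean_and_fano(0.1, 10.0, voltage=2.0 * math.pi, window=1.0))
```

For symmetric barriers the Fano factor comes out as 1/2, consistent with the Langevin analysis of Section 11.6, while for $g_L \ll g_R$ it approaches the Poissonian value 1, as expected when a single weak barrier dominates.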

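The second (cumulative) moment defines the noise power (cf. Eq. (11.91)),

$$ S = \frac{2}{\Delta t} \mathrm{var}(\hat N) = -\frac{2}{\Delta t}\, \partial_\chi^2 \big|_{\chi = 0} \ln g[\chi] = \frac{2}{\Delta t} (\nu_1 + \nu_2) = 2\, \frac{g_L}{2\pi}\, V \coth(V/2T). $$

This shows how the noise interpolates between equilibrium noise, $\langle \delta I(t)\delta I(t') \rangle \overset{V \to 0}{\sim} g_L T$,
and the shot noise limit $\langle \delta I(t)\delta I(t') \rangle \overset{V \gg T}{\sim} g_L |V|$. In the more general case of barriers of
comparable transparency, $g_L \simeq g_R$, the current statistics is more complicated and described
by the full expression Eq. (11.104).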
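The crossover between thermal and shot noise can be verified by integrating the Fermi functions numerically. The sketch below assumes Fermi distributions with chemical potentials $\pm V/2$ in the left and right lead, the regime $g_L \ll g_R$, and units $e = \hbar = k_B = 1$; the integration window and step count are ad hoc choices, and all names are illustrative.

```python
import math


def fermi(energy: float, mu: float, temperature: float) -> float:
    """Fermi distribution with chemical potential mu."""
    return 1.0 / (math.exp((energy - mu) / temperature) + 1.0)


def rate_integral(voltage, temperature, forward=True, steps=20000):
    """Midpoint estimate of int de n_L (1 - n_R) (forward) or int de n_R (1 - n_L)."""
    lower = -voltage / 2.0 - 40.0 * temperature
    upper = voltage / 2.0 + 40.0 * temperature
    de = (upper - lower) / steps
    total = 0.0
    for k in range(steps):
        e = lower + (k + 0.5) * de
        n_l = fermi(e, +voltage / 2.0, temperature)
        n_r = fermi(e, -voltage / 2.0, temperature)
        total += n_l * (1.0 - n_r) if forward else n_r * (1.0 - n_l)
    return total * de


def noise_power_numeric(voltage, temperature, g_left):
    """S = (2 / dt)(nu_1 + nu_2) with nu_i = dt g_L int de / 2 pi (...)."""
    forward = rate_integral(voltage, temperature, forward=True)
    backward = rate_integral(voltage, temperature, forward=False)
    return g_left / math.pi * (forward + backward)


def noise_power_closed(voltage, temperature, g_left):
    """Closed form S = (g_L / pi) V coth(V / 2T)."""
    return g_left / math.pi * voltage / math.tanh(voltage / (2.0 * temperature))


if __name__ == "__main__":
    for v in (0.01, 0.5, 2.0, 10.0):
        print(v, noise_power_numeric(v, 0.5, 1.0), noise_power_closed(v, 0.5, 1.0))
```

For $V \ll T$ the noise power tends to $2 g_L T/\pi$, the equilibrium value $4T/R$ of a resistor with $R^{-1} = g_L/2\pi$, while for $V \gg T$ it tends to $g_L V/\pi$, i.e. twice the mean current.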

In principle, we might now include phase fluctuations to explore the interesting question as 
to how interaction effects will influence FCS. However, this topic 29 lies beyond the scope of 
the present text. Instead, let us conclude by briefly discussing a few more general aspects 
of full counting statistics. 


11.7.4 General ramifications of FCS 

Previously, we have introduced the concept of a counting field as a purely technical means 
to extract moments of observables from a Keldysh partition function. From a theoretical 
point of view, this has been an efficient procedure, but it is also somewhat naive: when we 
speak about moments of quantum observables, we inevitably enter the difficult territory 
of quantum measurement. First, we have to realize that, when talking about moments, 
what stands in the background is a repeated quantum measurement of observables. Second, 
in experiment, it is not the (fermion) current that is measured directly. Typically, it is 
fluctuations of secondary degrees of freedom, e.g. the bosonic modes of electromagnetic 
degrees of freedom coupling to the current that are detected. One thus needs to explore 
the connection between the original degrees of freedom (fermion current) and the detector 
degrees of freedom. These (deep) issues have been explored in the literature 25 and we 
here restrict ourselves to a very superficial discussion of some main ideas. 

An elegant (if not particularly realistic) way to model the coupling to a detector degree of 
freedom is by coupling the fermion degrees of freedom to a spin. The precession of the spin 
detector then encapsulates information on the transmitted fermion charge. Curiously, this 
procedure exactly amounts to what we have been doing in introducing our Keldysh counting 
field, and this connection is worked out in the following 

EXERCISE We here wish to explore in what sense our counting field effectively models a
spin detector variable (cf. the review by Levitov 25 for an extended discussion). Consider a spin 


29 cf. D. A. Bagrets and Yu. V. Nazarov, Full counting statistics of charge transfer in Coulomb blockade systems,
Phys. Rev. B 67, 085316-32 (2003).
1/2 placed near our current carrying device and coupled to the latter through the Hamiltonian
$\hat H_c = \int d^d r\, \mathbf{A} \cdot \hat{\mathbf{j}}\, \sigma_z$, where $\hat{\mathbf{j}}$ denotes the fermion current density operator, $\mathbf{A}$ is a vector potential
mediating the coupling to the spin, and we assume that only the $z$-component of the latter is
coupled. Now, let us assume that the spin has been prepared in some initial state, so that the
measurement of the operator $\sigma_+ = (\sigma_x + i\sigma_y)/2$ would lead to the expectation value $\langle \sigma_+(0) \rangle_{spin}$.
The coupling to the fermion system makes the transverse spin components probed by this
expectation value precess. This precession is described by taking the expectation value at a later
time, $\langle \sigma_+(t) \rangle_{spin}$. Show that the coupling to the fermion system above implies

$$ \langle \sigma_+(t) \rangle_{spin} = \left\langle e^{i\hat H(+A)t}\, e^{-i\hat H(-A)t} \right\rangle_{fermion} \langle \sigma_+(0) \rangle_{spin}, $$

where $\hat H(A)$ is the fermion Hamiltonian operator minimally coupled to the vector potential, $A$.
Express the fermion expectation value by a Keldysh partition function. The sign change in A 
relative to the contours then means that we have coupled the fermion system to a purely quantum 
vector potential, as we did in Section 11.7.1. Assuming that the spin-fermion coupling is local, i.e. 
that the vector potential is non-vanishing only in the immediate vicinity of a transverse section 
through the conductor, we arrive at a functional integral formally equivalent to the integral $Z[\chi]$
employed above. 


The assumption of a spin detector is artificial in two respects. First, currents are not normally
measured by spins. Second, the coupling of our system of interest to a detector will
in general imply a "back action" of the latter on the former, and this will affect the current
statistics. These aspects are discussed in the review by Kindermann and Nazarov cited
above. 25 The authors argue that measurements on a (mesoscopic) device typically imply
the embedding of the latter into a detector electromagnetic environment. 

With reasonable generality, the detector environ- 
ment may be modelled as a linear electromagnetic 
circuit (see figure), i.e. a system whose relevant 
degrees of freedom (currents, voltages, etc.) are 
controlled by some effective oscillator dynamics 
(cf. an LRC-circuit). The variables of the detector 
environment are linearly coupled to those of the 
mesoscopic device. (For example, the charge density 
on the latter may feel the potential fluctuations of 
the detector, etc.) One may then formulate an FCS 
functional for the composite system. Owing to their 
linearity, the degrees of freedom of the detector environment may be integrated out, and 
this leaves an effective functional in which the counting field and the primary variables of 
the mesoscopic device are coupled in a manner that will typically involve retardation effects. 
In this way, one has effectively described both a reasonably realistic readout procedure 
and, relatedly, the back action of the measurement device onto the microscopic system of 
interest. For further discussion of these issues we refer to the original literature. 
In this final chapter of the book, we have introduced the Keldysh field theory as a modern
- in spite of its formulation in the mid-1960s - tool to describe quantum nonequilibrium
phenomena. It must be conceded that the Keldysh formalism comes with a relatively steep
learning curve. However, once one gets used to the concept of two counter-propagating
time contours, one realizes that Keldysh field theory provides an enormously powerful and,
indeed, intuitive framework to approach quantum nonequilibrium phenomena. Practically 
all other theoretical nonequilibrium tools (at least those familiar to the authors of this text) 
can be recovered as restrictions of the Keldysh approach. Specifically, we have discussed the 
derivation of quantum master and quantum kinetic equations, the connection to equilibrium 
field theory, the classical limit (which neatly connects to the MSRJD formalism central to 
the previous chapter), nonequilibrium variants of diagrammatic perturbation theory, and 
various other aspects. However, notwithstanding its indisputable power, one should not 
get carried away and declare the Keldysh formalism the new master tool, superior to the 
equilibrium concepts discussed in earlier chapters of the book. The flexibility of Keldysh 
theory may come as a burden when one is actually carrying more information about a 
problem than that which is called for. In situations where one is at equilibrium, or close to it, 
there is no need to explicitly keep track of the (then known) distribution function, and the 
Matsubara formulation may be the more efficient option. However, at the time of writing of 
this book, quantum nonequilibrium problems are becoming more and more important, and 
this makes Keldysh theory an important concept in contemporary physics. At any rate, the 
best way for readers to proceed from here is to pick a nonequilibrium problem and conduct 
their own research! 


11.9 Problems 

11.9.1 Atom-field Hamiltonian 

The “atom-field Hamiltonian” is a simple model Hamiltonian of quantum optics. It reduces the inter-
action of atoms with electromagnetic field modes to a basic model of a two-level system (“the atom”)
coupled to an assembly of oscillator modes. The simplistic nature of this reduction notwithstanding, the
atom-field Hamiltonian describes ample phenomenology, and it is a prominent model system of quantum
optics. In this problem, we study the simplest variant of the system, the exactly solvable limit of the
interaction with a single mode. The single mode limit describes fully coherent quantum phenomenol-
ogy. We use it as a benchmark system to compare against the “incoherent” approximations underlying
the quantum master equation of Section 11.1. In the follow-up Problem 11.9.2 we then explore what
happens upon generalization to multi-mode coupling.
Consider an atom exposed to electromagnetic radiation. Assuming
that the field modes predominantly couple two atomic states $|a\rangle$
and $|b\rangle$ (see figure), and forgetting about the complications intro-
duced by the polarization of the electromagnetic field, this setup
may be described by the simple model Hamiltonian,

$$H = \frac{\epsilon}{2}\sigma_3 + \sum_k \omega_k a_k^\dagger a_k + \sum_k \left( g_k \sigma_+ a_k + \bar g_k \sigma_- a_k^\dagger \right), \qquad (11.105)$$

where $\epsilon$ is the energy difference between the excited state, $|a\rangle$, and the lower state, $|b\rangle$,
the Pauli matrices $\sigma_i$ act in the two-dimensional Hilbert space defined by these states, and
$\sigma_\pm = (\sigma_1 \pm i\sigma_2)/2$, as usual. This atom-field Hamiltonian describes excitation processes
$|b\rangle \to |a\rangle$ by field quantum absorption (at coupling constants $g_k$), and the corresponding
relaxation processes by quantum emission.

We may simplify the problem further by assuming that only a single mode of the elec-
tromagnetic field satisfies the resonance condition $\omega \simeq \epsilon$ that needs to be met to obtain
significant conversion efficiency. This, then, leads to the single mode Hamiltonian

$$H = \frac{\epsilon}{2}\sigma_3 + \omega a^\dagger a + g \left( \sigma_+ a + \sigma_- a^\dagger \right), \qquad (11.106)$$

where we have omitted the mode index “k” and gauged the coupling constant $g$ so as to
become real.





(a) Considering the atom as our “system,” and the mode as a “bath,” adapt the formalism
introduced in Section 11.1 to derive an effective equation of motion for the reduced
system density matrix, $\rho_s$. In doing so, try to be critical concerning the approximate
elements of the construction, notably the Markovian approximation. (Hint: you will
be met with a singular effective coupling constant $\sim \delta(\epsilon - \omega)$. For the moment, do
not worry about the regularization of this expression and treat it as a formal object.)
Derive a closed expression for the diagonal elements of the reduced density operator
$\rho_x = \langle x|\rho_s|x\rangle$, $x = a, b$, and verify that the stationary limit $\rho_{x,\infty} = \rho_x(t \to \infty)$ obeys a
detailed balance relation

$$\frac{\rho_{a,\infty}}{\rho_{b,\infty}} = \frac{\langle n \rangle}{\langle n + 1 \rangle}, \qquad (11.107)$$

where $n$ is the number of mode quanta and the expectation value is over the thermal
distribution of the bath. Accordingly, the population imbalance between the levels
approaches the limit

$$\Delta\rho_\infty \equiv \rho_{a,\infty} - \rho_{b,\infty} = -\frac{1}{2\langle n \rangle + 1}. \qquad (11.108)$$

Discuss the meaning of this expression. Is it really appropriate for the situation at hand?
If not, where do you think the approximation involved in its derivation failed?

(b) Now, let us compare the predictions of the Markovian approximation to reality. To this
end, solve the Schrödinger equation for the Hamiltonian above. (Hint: use the fact that
the Hamiltonian couples only very few of the states $|x, n\rangle = |x\rangle \otimes |n\rangle$, $x = a, b$, where $|n\rangle$
is an occupation eigenstate of the mode system. You will encounter a time dependent
Schrödinger equation that may be solved either by Fourier transform, or by educated
guessing.) Interpreting $\rho_x = \sum_n |\psi_{x,n}|^2$, where $\psi_{x,n}$ is the amplitude of the state $|x, n\rangle$,
as the probability to find the system in state $|x\rangle$, compute the population imbalance and
compare to the predictions of the Markovian approach Eq. (11.108). If you have access to
a computer, you may find it instructive to plot the result for the time dependent imbalance
for different values of temperature (presuming the mode distribution to be thermal), and
frequency mismatch $\Delta = \epsilon - \omega$ as a function of the dimensionless time parameter $gt$.
You will obtain patterns such as the one shown in Fig. 11.6.

Figure 11.6 Imbalance between the population of the two atomic states as a function of dimen-
sionless time $gt$. Notice the aperiodic changes in the population of the states and the absence of
relaxation towards a ground state configuration $\Delta\rho = -1$.

(c) Discuss in qualitative terms the origin of the discrepancies between the two approaches.
Why is the Markovian approximation not appropriate under the present circumstances,
and why does the exact solution not predict relaxation of an initially occupied state $|a\rangle$
to the ground state, even at zero temperature?

Answer: 


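(a) In the interaction representation, $a(t) = e^{-i\omega t} a$, $\sigma_\pm(t) = e^{\pm i\epsilon t} \sigma_\pm$, the interaction Hamil-
tonian $H_I = g(\sigma_+ a + \sigma_- a^\dagger)$ reads

$$H_I(t) = g \left( e^{i\Delta t} \sigma_+ a + e^{-i\Delta t} \sigma_- a^\dagger \right),$$

where we defined the energy mismatch $\Delta = \epsilon - \omega$ between level splitting and mode
frequency. Defining $L_I = -i[H_I, \,\cdot\,]$, it is then straightforward to verify that Eq. (11.4)
assumes the form

$$\begin{aligned}
\partial_t \rho_s = &-g^2 \int dt' \, e^{+i\Delta t'} \left( +\langle n + 1 \rangle [\sigma_+, \sigma_- \rho_s(t - t')] - \langle n \rangle [\sigma_+, \rho_s(t - t') \sigma_-] \right) \\
&- g^2 \int dt' \, e^{-i\Delta t'} \left( -\langle n + 1 \rangle [\sigma_-, \rho_s(t - t') \sigma_+] + \langle n \rangle [\sigma_-, \sigma_+ \rho_s(t - t')] \right), \qquad (11.109)
\end{aligned}$$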

where the expectation value is over the mode distribution. Assume that at some initial
time $t = 0$, the density operator had diagonal form,

$$\rho_s = \rho_a P_+ + \rho_b P_-,$$

where $P_\pm$ are projectors onto the upper ($a$) or lower ($b$) state, and $\rho_{a,b}$, with $\rho_a + \rho_b = 1$, are
the probabilities of occupation of these states. It is straightforward to check that the
evolution equation preserves the diagonal form. Adopting a Markovian approximation
(but is it appropriate?) wherein the time dependence of $\rho_s$ under the integral above
is considered negligible, the evolution equation for the coefficients $\rho_{a,b}$ takes the form
of a master equation

$$\begin{aligned}
\partial_t \rho_a &= \Gamma \left( \langle n \rangle \rho_b - \langle n + 1 \rangle \rho_a \right), \\
\partial_t \rho_b &= \Gamma \left( \langle n + 1 \rangle \rho_a - \langle n \rangle \rho_b \right), \qquad (11.110)
\end{aligned}$$

where the rate $\Gamma = 2\pi g^2 \delta(\Delta)$ is singular at resonance. This equation predicts an
(irreversible) approach to a stationary limit satisfying the detailed balance relation
Eq. (11.107).

(b) It is evident that for any fixed $n \ge 0$ the Hamiltonian acts in the two-dimensional
space spanned by the states $|a,n\rangle$ and $|b,n+1\rangle$. Specifically, $H_I(t)|a,n\rangle = g e^{-i\Delta t}(n+1)^{1/2}|b,n+1\rangle$
and $H_I(t)|b,n+1\rangle = g e^{+i\Delta t}(n+1)^{1/2}|a,n\rangle$. Introducing wavefunctions
by $|\psi\rangle = \psi_{a,n}(t)|a,n\rangle + \psi_{b,n+1}(t)|b,n+1\rangle$, the time dependent Schrödinger equation
$i\partial_t \psi = H_I(t)\psi$ assumes the form

$$\begin{aligned}
i\partial_t \psi_{a,n} &= g e^{+i\Delta t}(n+1)^{1/2} \psi_{b,n+1}, \\
i\partial_t \psi_{b,n+1} &= g e^{-i\Delta t}(n+1)^{1/2} \psi_{a,n}. \qquad (11.111)
\end{aligned}$$

These equations are solved by (devise your own solution strategy, there are several)

$$\begin{aligned}
\psi_{a,n}(t) &= e^{+i\Delta t/2} \left( \psi_{a,n}(0) \cos(\Omega_n t) - i\, \frac{g(n+1)^{1/2} \psi_{b,n+1}(0) + \frac{\Delta}{2} \psi_{a,n}(0)}{\Omega_n} \sin(\Omega_n t) \right), \\
\psi_{b,n+1}(t) &= e^{-i\Delta t/2} \left( \psi_{b,n+1}(0) \cos(\Omega_n t) - i\, \frac{g(n+1)^{1/2} \psi_{a,n}(0) - \frac{\Delta}{2} \psi_{b,n+1}(0)}{\Omega_n} \sin(\Omega_n t) \right),
\end{aligned}$$

where we introduced the abbreviation

$$\Omega_n = \left[ g^2(n+1) + \left( \frac{\Delta}{2} \right)^2 \right]^{1/2}.$$

Assuming for simplicity that the atom was initially in its excited state, we have $\psi_{b,n+1}(0) = 0$
and $\psi_{a,n}(0) = \sqrt{p_n}$, where $p_n$ is the $n$th eigenvalue of the mode density operator (e.g.,
$p_n = Z^{-1} \exp(-\beta\omega(n + 1/2))$ if the mode distribution is thermal). The exact result for
the population imbalance then reads

$$\Delta\rho = \sum_n \left( |\psi_{a,n}|^2 - |\psi_{b,n+1}|^2 \right) = \sum_n p_n \left( 1 - \frac{2g^2(n+1)}{g^2(n+1) + (\Delta/2)^2} \sin^2(\Omega_n t) \right).$$

This result is very different from the one obtained within the Markovian approach,
Eq. (11.108): no stationary limit is approached. A short period of decay of the initial
value $\Delta\rho(0) = 1$ merges into a pattern of irregular fluctuations - a result of a super-
position of contributions of incommensurate frequencies (see Fig. 11.6). In quantum
optics, the phenomenon of transient near-recoveries of the initial value is known as
collapse and revival while the fluctuations afford an interpretation as Rabi oscil-
lations. Clearly, the oscillatory pattern is a result of maintained quantum coherence
and reversibility of the dynamics; we do not observe a systematic decay of the upper
state into the lower. Notice in particular that even at zero temperature the atom does
not relax by emission of field quanta: at $T = 0$ only the $n = 0$ term (zero field quanta)
contributes to the sum above. This leads to oscillatory behavior of the density operator
in which the initial state is recovered at regular intervals $t \sim g^{-1}$, but no relaxation. For
further discussion of the fluctuation pattern we refer to Scully and Zubairy. 30 Here, our
main conclusion is that the prediction of irreversible dynamics derived in (a) is evidently
incorrect.
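
The exact single-mode imbalance of part (b) is easy to evaluate numerically. The following is a minimal sketch, assuming a thermal mode distribution truncated at a finite occupation number and an atom that starts in its excited state; the function names `thermal_weights` and `population_imbalance` and the cutoff `n_max` are illustrative choices, not part of the text.

```python
import numpy as np

def thermal_weights(beta_omega, n_max):
    """Normalized occupation probabilities p_n of a single mode, n = 0..n_max.

    beta_omega is the ratio omega / T; pass np.inf for zero temperature.
    The zero-point factor exp(-beta*omega/2) cancels on normalization.
    """
    if np.isinf(beta_omega):
        p = np.zeros(n_max + 1)
        p[0] = 1.0
        return p
    w = np.exp(-beta_omega * np.arange(n_max + 1))
    return w / w.sum()

def population_imbalance(t, g, delta, p):
    """Imbalance sum_n p_n [1 - 2 g^2 (n+1) / Omega_n^2 * sin^2(Omega_n t)]
    for an atom initially in the excited state."""
    t = np.atleast_1d(np.asarray(t, dtype=float))
    n = np.arange(len(p))
    omega_n = np.sqrt(g**2 * (n + 1) + (delta / 2.0) ** 2)  # Rabi frequencies
    amplitude = 2.0 * g**2 * (n + 1) / omega_n**2
    oscillation = np.sin(np.outer(t, omega_n)) ** 2
    return (p * (1.0 - amplitude * oscillation)).sum(axis=1)

if __name__ == "__main__":
    times = np.linspace(0.0, 40.0, 9)              # dimensionless time g*t with g = 1
    p = thermal_weights(beta_omega=0.2, n_max=400)
    print(population_imbalance(times, g=1.0, delta=0.0, p=p))
```

Plotting the output against $gt$ for several temperatures and detunings should produce irregular patterns of the kind described above; at zero temperature and zero detuning the sum collapses to the single term $1 - 2\sin^2(gt)$.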

(c) The Markovian approximation fails because a single quantum oscillator mode does not 
behave as a bath. Indeed, the mode-atom coupling is strongest at resonance, A = 0, 
when “system” and “bath” fluctuate at comparable time scales. Technically this means 
that the (Markovian) assumption of constancy of the time evolution operator under the 
integral Eq. (11.109) is not justified. The Markov approximation requires a short lived 
bath “memory,” in the sense that bath correlation functions decay to zero at time scales 
much shorter than those at which the system density operator varies. This happens, e.g., 
if we couple to many modes instead of one (see the follow-up problem.) In this case, 
the analog of Eq. (11.108) contains many uncorrelated contributions, and the dynamics 
becomes effectively irreversible. 31 


11.9.2 Atom-field Hamiltonian II: Weisskopf-Wigner theory of spontaneous emission

(Attack this problem after 11.9.1.) In the previous problem, we saw that the coupling of an atom to a 
single electromagnetic mode does not lead to radiative relaxation. The relaxation processes ubiquitous 
in the physics of atomic radiation must, then, be a consequence of the presence of many field modes. In 
this problem, we study how the large phase space of the multi-mode system justifies an approximation 
that renders the dynamics of the atom-field system effectively irreversible and does describe relaxation 
by radiation. The theory derived here plays an important role as a building block of, e.g., laser theory. 

(a) Consider the Hamiltonian Eq. (11.105) of a two-level atom coupled to a multi-mode
field. Assuming zero temperature, derive the generalization of the Schrödinger equation
(11.111) for the initial configuration $|a\rangle \otimes |0\rangle$ to the multi-mode case ($|0\rangle$ is the zero
temperature photon vacuum). By formal integration of the second equation, convert the
system to a single integro-differential equation. Assuming that $|g_k|^2 = g^2(\omega_k)$ depends
only on the mode energy and that both the density of states of the bath modes and $g^2(\omega)$
change only negligibly over the inverse of the time scales at which the wave functions of
the problem vary - the Weisskopf-Wigner approximation - derive an approximate
solution of this equation.


30 M.O. Scully and M.S. Zubairy, Quantum Optics (Cambridge University Press, 1997).

31 Even then, it is not impossible that a rare accumulation of coherent contributions leads to an accidental
population “revival” - after all, we are still dealing with unitary (reversible) dynamics. However, for sufficiently
large numbers of contributing modes, such events are infinitely improbable, and an effectively irreversible
approximation is justified.

(b) For arbitrary temperature and initial population, attack the multi-mode problem by 
generalization of the projector formalism applied in Problem 11.9.1 (a) to the single 
mode case and compare to the results of the Weisskopf-Wigner theory. Convince yourself 
that the approximation used there is equivalent to a Markovian approximation. 

Answer: 


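(a) The initial state relevant to the description of an excited atom immersed into a zero
temperature bath reads $|a\rangle \otimes |0\rangle$, where $|0\rangle$ is the vacuum of the bath system. The
Hamiltonian couples this configuration to the states $|b\rangle \otimes a_k^\dagger|0\rangle$. Introducing a wave
function by $\psi = \psi_0 |a\rangle \otimes |0\rangle + \sum_k \psi_k |b\rangle \otimes a_k^\dagger|0\rangle$, the multi-mode generalization of the
Schrödinger equation (11.111) reads

$$\begin{aligned}
i\partial_t \psi_0 &= \sum_k g_k e^{i\Delta_k t} \psi_k, \\
i\partial_t \psi_k &= \bar g_k e^{-i\Delta_k t} \psi_0,
\end{aligned}$$

where $\Delta_k = \epsilon - \omega_k$. We integrate the second equation and substitute the result into the
first equation to arrive at

$$\partial_t \psi_0(t) = -\sum_k |g_k|^2 \int_0^t dt' \, e^{i\Delta_k (t - t')} \psi_0(t').$$

Introducing the density of bath modes $\rho(\omega) = \sum_k \delta(\omega - \omega_k)$, we obtain

$$\partial_t \psi_0(t) = -\int d\omega \, \rho(\omega) g^2(\omega) \int_0^t dt' \, e^{i(\epsilon - \omega)(t - t')} \psi_0(t') \simeq -\frac{\Gamma}{2} \psi_0(t),$$

where in the crucial second step we assumed approximate constancy of both $\rho(\omega) \simeq \rho$
and $g(\omega) \simeq g$ to evaluate the frequency integral as $\int d\omega \, e^{i(\epsilon - \omega)(t - t')} = 2\pi\delta(t - t')$. The
(irreversible) effective equation for $\psi_0$ is now trivially solved as 32

$$\psi_0(t) = e^{-\Gamma t/2}.$$

This means that the population imbalance between the two atomic levels

$$\Delta\rho = |\psi_0|^2 - \sum_k |\psi_k|^2 = -1 + 2e^{-\Gamma t}$$

shows relaxation at the golden rule decay rate

$$\Gamma = 2\pi\rho g^2. \qquad (11.112)$$


32 We count $\int_0^t dt' \, \delta(t - t') = 1/2$ since the $\delta$-function lies at the boundaries of the integration domain.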
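
The Weisskopf-Wigner result can be checked against a brute-force calculation. The sketch below is an illustration under stated assumptions, not part of the original solution: the bath is replaced by a finite set of equally spaced modes in a flat band centered on the atomic resonance, the coupling is taken real and mode independent, and the one-excitation Hamiltonian is diagonalized exactly. The function name `survival_probability` and the parameters `bandwidth` and `n_modes` are hypothetical.

```python
import numpy as np

def survival_probability(times, gamma=1.0, bandwidth=100.0, n_modes=2000):
    """Exact |psi_0(t)|^2 for an excited atom coupled to n_modes bath modes.

    The basis is {|a,0>, |b,k>}; energies are measured relative to |a,0>,
    so the diagonal holds the detunings omega_k - epsilon.
    """
    d_omega = bandwidth / n_modes
    detunings = -bandwidth / 2.0 + (np.arange(n_modes) + 0.5) * d_omega
    rho = 1.0 / d_omega                      # density of bath modes
    g = np.sqrt(gamma / (2.0 * np.pi * rho)) # fixes Gamma = 2 pi rho g^2

    h = np.zeros((n_modes + 1, n_modes + 1))
    h[0, 1:] = g
    h[1:, 0] = g
    idx = np.arange(1, n_modes + 1)
    h[idx, idx] = detunings

    energies, vectors = np.linalg.eigh(h)
    weights = vectors[0, :] ** 2             # |<a,0|E_j>|^2
    times = np.atleast_1d(np.asarray(times, dtype=float))
    amplitude = np.exp(-1j * np.outer(times, energies)) @ weights
    return np.abs(amplitude) ** 2

if __name__ == "__main__":
    t = np.array([0.0, 0.5, 1.0, 2.0, 4.0])
    print(np.column_stack([t, survival_probability(t), np.exp(-t)]))
```

For times long compared to the inverse bandwidth and short compared to the recurrence time $2\pi$ over the mode spacing, the exact survival probability should follow $e^{-\Gamma t}$ up to corrections that shrink as the band is widened, which is the numerical counterpart of the assumption of a frequency independent $\rho g^2$.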
(b) Comparing to the discussion in the previous problem, we verify that the multi-mode
generalization of Eq. (11.109) reads

$$\begin{aligned}
\partial_t \rho_s = -\int d\omega \, \rho(\omega) g^2(\omega) \int dt' \, \Big[ & e^{+i(\epsilon - \omega)t'} \big( \langle n(\omega) + 1 \rangle [\sigma_+, \sigma_- \rho_s(t - t')] - \langle n(\omega) \rangle [\sigma_+, \rho_s(t - t') \sigma_-] \big) \\
& - e^{-i(\epsilon - \omega)t'} \big( \langle n(\omega) + 1 \rangle [\sigma_-, \rho_s(t - t') \sigma_+] - \langle n(\omega) \rangle [\sigma_-, \sigma_+ \rho_s(t - t')] \big) \Big],
\end{aligned}$$

where $n(\omega)$ is the boson distribution function. However, unlike in the previous problem,
we now have justification to assume near constancy of the density operator over the time
scales at which the addition of uncorrelated bath contributions decays to zero. (Think
why this is equivalent to the Weisskopf-Wigner assumption of frequency independent
$\rho g^2$.) Doing the integral, we obtain the master equation (11.110), where $\langle n \rangle = \langle n(\epsilon) \rangle$ is
the mean number of bath quanta at the resonance energy and the decay rate is given
by Eq. (11.112). Solution of this equation obtains the population imbalance

$$\Delta\rho(t) = -\frac{1}{2\langle n \rangle + 1} + \left( \Delta\rho(0) + \frac{1}{2\langle n \rangle + 1} \right) e^{-(2\langle n \rangle + 1)\Gamma t}.$$

In the particular case of initial occupancy of $|a\rangle$, $\Delta\rho(0) = 1$, and zero temperature,
$\langle n(\epsilon) \rangle = 0$, this reduces to the results obtained in (a).

Summarizing our results, we have found that in the multi-mode case the dynamics
of the system becomes effectively irreversible. Technically, “multi-mode” means that we
must be able to assume near constancy of the spectral density of bath modes over scales
$\sim \Gamma$, i.e. the rate at which the density operator changes. Under this condition, the Marko-
vian approximation (equivalent to the Weisskopf-Wigner approximation) becomes jus-
tified and the quantum master equation provides a convenient alternative to the direct
solution of the Schrödinger equation.
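
As a consistency check on the master equation (11.110), note that with $\rho_a + \rho_b = 1$ it implies $\partial_t \Delta\rho = -\Gamma\,(1 + (2\langle n \rangle + 1)\Delta\rho)$, which relaxes exponentially to $-1/(2\langle n \rangle + 1)$. The following sketch compares this closed form with a simple forward Euler integration of the two rate equations; the function names and the step count are illustrative assumptions.

```python
import numpy as np

def imbalance_closed_form(t, gamma, n_mean, d0):
    """Solution of d(Delta rho)/dt = -gamma * (1 + (2 n_mean + 1) * Delta rho)."""
    stationary = -1.0 / (2.0 * n_mean + 1.0)
    return stationary + (d0 - stationary) * np.exp(-(2.0 * n_mean + 1.0) * gamma * t)

def imbalance_euler(t_final, gamma, n_mean, d0, steps=20000):
    """Forward Euler integration of the rate equations for rho_a and rho_b."""
    rho_a, rho_b = (1.0 + d0) / 2.0, (1.0 - d0) / 2.0
    dt = t_final / steps
    for _ in range(steps):
        flow = gamma * (n_mean * rho_b - (n_mean + 1.0) * rho_a)  # d rho_a / dt
        rho_a, rho_b = rho_a + dt * flow, rho_b - dt * flow
    return rho_a - rho_b

if __name__ == "__main__":
    for n_mean in (0.0, 0.5, 2.0):
        print(n_mean,
              imbalance_euler(3.0, 1.0, n_mean, 1.0),
              imbalance_closed_form(3.0, 1.0, n_mean, 1.0))
```

At zero temperature and for an initially excited atom the closed form reduces to $-1 + 2e^{-\Gamma t}$, in agreement with the Weisskopf-Wigner result.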


11.9.3 Keldysh theory of the Coulomb blockade 

(Recapitulate Section 11.6 before turning to this problem. If not stated otherwise, the notation of that 
section will be used throughout.) In this problem, we consider the out of equilibrium quantum dot 
discussed in Section 11.6 in a regime of near isolation from its environment (“leads”). Under these 
conditions, charge on the dot is quenched - the Coulomb blockade. We here discuss the dynamics of 
the formation of a Coulomb blockaded state, and its response to an external voltage bias. 

In Problems 6.7 and 6.7 we considered an equilibrium quantum dot in a state of perfect or 
near isolation from its environment. At low temperatures, the dot admits only the integer 
quantum of charges that minimizes its capacitive energy. This Coulomb blockade manifests 
itself in the partition function of the isolated quantum dot, 

$$Z = \sum_n e^{-\beta E_c (n - N_0)^2}, \qquad (11.113)$$

where the optimal number of charges is determined by the parameter $N_0 \in \mathbb{R}$ (which may
be set by changing external gate voltages) and $E_c$ is the charging energy.
At first sight, it may appear strange that the Keldysh description of even this simplest sig-
nature of the Coulomb blockade is far from trivial! To understand why this is so, recall that
the dynamics on the Keldysh contour builds on the thermal distribution of a non-interacting
theory, while the partition sum above describes the thermal state of an interacting system.
We thus need to describe the passage between a non-interacting and an interacting equilib-
rium state. Naively, one might hope that the interpolation between the two states occurs
as interactions are gradually switched on in the dynamical evolution along the Keldysh
contour. But this is not so. The switching on of interactions results in an increase in energy
of the $N$-particle sector of Fock space; however, the distribution functions controlling the
occupation of single particle states are left unaltered (think about this point). We need to
include a mechanism of relaxation-by-interaction to effect the formation of a thermal state
similar to that described by Eq. (11.113). Once the mechanism of thermalization has been
understood, we can describe structures way beyond the partition function Eq. (11.113). For
example, we may explore what happens to the Coulomb blockade when the system is biased
out of equilibrium by external voltage, or we may explore the dynamics of the approach to
equilibrium after a sudden change in the state of the system, etc.

But what, then, causes thermalization in the present context? It turns out that the most
direct channel of relaxation is the coupling to the leads. Indeed, we have seen in Sec-
tion 11.6.1 that the dot-lead coupling of the interacting systems reads (symbolic notation)
where $\psi_d$ are the fermion fields of the dot, $\psi_x$, $x = L, R$, those of the leads, and $\phi$ is the
Hubbard-Stratonovich field of the interaction. This means that a quasiparticle entering the
dot creates an excitation in the field $\phi$ (physically: a voltage fluctuation). That fluctuation
may in turn create a particle-hole excitation (via the same coupling vertex) in the dot/lead
system. We conclude that the coupling vertices provide a mechanism for the excitation of
particle-hole pairs, and hence of relaxation in energy. Below we explore the formation of
the Coulomb blockade out of these processes, and the generalization to a nonequilibrium
state. To keep the discussion simple, we assume vanishing gate voltage $N_0 = 0$, and zero
temperature $T = 0$ throughout. (Exercise: you may find it instructive to generalize to finite
gate voltage and temperatures.)


(a) Our first step is purely technical: observing that at weak coupling to the external world,
$g_x \ll 1$, fluctuations in the field $\phi$ are strong, we will trade the integration over $\phi$ for
an integration over all charge states of the dot. (Recall that “charge” is the degree of
freedom conjugate to the phase $\phi$.) To this end, expand the action Eq. (11.79) in the
tunneling action Eq. (11.82) and integrate over the phase degrees of freedom (hint: you
may find it convenient to switch back to the contour variables $\phi_\pm$ before doing the inte-
gration, and mind the quantization condition $n_c(0) \in \mathbb{Z}$) to obtain the representation
of the Keldysh partition function


.2 = V A -! 


M 




(11.114) 
where $\sum_{\{\sigma\}}$ is a sum over sign configurations $\sigma \in \{-1, 1\}^{2m}$, the integration measure
$Dt = \prod_{k=1}^{2m} dt_k$, and the charge profiles

$$n_\sigma(t) = n + \sigma \sum_{k=1}^{2m} (-)^k \Theta(t - t_k) \, \delta_{\sigma_k, \sigma}. \qquad (11.115)$$

Finally, the matrix elements of the kernel L are defined as 

^ (aa"Z K + crS++£7 , S“) , (11.116) 

where the self energies $\Sigma^{K,\pm}$ have been introduced in Eq. (11.82).

This representation expresses the partition function as a sum over quasiparticle in- 
and out-tunneling events, connected by elements of the kernel L. We next need to make 
physical sense of this expansion. 

(b) The temporal entanglement of tunneling events makes a closed computation of the
partition function impossible. However, for sufficiently small values of the tunneling
conductance, tunneling processes containing intersecting or nested “propagator lines”
$L(t, t')$ are negligibly small. Estimate the temporal range of
the propagator $L$ to derive a criterion for the applicability of the non-interacting
blip approximation (NIBA) wherein tunneling events occur in a strictly sequential
manner (cf. the figure.)

(c) The lack of entanglement of tunneling events in the NIBA makes the computation of
the Keldysh partition function a lot easier. The basic picture now is that occasional
charge tunneling events (so-called blips in the jargon of the field 33) are interspersed
into long periods of time wherein the charge contour profile stays in a diagonal state,
$n_+ = n_- = n_c = n$ (“sojourns”). During a blip, the quantum component $n_+ - n_- = n_q \in \{-1, 0, 1\}$
jumps to a value $\pm 1$, depending on the configuration $(\sigma_{2k-1}, \sigma_{2k})$
of the tunneling event, and the sign of the time difference $t_{2k-1} - t_{2k}$ (see the figure
below).

Building on this structure, and assuming zero biasing, $V = 0$, derive a master equation
for the quantity $P(n,t) \equiv P(n,t|n_0,0)$, i.e. the probability that the system evolves into
a charge state $(n_+, n_-) = (n, n)$ at time $t$, provided it started in $(n_0, n_0)$ at $t = 0$.
To this end, interpret $P(n,t)$ as the Keldysh field integral Eq. (11.114), subject to the
constraint (cf. Eq. (11.115)) $n_\sigma(t) = n$ and $n_\sigma(0) = n_0$. Relate $P(n,t)$ to $P(n', t - \Delta t)$,
where $\Delta t \gg E_c^{-1}$ is much bigger than the typical duration of blips, yet smaller than the
average spacing between blips, $\Delta t \ll (gE_c)^{-1}$. Apply a continuum approximation
$(\Delta t)^{-1}(P(n,t) - P(n, t - \Delta t)) \simeq \partial_t P(n,t)$ to obtain the master equation

$$\partial_t P(n,t) = \left[ (\hat E_{+1} - 1) W_{n,n-1} + (\hat E_{-1} - 1) W_{n,n+1} \right] P(n,t), \qquad (11.117)$$

where $\hat E_{\pm 1} f(n) = f(n \pm 1)$ are charge raising and lowering operators, the transition rates,

$$W_{n,n\pm 1} = \frac{g}{\pi} E_c(n, n \pm 1) \, \Theta(E_c(n, n \pm 1)),$$

and

$$E_c(n, n') = E_c (n^2 - n'^2), \qquad (11.118)$$

is the relative energy of different charging states.


33 U. Weiss, Quantum Dissipative Systems (World Scientific Publishing, 1993).

(d) Discuss the approach to a Coulomb blockaded state, as described by the master equation 
(11.117). 

(e) Generalize to the case of finite bias voltage V. Will the ground state occupancy change? 

Answer: 



(a) Expressed in terms of the contour representation $\phi_c = (\phi_+ + \phi_-)/2$, $\phi_q = \phi_+ - \phi_-$, the
charging contribution to the action $S_c = S|_{g=0}$ reads as

$$S_c[n, \phi] = \int dt \left( n_+ \partial_t \phi_+ - n_- \partial_t \phi_- - E_c (n_+^2 - n_-^2) \right),$$

where we have introduced

$$n_\pm = n_c \pm \frac{n_q}{2}.$$

The quantization condition on $n_c$ translates to $n_+(0) = n_-(0) \in \mathbb{Z}$. The relative sign
change between the first two terms tells us that the operator $e^{i\phi_+}$ raises the charge
on the upper contour by one, $n_+ \to n_+ + 1$, while $e^{i\phi_-}$ lowers the charge on the lower
contour by one, $n_- \to n_- - 1$. To make this action explicit, we first transform the
tunneling action Eq. (11.84) to contour fields,

StoM = f f ’ 

where the matrix kernel L = {L aa i}, a, a' = ±1 is defined in Eq. (11.116). We may now 
expand exp(«iS' tun ) in powers of the coupling constant to obtain the series 


( ,‘ s [<••>; 



£ / fj L a2k _ ua2k (t2^t-t 2k ), 

{<T} J k= 1 
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I'll {t2k - t2k-l) 



Figure 11.7 A configuration contributing to the expansion of the tunneling action: charge tunneling
events at times $\ldots, t_{2k-1}, t_{2k}, \ldots$ are weighted by matrix elements of the dissipation kernel.


where $\sigma \in \{-1, 1\}^{2m}$ are sign configurations and $Dt = \prod_{k=1}^{2m} dt_k$. The expansion weights
in- and out-tunneling events at times $t_{2k-1}$ and $t_{2k}$, respectively, with elements of the
kernel $L$ (cf. Fig. 11.7.) An interpretation in terms of charge tunneling is readily estab-
lished by integration over the phase fields. Integrating the expansion above against the
charging action $S_c$ we obtain the constraint $\partial_t n_\sigma = \sigma \sum_{k=1}^{2m} (-)^k \delta(t - t_k) \delta_{\sigma_k, \sigma}$. Substituting
the solutions of this constraint, Eq. (11.115), into the charging action, we obtain
the representation Eq. (11.114).

(b) Consider the kernel $L(\omega)$ at $V = 0$ (exercise: confirm that at finite $V$ the temporal
range of $L$ shrinks below its $V = 0$ value. This means that at $V = 0$ we obtain the
most conservative estimates for the validity of the NIBA). The scaling $L(\omega) \sim \omega$ (cf.
Eq. (11.83)) implies $L(t) \sim t^{-2}$. This means that a charge tunneling event carries the
statistical weight $\sim \int dt \, e^{\pm iE_c t} t^{-2} \sim E_c$. Its duration can be estimated as

$$\delta t = \frac{\int dt \, e^{\pm iE_c t} \, t^{-2} \, t}{\int dt \, e^{\pm iE_c t} \, t^{-2}} \sim E_c^{-1}.$$

The total statistical weight of a tunneling event occurring somewhere in a time window of
duration $t_0$ is then given by $\sim gE_c t_0$. This means that the average number of tunneling
events in $t_0$ is

$$\langle m \rangle \simeq \frac{\sum_m \frac{m}{m!} (gE_c t_0)^m}{\sum_m \frac{1}{m!} (gE_c t_0)^m} = gE_c t_0.$$

The temporal spacing between events follows as $t_0/\langle m \rangle \sim (gE_c)^{-1}$, and this relates
to the duration of the event as $t_0/(\langle m \rangle \delta t) \sim g^{-1}$: at low tunneling $g \ll 1$, the spacing
between events exceeds their duration by far, and an approximation treating events in
an uncorrelated sequential order becomes justified.
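
The counting argument of part (b) is that of a Poisson distribution: if a configuration with $m$ events carries the weight $x^m/m!$ with $x = gE_c t_0$, the mean number of events equals $x$. A minimal numerical sketch of this statement follows; the function name `mean_event_number` and the truncation `m_max` are illustrative assumptions.

```python
def mean_event_number(x, m_max=200):
    """Mean of m under the weights x**m / m!, summed up to m_max.

    The weights are built recursively (w_m = w_{m-1} * x / m) to avoid
    forming large factorials explicitly.
    """
    weight = 1.0          # m = 0 term
    numerator = 0.0
    denominator = 1.0
    for m in range(1, m_max + 1):
        weight *= x / m
        numerator += m * weight
        denominator += weight
    return numerator / denominator

if __name__ == "__main__":
    g, e_c, t0 = 0.05, 1.0, 100.0        # sample values with g << 1
    x = g * e_c * t0
    m_mean = mean_event_number(x)
    print(m_mean, t0 / m_mean, 1.0 / (g * e_c))  # mean count, spacing, (g E_c)^-1
```

With these sample values the spacing $t_0/\langle m \rangle$ comes out as $(gE_c)^{-1}$, which exceeds the blip duration $\sim E_c^{-1}$ by the factor $g^{-1}$.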

(c) For $E_c^{-1} \ll \Delta t \ll (gE_c)^{-1}$, the increment of $P$ in the time window $[t - \Delta t, t]$ will be
determined by zero or one blip processes:

$$P(n,t) \simeq P(n, t - \Delta t) + \sum_{n'=n-1}^{n+1} C_{nn'} P(n', t - \Delta t),$$

where the coefficients $C_{nn'}$ are one-blip transition probabilities. An individual blip is
characterized by its sign profile $(\sigma, \sigma')$ (where the figure on p. 762 shows that the con-
nection between the signs and the post-blip increment in classical charge is given by
$(\pm, \pm): n \to n$, $(+, -): n \to n - 1$, $(-, +): n \to n + 1$), the center time of the
blip $t_0 \in [t - \Delta t, t]$ and its duration, $s$. Comparison with Eq. (11.114) shows that, e.g.

Cn,n - i = j dse*' E ‘ :( "“ 1 ’" ) 5 L_ + (s) = l ^ t L_ + (E c (n- l,n)) 

= "'f / (->:* - >’ + $ >1 ){E c (n 1 • /0) = — £ c (n- l,n)0(E c (n- l,n)), 

2 7T 

where the prefactor $\Delta t$ results from the integration over the center time, and we intro-
duced the relative charging energy Eq. (11.118). In an analogous manner, we obtain

$$C_{n,n+1} = \frac{g\Delta t}{\pi} E_c(n+1, n) \, \Theta(E_c(n+1, n)),$$

$$C_{nn} = \frac{g\Delta t}{\pi} \left( -E_c(n, n+1) \, \Theta(E_c(n, n+1)) - E_c(n, n-1) \, \Theta(E_c(n, n-1)) \right),$$

where in computing $C_{nn}$ it is important to keep in mind that $\Sigma^\pm(t) \propto \Theta(\pm t)$ carry
retarded and advanced causality. Substituting this result into the evolution equation
above, dividing by $\Delta t$ and taking the continuum limit, we obtain the master equation
(11.117).

(d) For our current parameter setting, $N_0 = 0$, $V = T = 0$, the charge state $n = 0$ defines
the energetic ground state. The master equation describes relaxation to this state at
rates const. $\times \, \Gamma$, where $\Gamma = gE_c/\pi = 1/RC$ is inversely proportional to the $RC$-time
of the system (the proportionality of these rates to $g$ underpins the role of the lead-
coupling as the source of relaxation) and the $n$-dependent constant prefactor increases
with the distance off the ground state, $n = 0$.

To show the relaxation in more explicit terms, let us assume that the relaxation has
progressed to a level where only the charge states $n = -1, 0, 1$ remain significantly populated.
The restriction of the master equation to this sub-system reads

$$\begin{aligned}
\partial_t P(0,t) &= \Gamma \left( P(1,t) + P(-1,t) \right), \\
\partial_t P(\pm 1, t) &= -\Gamma P(\pm 1, t),
\end{aligned}$$

which is solved by $P(\pm 1, t) = e^{-\Gamma t} P(\pm 1, 0)$ and

$$P(0,t) = 1 - P(1,0) - P(-1,0) + \Gamma \int_0^t dt' \, e^{-\Gamma t'} \left( P(1,0) + P(-1,0) \right).$$

This solution describes the relaxation towards the state $P(n,t) \to \delta_{n,0}$ as $t \to \infty$.
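
The relaxation described in part (d) can also be followed numerically for a larger set of charge states. The sketch below is illustrative only: it assumes $N_0 = 0$, $V = T = 0$, truncates the charge to $|n| \le n_{\max}$, measures rates in units of $\Gamma = gE_c/\pi$ so that the rate for $n \to n'$ is $\Gamma\,(n^2 - n'^2)\,\Theta(n^2 - n'^2)$, and integrates the master equation (11.117) with a fourth-order Runge-Kutta step. The names `rate_matrix` and `evolve` are hypothetical.

```python
import numpy as np

def rate_matrix(n_max, gamma=1.0):
    """Generator M of dP/dt = M P for charge states n = -n_max..n_max.

    Transitions n -> n +/- 1 occur at rate gamma * (n^2 - n'^2) when the
    charging energy decreases, and are forbidden otherwise (T = V = 0).
    """
    ns = np.arange(-n_max, n_max + 1)
    dim = len(ns)
    m = np.zeros((dim, dim))
    for i in range(dim):
        for j in (i - 1, i + 1):
            if 0 <= j < dim:
                rate = gamma * max(int(ns[i]) ** 2 - int(ns[j]) ** 2, 0)
                m[j, i] += rate   # gain of state j from state i
                m[i, i] -= rate   # loss of state i
    return ns, m

def evolve(p0, m, t_final, steps=20000):
    """Integrate dP/dt = M P with a classical fourth-order Runge-Kutta scheme."""
    p = np.array(p0, dtype=float)
    dt = t_final / steps
    for _ in range(steps):
        k1 = m @ p
        k2 = m @ (p + 0.5 * dt * k1)
        k3 = m @ (p + 0.5 * dt * k2)
        k4 = m @ (p + dt * k3)
        p = p + dt / 6.0 * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
    return p

if __name__ == "__main__":
    ns, m = rate_matrix(3)
    p0 = np.zeros(len(ns))
    p0[ns == 3] = 1.0                     # start far from the ground state
    for t in (0.1, 1.0, 5.0, 20.0):
        print(t, np.round(evolve(p0, m, t), 4))
```

Starting from any charge state, the probability should cascade down to $n = 0$, with the last step $\pm 1 \to 0$ occurring at the rate $\Gamma$, in line with the three-state solution above.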

(e) Finite voltages affect the theory through a redefined Keldysh self energy $\Sigma^K$. Compar-
ison with Eq. (11.83) shows that

S K (V) - z K (0) = ~.(r- M)e(K - M). 

The transition rates thus change to

$$W_{n,n\pm 1} \to W_{n,n\pm 1} + \frac{g}{4\pi} \left( |V| - |E_c(n \pm 1, n)| \right) \Theta\left( |V| - |E_c(n \pm 1, n)| \right).$$

This change is easy to interpret: for voltages $|V| < |E_c(n \pm 1, n)|$ smaller than the charg-
ing energies, the transition rates of the unbiased problem remain unaltered. This means
that the excess energy of external charge carriers $\sim V$ needs to exceed the charging
energy to make charge transport through the dot possible; the Coulomb blockade is
inert against moderate biasing. 34 At large voltages, $|V| \gg |E_c(n \pm 1, n)|$, the tunneling
rates cross over to values $\sim gV/4\pi$. For these rates, different charging states become
equally populated, at a rate $\sim gV$ set by the average current through the dot interfaces.


34 Closer inspection shows that this statement is not entirely correct: the NIBA neglects tunneling processes of
higher order in a $g$-expansion. Among these, there are some (so-called elastic co-tunneling processes) for which
the extra power in $g$ gets rewarded by a suppression in $E_c$ that is only algebraic (as compared to the complete
$\sim \exp(-E_c/T) \to 0$ suppression, for $T \to 0$, of the direct tunneling processes considered here). Although the description
of elastic co-tunneling goes well beyond the scope of this text, we note that its contribution is proportional
to the mean single particle level spacing, $\Delta$, of the dot, i.e. the results above hold true in the double limit,
$T, \Delta \to 0$.